Archive

2025 (2)
  July (1)
    Transformers Don't Need LayerNorm at Inference Time (July 23, 2025)
  June (1)
    How LLMs go from base models to assistants (June 24, 2025)
2024 (1)
  October (1)
    ARENA Capstone: Hyperparameter tuning for MELBO (October 5, 2024)