
EP2: The Physics of LLMs - Why Size Matters & When It Doesn't
Scaling Laws & Compute Budget: The sources extensively cover the foundational empirical scaling laws, showing that a model's performance improves smoothly as a power-law with model size, dataset size, and compute budget.
Show notes






