Popular articles

Are Protein Language Models Compute Optimal?

7 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

7 months ago

Sakuga-42M Dataset: Scaling Up Cartoon Research

8 months ago

Pretraining on the Test Set Is All You Need

9 months ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

7 months ago

Scaling MLPs: A Tale of Inductive Bias

7 months ago

Observational Scaling Laws and the Predictability of Language Model Performance

8 months ago

Scaling MLPs: A Tale of Inductive Bias

8 months ago

Scaling Laws for Transfer

8 months ago

Grandmaster-Level Chess Without Search

8 months ago

The Quantization Model of Neural Scaling

8 months ago