Popular articles

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

6 months ago

sDPO: Don't Use Your Data All at Once

7 months ago