Popular Articles

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

9 months ago

sDPO: Don't Use Your Data All at Once

10 months ago