Popular Articles

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

5 months ago

Better & Faster Large Language Models via Multi-token Prediction

6 months ago