人気の記事一覧

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

8か月前

Better & Faster Large Language Models via Multi-token Prediction

8か月前