【日本語LLM】Ollamaで利用可能な日本語対応embeddingモデル【Ruri】

2024年10月7日 02:47

はじめに

少しおバカさんのローカルのLLM（Large Language Model）を利用する上で、重要になる技術がRAG（Retrieval-Augmented Generation）です。具体的な手法はさまざまですが、LLM推論時の辞書のような役割をします。

ローカルLLMをGUIで利用できる（Ollama）Open WebUIでは、RAGを利用できますが、利用するためには本体LLM以外に「embeddingモデル」「rerankerモデル」が必要になります。

しかし、ローカルで利用できる日本語対応モデルが少ないのが現状です。

最近Ollamaライブラリに、日本語専用のembeddingモデルが（たぶん初めて）登録されたので紹介と動作レビューしたいと思います。

【PR】Open WebUIの詳細な導入方法や使い方は下記事で紹介しています。

Ruri

Ollamaライブラリに登録されたこちらのkun432氏のモデルを利用します。

説明文に記載のhuggingfaceのページはこちら、名古屋大学研究室のモデルですね。

論文はこちら、

Abstract
We report the development of Ruri, a series of Japanese general text embedding models. While the development of general-purpose text embedding models in English and multilingual contexts has been active in recent years, model development in Japanese remains insufficient. The primary reasons for this are the lack of datasets and the absence of necessary expertise. In this report, we provide a detailed account of the development process of Ruri. Specifically, we discuss the training of embedding models using synthesized datasets generated by LLMs, the construction of the reranker for dataset filtering and knowledge distillation, and the performance evaluation of the resulting general-purpose text embedding models.

引用：https://huggingface.co/papers/2409.07737