[Use Case] Local-LLM + Topic model + Langchain + ChromaDB
In this post, I put together a workflow that classifies collected patent data with a topic model, builds a ChromaDB database for each topic, sets up RAG with Langchain, and has a local LLM answer questions.
As shown in the diagram above, the flow proceeds through the following steps.
1. Extract concepts from the patents
2. Classify the extracted concepts
3. Create a database for each topic
4. Set up RAG
0. Environment
1. Concept Extraction
For the patent data, I reuse the 857 results returned by a Google Patents search for "(semiconductor) country:US after:priority:20230101 language:ENGLISH", which were collected in a previous post.
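The code below assumes these results are already loaded into a pandas DataFrame df with url and description columns. A minimal sketch of that loading step, assuming the previous results were saved as a CSV (the file name and column names here are assumptions; adjust them to your own export):

import pandas as pd
# Hypothetical dump of the 857 collected patents; the later code only needs 'url' and 'description'
df = pd.read_csv(f"{path}/patents_semiconductor_us_2023.csv")
print(len(df), df.columns.tolist())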
As shown below, each patent's objective and key concepts are extracted from its description.
# Specify the model
import pickle
import torch
from tqdm import tqdm
from transformers import pipeline
pipe = pipeline("text-generation", model="HuggingFaceH4/zephyr-7b-beta", torch_dtype=torch.bfloat16, device_map="auto")
# Extract the objective and key concepts of each patent
concepts_texts = []
for i in tqdm(range(len(df))):
    text = df['description'][i][:10000]  # use the first 10,000 characters of the description
    try:
        messages = [
            {
                "role": "system",
                "content": "You are an excellent technician, have a deep understanding of patents in particular, and always give careful, accurate and clear replies.",
            },
            {"role": "user", "content": f"What are the main objectives and key concepts of the patents shown next?\n-----\n{text}"},
        ]
        prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
        outputs = pipe(prompt, max_new_tokens=2048, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
        # Keep only the text after the '<|assistant|>' tag (13 characters plus the following newline)
        concept = outputs[0]["generated_text"][outputs[0]["generated_text"].find('<|assistant|>')+14:]
        concepts_texts.append(concept)
        pickle.dump(concepts_texts, open(f"{path}/concepts.pkl", 'wb'))  # save after each patent
    except Exception:
        concepts_texts.append("")
2. Concept Classification
BERTopic is convenient for this, so I use it here.
For the embeddings, I use bge-large-en-v1.5 from BAAI (Beijing Academy of Artificial Intelligence).
import pickle
from bertopic import BERTopic
from bertopic.representation import KeyBERTInspired, MaximalMarginalRelevance

# Load the concepts extracted in step 1
docs = pickle.load(open(f"{path}/concepts.pkl", 'rb'))

# Topic representations: KeyBERT-inspired keywords plus an MMR-diversified "Aspect"
main_representation_model = KeyBERTInspired()
aspect_representation_model = [KeyBERTInspired(top_n_words=30),
                               MaximalMarginalRelevance(diversity=.5)]
representation_model = {
    "Main": main_representation_model,
    "Aspect": aspect_representation_model
}

model = 'BAAI/bge-large-en-v1.5'
topic_model = BERTopic(verbose=True, embedding_model=model,
                       min_topic_size=10,
                       representation_model=representation_model
                       )
topics, ini_probs = topic_model.fit_transform(docs)
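Before naming the topics it is worth a quick look at what BERTopic found; the columns used in the next step (Representative_Docs and the "Aspect" representation) all appear in get_topic_info():

# Overview of the discovered topics; row 0 is normally the outlier topic -1
topic_model.get_topic_info()[["Topic", "Count", "Name", "Aspect"]].head(10)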
Each topic is then given a name by zephyr-7B-α.
Since this part runs on CPU, I use TheBloke's 8-bit quantized model.
from llama_cpp import Llama
model_path = r".\LlamaCPP\model"  # folder where the gguf model is saved
model_name = "zephyr-7b-alpha.Q8_0.gguf"
llm = Llama(model_path=f"{model_path}/{model_name}", n_ctx=4096)

topic_info = topic_model.get_topic_info()
zephyr_keywords = []
for i in tqdm(range(len(topic_info))):
    DOCUMENTS = topic_info["Representative_Docs"][i][0]
    KEYWORDS = topic_info["Aspect"][i]
    system_prompt = """
<s>[INST] <<SYS>>
You are a helpful, respectful and honest assistant for labeling topics.
<</SYS>>
"""
    example_prompt = """
I have a topic that contains the following documents:
- Traditional diets in most cultures were primarily plant-based with a little meat on top, but with the rise of industrial style meat production and factory farming, meat has become a staple food.
- Meat, but especially beef, is the word food in terms of emissions.
- Eating meat doesn't make you a bad person, not eating meat doesn't make you a good one.
The topic is described by the following keywords: 'meat, beef, eat, eating, emissions, steak, food, health, processed, chicken'.
Based on the information about the topic above, please create a short label of this topic. Make sure you to only return the label and nothing more.
[/INST] Environmental impacts of eating meat
"""
    main_prompt = f"""
[INST]
I have a topic that contains the following documents:
{DOCUMENTS}
The topic is described by the following keywords: '{KEYWORDS}'.
Based on the information about the topic above, please create a short label of this topic. Make sure you to only return the label and nothing more.
[/INST]
"""
    prompt = system_prompt + example_prompt + main_prompt
    output = llm(prompt, temperature=0.1, echo=True)
    # echo=True returns the prompt as well, so keep only the text after the last newline
    keyword = output['choices'][0]['text'][output['choices'][0]['text'].rfind("\n")+1:]
    zephyr_keywords.append(keyword)
topic_model.set_topic_labels(zephyr_keywords)
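It is worth eyeballing the generated labels before they become folder names; a quick check against BERTopic's default topic names:

# Compare the default BERTopic names with the zephyr-generated labels
for default_name, label in zip(topic_model.get_topic_info()["Name"], zephyr_keywords):
    print(default_name, "->", label.strip())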
Create a folder for each of the generated topics and place a txt file with each patent's concept in it.
import os
# Create a folder for each topic label
for keyword in zephyr_keywords:
    os.makedirs(f"{path}/topic folder/{keyword.strip()}", exist_ok=True)

# Save each patent's concept as a txt file in its topic's folder
df['topic'] = topics
df['concept'] = docs
# Map each topic id to its label (row 0 of get_topic_info() is normally the outlier topic -1)
label_for_topic = dict(zip(topic_info["Topic"], zephyr_keywords))
for j in range(len(df)):
    name = df.url[j][df.url[j].rfind("US"):-3]  # patent number taken from the Google Patents URL
    outpath = f"{path}/topic folder/{label_for_topic[df.topic[j]].strip()}"
    with open(f'{outpath}/{name}.txt', 'w', encoding='utf-8') as f:
        f.write(df.concept[j])
3. Database Creation
Create the vector databases used for RAG with Langchain and ChromaDB.
from langchain.document_loaders import DirectoryLoader
from langchain.document_loaders import TextLoader
txt_path = f"{path}/topic folder/{zephyr_keywords[0].strip()}"  # point at the first topic's data
loader = DirectoryLoader(txt_path, glob="**/*.txt", loader_cls=TextLoader)
docs = loader.load()
len(docs)
# Clean up the source metadata: keep only the file name (the patent number) without the extension
for doc in docs:
    doc.metadata['source'] = doc.metadata['source'][doc.metadata['source'].rfind("\\")+1:-4]
# Split the texts
from langchain.text_splitter import RecursiveCharacterTextSplitter
text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=20)
texts = text_splitter.split_documents(docs)
# Create and persist the database
from langchain.embeddings import HuggingFaceBgeEmbeddings
from langchain.vectorstores import Chroma
model_name = "BAAI/bge-large-en-v1.5"
encode_kwargs = {'normalize_embeddings': True}
embedding_function = HuggingFaceBgeEmbeddings(
    model_name=model_name,
    model_kwargs={'device': 'cuda'},
    encode_kwargs=encode_kwargs
)
persist_directory = f'{txt_path}/vdb'  # where to save the database
vectordb = Chroma.from_documents(documents=texts,
                                 embedding=embedding_function,
                                 persist_directory=persist_directory)
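The persisted database can be reopened in a later session without re-embedding; a minimal sketch, assuming the same embedding model is supplied when loading:

# Reload the persisted database (the same embedding function must be provided)
vectordb = Chroma(persist_directory=persist_directory,
                  embedding_function=embedding_function)
vectordb.similarity_search("FinFET fabrication", k=1)  # quick smoke test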
4. RAG Setup
Set up the LLM used for question answering. This time I use Starling-LM-7B-alpha.
import torch
from transformers import pipeline
from langchain.llms import HuggingFacePipeline
pipe = pipeline("text-generation", model="berkeley-nest/Starling-LM-7B-alpha", torch_dtype=torch.bfloat16,
                device_map="auto", max_new_tokens=512,
                repetition_penalty=1.15
                )
llm = HuggingFacePipeline(pipeline=pipe)
Set up Langchain's RetrievalQA.
Setting chain_type to "map_reduce" makes the LLM first generate an answer for each text chunk selected by the retriever (map step) and then combine those answers into a single final answer (reduce step).
from langchain.chains import RetrievalQA
retriever = vectordb.as_retriever(search_kwargs={"k": 5})
qa_chain = RetrievalQA.from_chain_type(llm=llm,
                                       chain_type="map_reduce",
                                       retriever=retriever,
                                       return_source_documents=True
                                       )
Question:
query = "What are the challenges of the 3D FinFET fabrication process"
llm_response = qa_chain(query)
print(llm_response['result'])
Answer:
Display the sources used for the answer.
[source.metadata for source in llm_response["source_documents"]]
You can also generate an answer using a prompt template with RetrievalQAWithSourcesChain.
from langchain.chains import RetrievalQAWithSourcesChain
from langchain import PromptTemplate, LLMChain
template = """
{summaries}
{question}
"""
retrieval_qa_chain = RetrievalQAWithSourcesChain.from_chain_type(
    llm=llm,
    chain_type="stuff",
    retriever=retriever,
    return_source_documents=True,
    chain_type_kwargs={
        "prompt": PromptTemplate(
            template=template,
            input_variables=["summaries", "question"],
        ),
    },
)
llm_response = retrieval_qa_chain(query)
print(llm_response['answer'])
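Because return_source_documents=True is set here as well, the sources can be inspected in the same way as before; RetrievalQAWithSourcesChain additionally returns a 'sources' field:

# Patent numbers the answer was based on, plus the full source metadata
print(llm_response['sources'])
[source.metadata for source in llm_response["source_documents"]]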