【通訳案件準備】生成AI + Text to Speechでやってみる
はじめに
こんにちは。サイマルアカデミーに通いはじめて8ヶ月が経過したあきらと申します。私は仕事柄、生成AIが流行る前から自然言語処理の技術に関わることが多く、最近ではLLMの技術にも仕事で関わることが多いため、生成AIを毎日イジイジしながら仕事しています。
周囲にヒアリングすると、思ったより生成AIを活用して通訳練習をしている人が少ないようでしたので、「ぼくのかんがえた、きばつなめそっど(仮)」を共有したいと思います。
そもそも①:技術とどう向き合うか
30年前、電子辞書が出てきたときは「紙の辞書の方が記憶に残るから良い」といって避ける大人がたくさんいました。
20年前、「Googleを使って調べた内容を元に卒論を書くのは邪道だ」という大学教授がたくさんいました。
そして今、通訳者が生成AI使うのどうなん?と思われる方もいらっしゃるかもしれません。
新しい技術が登場すると必ずearly adopterからlaggardまでいるわけですが、ほぼ最終的に古いものは使われなくなっていきます。ということで、僕は積極的に生成AIを活用して能力向上に活かすぞー、というマインドセットで日々色々試しています。
そもそも②:案件準備って何するの?
通訳=(1)言語能力+(2)知識+(3)分析力
だとエライ人が通訳学校で言っていました。
加えて、(4)ノートテイキング も重要だと思います。
僕は(2)(3)(4)のパフォーマンスを向上させられるよう、生成AIを使って案件(課題)の予習をしています。(もちろん(1)にもつながる)
ぼくの きばつな めそっど(仮)
生成AIを使って以下の3ステップを踏んでいます。
Step 1: 話者の経歴と考え方を知る
話者の経歴を調査する
話者の専門領域と主な主張を理解する
話者と反対意見を主張する学説について理解する
Step 2: 予想スピーチの音声を生成する
話者のスピーチの7分バージョンを生成AIを活用して作り、"素振り"をする
Step 3: 素振りとしてノートテイキングする
もちろん、与えられた単語リストは頭に入れます。しかし、単語リストを正確に理解するよりも、案件で話される内容の大枠が頭に入っている方が、本番でよりパフォーマンスを出しやすいと感じます。全体の枠組みが理解できていれば理解できない単語の意味を推測して訳出できます。一方で細かい単語を分かっていても話者のスピーチの骨子が頭に入っていないと自信を持って訳せないことがおおくあります。そのため、全体の枠組みを理解することを重要視しています。
以下、英→日の案件を、実際に生成AIを活用して予習する方法をお伝えします。
Step1: 話者の経歴と考え方を知る
まず、通訳案件で話される内容の全体像を知る必要があります。
そのために、以下のinputから以下のoutputを得ます。
input:
話者名と話者情報
スピーチの概要
単語リスト
output:
スピーチの概要(メインメッセージ&サブメッセージ)
スピーカーの経歴
スピーカーの専門領域
プロンプト:話者の経歴とスピーチ骨子を出力
以下のプロンプトを活用します。
# Context
You are an AI assistant preparing for an interpretation assignment. To deliver your best performance, please follow the instructions below.
# Instructions
Process the following input according to the steps and generate the output.
# input
## Speaker name / Information
[話者名と話社情報を記載]
## Speechoverview
[スピーチの概要をここに記載]
## Wordlist
[単語リストをここに記載]
## Steps
Step 1: Read the speech overview.
- Break down the main message and sub-messages of the speech into a bullet-point outline. If there is not enough information, infer what the main message and sub-messages will be with best effort from the presented information.
Step 2: Research the speaker's background
- List the speaker's major accomplishments and milestones in a bullet-point format, starting with the years.
Step 3: Identify the speaker's areas of expertise
- Read the Wordlist for context.
- List the field(s) the speaker is well-known for (multiple if applicable).
- If there are conflicting viewpoints within the field(s), mention them.
- Summarize the speaker's main arguments or positions in their area(s) of expertise.
例:ミヤシャイマーという政治学者のスピーチの場合
仮に、政治学者として有名なミヤシャイマーの案件があったとします。案件の前情報として、「話者の経歴」「スピーチ概要」「単語リスト」が送られてくるはずです。その情報を活用して上記のテンプレートを埋めます。
# Context
You are an AI assistant preparing for an interpretation assignment. To deliver your best performance, please follow the instructions below.
# Instructions
Process the following input according to the steps and generate the output.
# input
## Speaker name / Information
John Joseph Mearsheimer is an American political scientist and international relations scholar, best known for his work on realist theory in international relations. Born on December 14, 1947, Mearsheimer is the R. Wendell Harrison Distinguished Service Professor at the University of Chicago. He graduated from West Point in 1970, served five years as an officer in the U.S. Air Force, and then pursued graduate studies, earning a Ph.D. in government from Cornell University in 1980. Mearsheimer's seminal work, "The Tragedy of Great Power Politics," published in 2001, established him as a leading theorist of offensive realism. His extensive publications and lectures often explore themes of power dynamics, security dilemmas, and the strategic behaviors of great powers in the international system.
## Speechoverview
John Mearsheimer will address the current geopolitical landscape, focusing on the rise of China, the strategic competition between the United States and China, and the implications for global security. He will discuss the structural pressures driving great powers toward conflict, the failures of liberal internationalism, and the strategic calculations of states in a multipolar world. His speech will include references to historical precedents, theoretical frameworks of offensive realism, and analysis of contemporary international relations.
## Wordlist
Realism
Hegemony
Geopolitics
Power dynamics
Security dilemma
Strategic competition
Multipolarity
Balance of power
National interest
Great power
Offensive realism
International system
Strategic calculus
Liberal internationalism
Global security
Economic interdependence
Military capability
Geostrategic
Power transition
National sovereignty
## Steps
Step 1: Read the speech overview.
- Break down the main message and sub-messages of the speech into a bullet-point outline. If there is not enough information, infer what the main message and sub-messages will be with best effort from the presented information.
Step 2: Research the speaker's background
- List the speaker's major accomplishments and milestones in a bullet-point format, starting with the years.
Step 3: Identify the speaker's areas of expertise
- Read the Wordlist for context.
- List the field(s) the speaker is well-known for (multiple if applicable).
- If there are conflicting viewpoints within the field(s), mention them.
- Summarize the speaker's main arguments or positions in their area(s) of expertise.
上記のプロンプトを、ChatGPT 4o に入力すると以下のような返答が来ます。
Step 1: Speech Overview
Main Message
* The current geopolitical landscape is significantly shaped by the rise of China and its strategic competition with the United States, with profound implications for global security.
Sub-Messages
* Structural pressures are driving great powers toward conflict.
* The failures of liberal internationalism in preventing such conflicts.
* The strategic calculations of states in a multipolar world.
* Historical precedents and theoretical frameworks of offensive realism as a lens for understanding these dynamics.
* Analysis of contemporary international relations and power dynamics.
Step 2: Speaker's Background
* 1970: Graduated from West Point.
* 1970-1975: Served as an officer in the U.S. Air Force.
* 1980: Earned a Ph.D. in government from Cornell University.
* 2001: Published "The Tragedy of Great Power Politics," establishing him as a leading theorist of offensive realism.
* Present: R. Wendell Harrison Distinguished Service Professor at the University of Chicago.
* Extensive publications and lectures on power dynamics, security dilemmas, and strategic behaviors of great powers in the international system.
Step 3: Speaker's Areas of Expertise
Fields of Expertise
* Realist theory in international relations
* Offensive realism
* Power dynamics and security dilemmas
* Strategic behaviors of great powers
* Geopolitics and global security
Conflicting Viewpoints
* Realist vs. Liberal Internationalism: Mearsheimer's offensive realism often contrasts with liberal internationalist perspectives that emphasize international cooperation and institutions.
Summary of Main Arguments/Positions
* Offensive Realism: Mearsheimer argues that the anarchic nature of the international system compels great powers to seek dominance for security, leading to inevitable conflicts.
* Power Dynamics: He explores how power dynamics and security dilemmas influence state behavior, with great powers constantly seeking to maximize their influence.
* Geopolitical Competition: Mearsheimer emphasizes the strategic competition between the United States and China, suggesting that this rivalry is driven by structural pressures rather than specific policies or leaders.
* Critique of Liberal Internationalism: He critiques liberal internationalism for failing to account for the inherent conflicts and competition among great powers.
* Multipolarity and Balance of Power: He discusses the implications of a multipolar world and the balance of power strategies that states adopt to ensure their national interests and security.
上記のように、とても正確に調べてくれます。
余談ですが、ミヤシャイマーは「Offensive Realism」という考え方の支持者で、「世界中の国々は、自国の安全保障を目的として覇権争いをし続けるので、紛争は必然的に生じる」というポジションをとっています。反対意見としてLiberalistは「国家間が協力して手を取り合えば、紛争は防止できる」というポジションをとっています。政治界ではこう言った議論がバチバチされているのですが、見事にその内容を出力しています。
私はよく、出力内容の"Conflicting Viewpoints"に特に着目して、どういう考え方の中で、どういうポジションを取っている人なのかを理解するようにしています。それだけで通訳のしやすさが変わります。
Step2: 予想スピーチの音声を生成する
Step2-1: 予想スピーチの原稿を生成
まずはスピーチの原稿を生成します。以下のプロンプトを使います。
長さ約15段落、各段落30秒程度(75 word)、計7分程度となるような原稿を作ります。
input: 話者情報、スピーチ概要 、単語リストに加え、上記出力のSpeechOverviewのMain MessageとSub Messagesの内容
output: 15段落の原稿が出力されます。
プロンプト:スピーチ原稿を生成
以下が生成AIにinputするプロンプトです。
# Instructions
Based on the provided input, generate a speech script by following these steps:
1. Create an outline for the speech using the Speech Overview.
2. Write out the full speech, incorporating EVERY word from the Word List. Be sure to keep the speech interesting but also logically structured.
3. The speech should be in the voice and perspective of the specified Speaker.
4. The speech should be approximately 15 paragraphs long.
5. If the Language is English, each paragraph should contain around 75 words. If the Language is Japanese, each paragraph should contain around 150 letters.
6. The output should be in the same language as the specified Language.
# Input
## Language
English
## Speaker name / Information
[話者名と話社情報を記載]
## Speechoverview
[スピーチの概要をここに記載.]
[さらに、最初のプロンプトから出力されたものを貼り付ける.]
## Wordlist
[単語リストをここに記載]
今回のミヤシャイマーの例を記入すると次のようになります。
# Instructions
Based on the provided input, generate a speech script by following these steps:
1. Create an outline for the speech using the Speech Overview.
2. Write out the full speech, incorporating EVERY word from the Word List. Be sure to keep the speech interesting but also logically structured.
3. The speech should be in the voice and perspective of the specified Speaker.
4. The speech should be approximately 15 paragraphs long.
5. If the Language is English, each paragraph should contain around 75 words. If the Language is Japanese, each paragraph should contain around 150 letters.
6. The output should be in the same language as the specified Language.
# Input
## Language
English
## Speaker name / Information
John Joseph Mearsheimer is an American political scientist and international relations scholar, best known for his work on realist theory in international relations. Born on December 14, 1947, Mearsheimer is the R. Wendell Harrison Distinguished Service Professor at the University of Chicago. He graduated from West Point in 1970, served five years as an officer in the U.S. Air Force, and then pursued graduate studies, earning a Ph.D. in government from Cornell University in 1980. Mearsheimer's seminal work, "The Tragedy of Great Power Politics," published in 2001, established him as a leading theorist of offensive realism. His extensive publications and lectures often explore themes of power dynamics, security dilemmas, and the strategic behaviors of great powers in the international system.
## Speechoverview
John Mearsheimer will address the current geopolitical landscape, focusing on the rise of China, the strategic competition between the United States and China, and the implications for global security. He will discuss the structural pressures driving great powers toward conflict, the failures of liberal internationalism, and the strategic calculations of states in a multipolar world. His speech will include references to historical precedents, theoretical frameworks of offensive realism, and analysis of contemporary international relations.
Main Message
* The current geopolitical landscape is significantly shaped by the rise of China and its strategic competition with the United States, with profound implications for global security.
Sub-Messages
* Structural pressures are driving great powers toward conflict.
* The failures of liberal internationalism in preventing such conflicts.
* The strategic calculations of states in a multipolar world.
* Historical precedents and theoretical frameworks of offensive realism as a lens for understanding these dynamics.
* Analysis of contemporary international relations and power dynamics.
## Wordlist
Realism
Hegemony
Geopolitics
Power dynamics
Security dilemma
Strategic competition
Multipolarity
Balance of power
National interest
Great power
Offensive realism
International system
Strategic calculus
Liberal internationalism
Global security
Economic interdependence
Military capability
Geostrategic
Power transition
National sovereignty
着目いただきたい点としては、Speech Overviewの中に、Step1のプロンプトの出力結果にある"Main Message"と"Sub Messages"を使い回している点です。
すると、ChatGPT 4oでは以下の出力が得られます。
### Outline
- Introduction
- Overview of Geopolitical Landscape
- Rise of China
- Strategic Competition with the U.S.
- Structural Pressures Toward Conflict
- Failures of Liberal Internationalism
- Historical Precedents
- Offensive Realism Framework
- Strategic Calculations of States
- Multipolar World Dynamics
- Analysis of Contemporary International Relations
- Economic Interdependence
- Military Capabilities and Security Dilemma
- Geostrategic Considerations
- Conclusion
### Speech Script
Ladies and gentlemen,
Thank you for joining me today to discuss the current geopolitical landscape. As we witness the rise of China and its strategic competition with the United States, we must understand the profound implications for global security. My aim is to provide a comprehensive analysis using the principles of realism, particularly offensive realism, to elucidate these dynamics.
The geopolitical landscape today is fundamentally shaped by the ascent of China. This rise has introduced significant changes in power dynamics, leading to increased tensions and strategic competition with the United States. Understanding this relationship requires a deep dive into the structural pressures that drive great powers toward conflict.
Historically, the rise of a new great power often disrupts the balance of power. In our case, China's growth challenges U.S. hegemony, creating a classic security dilemma. Both nations, driven by national interest and survival, engage in a strategic calculus that heightens the risk of conflict. This scenario is a stark reminder of the failures of liberal internationalism in mitigating such rivalries.
Liberal internationalism, with its emphasis on cooperation and international institutions, has not succeeded in preventing the competitive behavior of great powers. The theoretical frameworks of offensive realism offer a more accurate lens for understanding these dynamics. Offensive realism posits that great powers are inherently driven to maximize their relative power to ensure security in an anarchic international system.
Throughout history, we see numerous examples of power transitions leading to conflict. The Peloponnesian War, for instance, illustrates how the rise of Athens threatened Sparta, resulting in war. Similarly, the current U.S.-China rivalry can be viewed through the same realist perspective, where strategic competition is inevitable.
States operate under a strategic calculus that prioritizes military capability and economic strength. In a multipolar world, where several great powers exist, this competition becomes even more pronounced. The balance of power shifts constantly, making it imperative for states to adapt their strategies to maintain their national sovereignty and security.
Today's international relations are characterized by multipolarity, where multiple states wield significant power. This multipolar world is complex, with varying alliances and power dynamics. Understanding these intricacies is crucial for navigating the contemporary geopolitical landscape.
Economic interdependence, while touted as a peacekeeping mechanism, also introduces vulnerabilities. The interconnected nature of global economies means that strategic competition can have widespread repercussions. Economic interdependence does not eliminate the strategic competition; instead, it often exacerbates the security dilemma.
Military capability remains a cornerstone of national security. The strategic competition between the U.S. and China includes a significant focus on military build-up and advancements. This arms race contributes to the global security dilemma, where increased military capability by one state prompts others to follow suit.
Geostrategic considerations also play a crucial role in this competition. The South China Sea, for example, is a focal point of U.S.-China rivalry. Control over such strategic regions can significantly enhance a nation's power and influence, further intensifying the competition.
In conclusion, the current geopolitical landscape is a vivid demonstration of the principles of realism, particularly offensive realism. The rise of China and its strategic competition with the United States underscore the structural pressures that drive great powers toward conflict. As we navigate this multipolar world, it is essential to understand these dynamics to ensure global security and stability.
Thank you.
なぜか11パラグラフしか出力されていませんが、内容としてはよくできていると思います。このスピーチを音声で聞いて、通訳のウォームアップができたら良いですよね。しかも案件で必要な単語も全部網羅されているから、単語もコンテキストの中で暗記できちゃう!
Step2-2: 音声を生成
Googleのtext-to-speechを使います。Google Cloudを使えば、無料のクレジットが付与されるので皆さんでも無料でできると思います↓
Gmailでログインして開くと、以下のような画面が表示されます。左側メニューの「Text-to-Speech」の合成をクリックし、テキストボックスにテキストを入力し、下部の言語を選択し、「Synthesize」をクリックすると音源が生成されます。
Synthesizeをクリックすると、以下のように音源が無事生成されます。
上記の「ダウンロード」を押すと、音源がダウンロードできます。
以下に、今回生成した音源の1パラグラフ目のサンプルを置いておきます。
聞いてみてください。かなり自然な音声ではありませんか?
Step3: 素振りとしてノートテイキングする
11個の音源を生成してダウンロードしたら、後は一つずつ練習していきます。試しに1つ目の音源でノートテイキングしていきます。
上記の音源で実際にやってみました↓
メモからリプロダクションして読み上げると:
・みなさん、本日はお集まりいただきましてありがとうございます。
・本日は世界の地政学的な状況についてお話ししたいと思います。
・私たちは中国と米国が戦略的に対立しているのを目の当たりにしています。
・この状況から、国家安全保障についてどういったことが示唆されるのかを考えていきます。
・本日の私のお話の狙いは、オフェンシブリアリズムの観点から、この対立のダイナミズムについて推測していき、包括的に分析することにあります。
とまぁこんな感じで訳した後、ノートテイキングをしているといくつか気づくことがあります。
Geopoliticsは頻出するから、「GP」で書こう(今回はとっさに思いついてGPと書いたけど、事前に決めておく)
ちなみに CH (中国)の前に <3 みたいな記号がありますが、これは「目」の記号です。よく"see"という意味の記号として使っています。今回は"witness"の部分で使いました。
Global Securityは「G Sec」などが良いかも。あと、自分はGlobal Securityを「国家安全保証」と訳しちゃったけど、「世界の安全保障」が正しいな、、、と反省もできます。
"We must understand"と言っているのに「考えて行きます」と訳しちゃったな… 生成された音源はスピードが早いという言い訳もございますが、言い訳をしていてはダメですね…
オフェンシブリアリズムはORかOfRか、どっちかにしようかな。ORだとorと間違えそうだからOfRにしよう。
原文を見ると"principles of realism, particularly offensive realism"とあるが、ここはノートを取ることに気を取られてしまって、"principles"と"particularly"は落としてしまった。メモに集中しすぎちゃったな
などなどと、色々と気付けます。
こんな感じで私は予習をすると、本番により自信を持って臨めるなと感じました。ぜひ、皆さんも試してみてください!!!