すげぇ怒られそうな生成AIツールを作ってみた

天京祐輔

2024年10月6日 17:03

みなさんこんにちは！
株式会社ホコサキの代表取締役をやっています天京と申します
今日はダメでしょって怒られそうな僕が作った画像生成AIツールを紹介ます！

何作ったのよ？

動画を見てもらったら早いと思います！

こりゃいかんでしょ

なにをやってるの？

Webサイト上の画像一覧を出して

画像を元に、その画像と同じ画像を出せるプロンプトを出力
そのプロンプトを元にStable diffusion3を使って画像生成

ということをやっています

なぜ、こんなものを…

ちょっと画像生成AIを齧ったことある人なら首がもげるくらい頷いてくれると思うんですが、結局しんどいのは画像を生成するプロンプトを考えることなんですよね
ある画像を出力したいとして、その画像を文章で説明するなんてのは難しいわけですよ
たとえばさっきのひまわりの画像だって自分がプロンプトを考えると「夏の空の下のひまわり畑」くらいの言語化能力しかないわけです

ですがChatGPTに画像渡して同じ画像を生成できるプロンプトを考えてとお願いしたら

「A vibrant field of sunflowers in bloom under a clear, blue sky, numerous sunflowers with bright yellow petals and large, dark centers, facing various directions, each stalk adorned with broad green leaves creating a lush appearance, the scene exudes a sense of abundance and vitality, capturing a summer day, background features an expanse of bright blue sky with sparse white clouds, the composition elevates the sunflowers as the focal point, with the vast sky providing contrast, natural lighting casting soft shadows and highlights, adding depth and dimension, the color palette is dominated by rich yellows and deep greens contrasting with bright blues, textures are vivid with smooth sunflower petals and rough leaves, the overall mood is cheerful and lively, evoking a sense of joy and summertime, (ultra detailed:1.2), (8k resolution:1.1), (masterpiece:1.3), (best quality:1.2), (sharp focus:1.1), (high contrast:1.05), realistic illustration, landscape genre」

という感じで非常に情報量の多いプロンプトを考えてくれるわけです

作ったもの

一つはChrome拡張機能です

Webサイトの画像一覧を読み込んで、その画像と同じ画像を生成できるプロンプトを生成してくれる拡張機能になります

もう一つがプロンプトを実行できるWebサイトです
Chrome拡張機能の「Generate Similar Images」ボタンを押すとこのサイトが自動で立ち上がります
内部はStable diffusion3 を使っています

両方リリースしていないので現時点では利用はできません
全て手作りです

精度はどうなのよ

簡単な構図だとかなり精度はいいです
パッと見ただけではAIが作った画像かは判定できないでしょう

ですが難しい構図だとバグります

プロンプト

A family sitting on a grassy lawn on a sunny day. Main subjects: a man, woman, and child sitting on a picnic blanket. The woman is wearing a light blue dress with a soft texture, and knee-high boots, the man in a green jacket and jeans, the child in a yellow shirt. All are engaged in a casual, joyful moment, focusing on the child who is sitting between both adults. Background: a park with green trees and a bright blue sky, contributing depth. Composition: center-focused, eye drawn toward the family due to central placement and vibrant clothing. Lighting: natural daylight enhancing a warm and inviting mood, casting soft shadows. Color palette: vibrant greens of the grass and trees, bright blue sky, and colorful clothing. Texture and materials: soft cloth of the dress, denim texture of the jeans, glossy boots. Atmosphere and mood: cheerful and relaxed, evoking a sense of warmth and family bonding. (ultra detailed:1.2), (8k resolution:1.1), (masterpiece:1.3), (best quality:1.2), (sharp focus:1.1), (high contrast:1.05), photographic realism.

元画像の構図とかなり近いものはできているんですが、細部がぐちゃぐちゃで悪夢みたいになっています
あと人は生成AI感が強く出てしまいますね

プロンプト

A sleek black sports car driving on an icy road with snow-covered mountains in the background, (ultra detailed:1.2), (8k resolution:1.1), (masterpiece:1.3), (best quality:1.2), (sharp focus:1.1), (high contrast:1.05), high-performance muscle car with glossy finish, front view showing headlights and grille, car in motion with snow kicking up from wheels, expansive mountain range under clear sky, cool tones with blue and white hues, dramatic lighting with natural shadows, dynamic composition with road leading into the horizon, modern automotive photography style.

人工物はかなり精度がいいです

なんで怒られそうなの？

Adobe StackやイラストACとかのフリー素材を販売しているサービスからしたらブチギレものじゃないです？
そういったサービスのWebサイトに行ってブラウザでぽちぽちしたら同じような画像生成できちゃうんですよ
でも出来ちゃったから仕方ないよね

おわりに

Webサイトの画像を読み込んで同じ画像を生成できるプロンプトを生み出すChrome拡張機能は近日リリース予定です！乞うご期待！