Name: （DiT2）メ・ナーゴ FF14 Me-Nago
Author: 七夢

（出展作品）

ファイナルファンタジー１４

（モデル）

メ・ナーゴ

（紹介）

真面目可愛いミコッテ族の女性兵士。登場の仕方から、No3ぐらいには位置づけられているはず。ただ、解放軍は人手が少ないためか、司令官自ら前線で戦うという、なかなかのブラック職場です。

（LoRA資料の条件）・資料は４２枚（重複で読み込ませ無理やり１００枚）・上半身画像が中心。髪型が特殊なため、顔のアップのみも利用。全身画像も存在するが少ない。・サイズはランダム。まったく統一されていない。・画質も統一されていない。（明るい・暗い様々）・背景はランダムですべてあり。・背中に弓と矢筒を装備してある画像と、装備がない画像が混在。・PNG。（１枚のみJPG）・背景は白指定。

（テスト内容）・テスト生成枚数　６０枚（４枚生成×１５）・テストサイズは４パターン（３：５）、（１：１）、（９：１６）、（１５３６×１５３６）すべてのサイズにおいて、以下のプロンプトを指定。・オリジナルプロンプト・全身指定プロンプト・全身指定プロンプト＋背面指定プロンプト・全身指定プロンプト＋側面指定プロンプト

（結果）

・どうなるかな～思いましたが、かなり上手くキャラクターを生成してくれました。破綻した画像は１枚もなかったです。結果として、こだわりにもよりますが、LoRAの作成ハードルはかなり下がったのではないか？という印象をもっています。

・前回、DiT2でフリーレンLoRAを作成したのですが、背景はすべて白で、資料の質は整えられたもので、本LoRAとはまったくの別ものです。比較してみると、フリーレンは、１枚のイラストに２キャラを描写することが多かったのですが、今回の混在型LoRAの場合、２キャラ描写が少なかったです。（ない訳ではない）そのため、資料を統一しない混在型の方が、２キャラ描写への対応が良いようです。

・トリガーワードの状態では、上半身画像で生成されることが多いので、全身指定（Take a full-body image that shows all the way down to your feet.）のプロンプトを入力すれば全身で生成してくれます。

・武器の認識がいまいちでした。弓・矢筒でないものを装備させてきたり、棒を装備させてきたりします。ただ、武器を装備させる確率は大くはありませんでした。

背中に弓と矢筒を背負っている。（Equipped with a bow and quiver on his back.）とプロンプトに入れてあげると、武器を綺麗に描写してくれます。

２キャラ描写できた場合の回避方法は、

シーンとアクションをプロンプトに入力頂くことで、大幅に確率が下がります。「例：１人のキャラクターが桜並木を歩いている。」

マイページの「LoRAに関する検証」フォルダの中の、うごイラ「ＬｏＲＡの分裂・破綻について考える（その６）」～あたりをご覧いただけると、理由が分かるかと思います。

(Exhibited Work)

Final Fantasy XIV

(Model)

Me-Nago

(Introduction)

A serious and cute female Miqo'te soldier. Judging from her appearance, she's probably ranked around No. 3. However, the Liberation Army is understaffed, so the commander herself fights on the front lines—a rather harsh work environment.

(LoRA Reference Requirements)

42 images (forced to 100 by loading duplicates)
Primarily upper body images. Due to her unique hairstyle, close-ups of her face are also used.

Full-body images exist but are few in number.

Random sizes. Completely inconsistent.
Inconsistent image quality. (Various bright and dark)
Random backgrounds are all included.
Images with and without a bow and quiver are mixed.
PNG format. (Only one JPG image)
White background specified.

(Test Details)

Number of test images generated: 60 (4 images generated x 15)
Four test sizes were used: (3:5), (1:1), (9:16), (1536 x 1536) The following prompts were specified for all sizes:
Original prompt
Full body prompt
Full body prompt + Back view prompt
Full body prompt + Side view prompt

(Results)

I wondered how it would turn out, but it generated the characters quite well. There were no corrupted images. As a result, depending on the level of detail, I have the impression that the hurdle for creating LoRA has been significantly lowered.
Last time, I created a Frieren LoRA using DiT2, but the background was all white, and the quality of the reference materials was refined, making it completely different from the actual LoRA. Comparing them, Frieren often depicted two characters in one illustration, but in this mixed-character LoRA, there were fewer instances of two characters being depicted. (It's not impossible) Therefore, a mixed format that doesn't unify the materials seems to handle two-character depictions better.

When triggered by a word, it often generates an upper-body image, so entering the prompt "Take a full-body image that shows all the way down to your feet." will generate a full-body image.
Weapon recognition was not very good. It would equip things other than bows and quivers, or even a stick. However, the probability of equipping a weapon wasn't high.

Entering the prompt "Equipped with a bow and quiver on his back." will depict the weapons nicely.

To avoid the issue of two-character depictions:

Entering the scene and action in the prompt will significantly reduce the probability. "Example: A single character is walking along a row of cherry trees."

You can find the reason by looking at the animated illustration "Thinking about the split and collapse of LoRA (Part 6)" in the "LoRA-related verification" folder on your My Page.

（DiT2）メ・ナーゴ　FF14　Me-Nago

#Final Fantasy XIV

#Miqo'te

Opis

Komentarze