[Figure: Text-Conditioned Generation. Columns: Input Text, Ours, 3DS2V, 3DILG. Input text shown: "a 3d model of the monster, The standing humanoid shape of the monster, with 2 feet and 2 hands."]