SD 1.4 GLIGEN is a text-to-image model that extends Stable Diffusion 1.4 with GLIGEN's grounded generation capabilities. Users specify the spatial layout of objects by pairing bounding boxes with text prompts, which gives more precise control over image synthesis than a text prompt alone.
SD 1.4 GLIGEN is a version of the Stable Diffusion 1.4 model that lets users specify where objects should appear in generated images using bounding boxes. This gives finer control over the layout than text alone, which makes it useful for tasks that need precise object placement.
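As a rough illustration of what "specifying a layout" can look like in practice, the sketch below pairs each text phrase with a pixel bounding box and converts the boxes to coordinates normalized to the 0 to 1 range. The `GroundedObject` class and `to_normalized_boxes` helper are hypothetical names invented for this example, and the normalized `[x_min, y_min, x_max, y_max]` format is an assumption about how a GLIGEN-style pipeline expects its boxes, not a description of any specific library's API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class GroundedObject:
    """Hypothetical container: one phrase tied to one pixel-space box."""
    phrase: str
    box_px: Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max) in pixels


def to_normalized_boxes(
    objects: List[GroundedObject], width: int, height: int
) -> Tuple[List[str], List[List[float]]]:
    """Split a layout into parallel lists of phrases and normalized boxes.

    Assumption: the downstream model wants boxes as fractions of the image
    size, in [x_min, y_min, x_max, y_max] order.
    """
    phrases: List[str] = []
    boxes: List[List[float]] = []
    for obj in objects:
        x0, y0, x1, y1 = obj.box_px
        # Reject boxes that are empty, inverted, or outside the canvas.
        if not (0 <= x0 < x1 <= width and 0 <= y0 < y1 <= height):
            raise ValueError(f"invalid box for {obj.phrase!r}: {obj.box_px}")
        phrases.append(obj.phrase)
        boxes.append([x0 / width, y0 / height, x1 / width, y1 / height])
    return phrases, boxes


# Example layout on a 512x512 canvas: an apple at lower left, a glass at right.
layout = [
    GroundedObject("a red apple", (64, 256, 256, 448)),
    GroundedObject("a glass of water", (320, 128, 448, 448)),
]
phrases, boxes = to_normalized_boxes(layout, width=512, height=512)
```

The resulting `phrases` and `boxes` lists are the kind of paired input a grounded pipeline would take alongside the main prompt. The exact parameter names depend on the implementation used, so they are left out here.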
GLIGEN, Grounded Diffusion, Layout-to-Image, SD-GLIGEN