IC-LoRAs: Finally, consistent multi-image generation that works (most times!)
The new In-Context LoRA paper introduces a surprisingly simple but powerful approach to generating multiple consistent images: What it does: * Generates multiple connected images - this creates the context * Tiny dataset requirement (20-100 images) * No model architecture changes * Smart prompt format for context * 10 LoRAs released by the team, and I trained one too (links below) Real-world Applications: 1. Visual Storytelling * Consistent characters across scenes * Perfect for storyboards/comics 2. Brand Identity * Logo applications * Product mockups 3. Font Design * Test fonts across contexts * Consistent style showcases 4. ??? (For us to find out!)
How It Works: 1. Training: * Concatenate related images into one training sample * Need only 20-100 image sets * Standard LoRA fine-tuning (AI Toolkit by Ostris) * Unique Captioning and Prompting pipeline: * "This set of images shows [overall theme]; [IMAGE1] first scene description, [IMAGE2] second scene description..." * For character consistency: "[IMAGE1] shows <Sarah> in the garden, [IMAGE2] <Sarah> by the fountain..."
LoRAs available at: https://huggingface.co/ali-vilab/In-Context-LoRA The IC-LoRA I trained: https://huggingface.co/glif-loradex-trainer/AP123_movie_shots_ic_lora_experiment_v1
You can try out these LoRAs for free on Glif: - Visual Identity LoRA - Font Design LoRA - PowerPoints IC-LoRA - My Storyboard IC-LoRA
NOTE: These LoRAs work in Forge and ComfyUI!