-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.html
895 lines (814 loc) · 50.9 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<title>In-Context LoRA for Diffusion Transformers</title>
<link href="./style.css" rel="stylesheet">
</head>
<body>
<div class="content">
<h1><strong>
In-Context LoRA for Diffusion Transformers
</strong></h1>
<p id="authors">
<span><a href=""></a></span>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Lianghua Huang</a>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Wei Wang</a>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Zhi-Fan Wu</a>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Yupeng Shi</a>
<span style="display: block; height: 8px;"></span>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Huanzhang Dou</a>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Chen Liang</a>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Yutong Feng</a>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Yu Liu</a>
<a href="" style="pointer-events: none; text-decoration:none; color: black;">Jingren Zhou</a>
<br><br>
<span style="font-size: 22px;">Tongyi Lab</span>
</p>
<p style="text-align: center; font-size: 20px;">
<a href="https://arxiv.org/abs/2410.23775" target="_blank">[Paper]</a>
<a href="./bibtex.txt" target="_blank">[BibTeX]</a>
<a href="https://github.com/ali-vilab/In-Context-LoRA" target="_blank">[Code]</a>
</p>
<br>
<img src="https://img.alicdn.com/imgextra/i3/O1CN01zTsWC71HGO8GJkPnl_!!6000000000730-0-tps-2685-853.jpg"
class="teaser-gif" style="width:100%">
<p style="text-align: left; font-size: 14px; margin-top: 8px;">
<strong>Prompt:</strong>
<em>
“This set of <strong style="color: #ff6f61;">four images</strong> illustrates a young artist's creative process
in a bright and inspiring studio;
<strong style="color: #6b5b95;">[IMAGE1]</strong> she stands before a large canvas, brush in hand, adding
vibrant colors to a partially completed painting,
<strong style="color: #6b5b95;">[IMAGE2]</strong> she sits at a cluttered wooden table, sketching ideas in a
notebook with various art supplies scattered around,
<strong style="color: #6b5b95;">[IMAGE3]</strong> she takes a moment to step back and observe her work, and
<strong style="color: #6b5b95;">[IMAGE4]</strong> she experiments with different textures by mixing paints
directly on the palette, her focused expression showcasing her dedication to her craft.”
</em>
</p>
<br>
<p style="text-align: center; font-size: 18px;">
<em>
In-Context LoRA fine-tunes text-to-image models to <strong>generate image sets</strong> with customizable
intrinsic relationships, optionally <strong>conditioned on another set</strong>, enabling adaptation to a wide
range of tasks.
</em>
</p>
</div>
<div class="content">
<h2 style="text-align: center;">Abstract</h2>
<p>
Recent research <a href="https://arxiv.org/abs/2410.15027" target="_blank">[Huang et al., 2024]</a> has explored
the use of diffusion transformers (DiTs) for <strong>task-agnostic image generation</strong> by simply
concatenating attention tokens across images. However, despite substantial computational resources, the fidelity
of the generated images remains suboptimal. In this study, we reevaluate and streamline this framework by
hypothesizing that <strong style="color: #ff6f61;">text-to-image DiTs inherently possess in-context generation
capabilities</strong>, requiring only minimal tuning to activate them. Through diverse task experiments, we
qualitatively demonstrate that existing text-to-image DiTs can effectively perform in-context generation without
any tuning. Building on this insight, we propose a remarkably simple pipeline to leverage the in-context abilities
of DiTs: (1) concatenate images instead of tokens, (2) perform joint captioning of multiple images, and (3) apply
task-specific LoRA tuning using small datasets (<em>e.g.,</em> 20 ~ 100 samples) instead of full-parameter tuning
with large datasets. We name our models In-Context LoRA (IC-LoRA). This approach requires no modifications to the
original DiT models, only changes to the training data. Remarkably, our pipeline generates high-fidelity image
sets that better adhere to prompts. While task-specific in terms of tuning data, our framework remains
task-agnostic in architecture and pipeline, offering a powerful tool for the community and providing valuable
insights for further research on product-level task-agnostic generation systems. We release our code, data, and
models at <a href="https://github.com/ali-vilab/In-Context-LoRA" target="_blank">here</a>.
</p>
</div>
<div class="content">
<h2>Film Storyboard Generation</h2>
<p style="font-size: 18px">
Each three-image sequence is generated simultaneously using In-Context LoRA. A <strong
style="color: #ff6f61;">placeholder character name</strong> uniquely
references the character’s identity across the images.
</p><br>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01mLrLiR1t6GmWfUHhq_!!6000000005852-0-tps-6146-920.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“In this adventurous three-image sequence, [IMAGE1] <strong style="color: #ff6f61;">Ethan</strong>, an intrepid
archaeologist with a rugged appearance, uncovers an ancient map in a sunlit desert dig site, his excitement
palpable as he brushes away the sand, [IMAGE2] transitioning to a bustling marketplace in a vibrant foreign city
where <strong style="color: #ff6f61;">Ethan</strong> negotiates with local merchants and gathers essential
supplies for his quest, [IMAGE3] and finally, <strong style="color: #ff6f61;">Ethan</strong> treks through a
dense, mist-covered jungle, the towering trees and exotic wildlife emphasizing the challenges and mysteries that
lie ahead on his journey.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01Ak3eHJ1mmDkSlVX0B_!!6000000004996-0-tps-6146-965.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“In a vibrant festival, [IMAGE1] we find <strong style="color: #ff6f61;">Leo</strong>, a shy boy, standing at
the edge of a bustling carnival, eyes wide with awe at the colorful rides and laughter, [IMAGE2] transitioning
to him reluctantly trying a daring game, his friends cheering him on, [IMAGE3] culminating in a triumphant
moment as he wins a giant stuffed bear, his face beaming with pride as he holds it up for all to see.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01MsMFwm1NWlzp2r6u4_!!6000000001578-0-tps-6146-1010.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“In a captivating tale of resilience, [IMAGE1] we see <strong style="color: #ff6f61;">Lena</strong>, a
determined girl, planting seeds in a barren field, her face set with resolve, [IMAGE2] transitioning to her
nurturing the plants, watering them daily, her efforts slowly yielding results, [IMAGE3] culminating in a lush
garden bursting with life, <strong style="color: #ff6f61;">Lena</strong> standing proudly amidst her creation,
symbolizing growth and perseverance.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01g0tqSy1sNqMXQYTF7_!!6000000005755-0-tps-6146-1012.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“In a warm portrayal of family dynamics, [IMAGE1] shows <strong style="color: #ff6f61;">Liam</strong> assisting
his little sister <strong style="color: #ff6f61;">Sophie</strong> with her homework at the dining table, their
expressions serious yet playful, [IMAGE2] shifting to the living room, where <strong
style="color: #ff6f61;">Sophie</strong> triumphantly holds up her completed project, her eyes sparkling with
pride while <strong style="color: #ff6f61;">Liam</strong> shares in her joy, [IMAGE3] concluding with both
siblings snuggled on the couch, engrossed in a movie, their laughter echoing through the cozy space.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01uTWAMH1Gm9zShdqKF_!!6000000000664-0-tps-6146-1003.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“In a tender exploration of first love, [IMAGE1] we see <strong style="color: #ff6f61;">Jamie</strong> nervously
arranging flowers in a park, glancing around as if waiting for someone special, [IMAGE2] transitioning to the
moment <Sam> arrives, their eyes locking in a shy smile that speaks volumes, [IMAGE3] finally showing them
seated on a bench, sharing stories and laughter, surrounded by blooming blossoms, embodying the magic of young
romance.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01EQAsYw1UVaHLyWLBf_!!6000000002523-0-tps-6146-1016.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“In a heartwarming depiction of a community gathering, [IMAGE1] captures <strong
style="color: #ff6f61;">Ella</strong> preparing colorful decorations for a local festival, her excitement
palpable, [IMAGE2] then shifts to her helping <strong style="color: #ff6f61;">Tom</strong> set up a booth, their
teamwork highlighted by laughter and shared smiles, [IMAGE3] culminating with the festival in full swing,
<strong style="color: #ff6f61;">Ella</strong> and <strong style="color: #ff6f61;">Tom</strong> surrounded by
friends, their joy radiating against the festive backdrop.”
</em>
</p>
</div>
<div class="content">
<h2>Portrait Photography</h2>
<p style="font-size: 18px">
Each set of four images is generated concurrently with In-Context LoRA, aiming to maintain consistent subject
identities across images within each set.
</p><br>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01wF7zWJ1QMBgqQ7tz5_!!6000000001961-0-tps-2708-864.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images showcases a teenage girl with curly black hair wearing a stylish denim jacket, each
image highlighting her dynamic personality in urban settings; [IMAGE1] she is skateboarding down a
graffiti-covered alley, a confident smile on her face as she maneuvers around obstacles; [IMAGE2] she is seated
at a trendy café, typing on her laptop with focused determination, the bustling city life visible through the
large windows behind her; [IMAGE3] she stands on a rooftop at sunset, her hair blowing in the breeze as she
gazes thoughtfully over the city skyline; and [IMAGE4] she is laughing with friends at a vibrant street market,
colorful lights and stalls creating a lively atmosphere around her.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN011t0jhe24RIk5X5rjF_!!6000000007387-0-tps-2708-865.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images highlights the playful energy of a young boy in a city playground. [IMAGE1] He climbs up
a jungle gym with a look of determination, his hands gripping the bars as he pulls himself up; [IMAGE2] he
swings high on a set of swings, his head thrown back in laughter as his feet touch the sky; [IMAGE3] a close-up
captures him mid-slide, his eyes wide with excitement as he descends down a bright yellow slide; [IMAGE4] he
races down a pathway lined with trees, his arms pumping with energy as he chases after a soccer ball, his face
alight with joy.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN013lnihi1pxcS3iGgVl_!!6000000005427-0-tps-2708-864.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images showcases a young girl exploring a cozy kitchen setting with her mother, filled with
warmth and affection. [IMAGE1] She stands on a stool, her hands reaching into a bowl of cookie dough as her
mother smiles beside her; [IMAGE2] she’s caught mid-laugh, flour dusted across her cheeks as she playfully
tosses a bit of dough in the air; [IMAGE3] the scene focuses on her concentration as she carefully uses cookie
cutters, her tiny hands pressing down on the dough; [IMAGE4] she proudly holds up a finished tray of cookies,
her face beaming with joy and accomplishment.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01RfFNYd1DMZ0gGS3ZW_!!6000000000202-0-tps-2708-862.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images captures the serene moments of an elderly woman tending to her garden. [IMAGE1] She
kneels beside a bed of blooming flowers, her hands gently pruning a rose bush, the soft morning light
illuminating her silver hair; [IMAGE2] she stands with a watering can, her face calm and peaceful as she
nurtures her plants; [IMAGE3] a close-up reveals her content smile as she examines a budding flower in her hand,
a sense of pride and joy evident; [IMAGE4] she sits on a small bench, sipping tea with her garden behind her,
surrounded by the vibrant colors of her hard work.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01f36xn71XaZRCzXnAd_!!6000000002940-0-tps-2702-802.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images captures a lively day spent at a beach between a mother and her son, highlighting their
playful connection and shared joy; [IMAGE1] the boy runs towards the water, his arms wide open, with the mother
following behind, smiling as she watches him; [IMAGE2] they are knee-deep in the ocean, laughing as they splash
each other, the sunlight reflecting off the water; [IMAGE3] they sit on the sand, the boy intently building a
sandcastle while the mother assists, both focused and relaxed; [IMAGE4] the final image shows the two walking
along the shore at sunset, the mother’s arm draped protectively around her son’s shoulders, their footprints
trailing behind them in the sand.”
</em>
</p>
</div>
<div class="content">
<h2>Font Design</h2>
<p style="font-size: 18px">
Each set of four images is generated concurrently with In-Context LoRA, aiming to achieve a consistent font style
across images within each set.
</p><br>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN012hc0501xjR73MnD2F_!!6000000006479-0-tps-2706-452.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images features a minimalist handwriting font for casual use. [IMAGE1] shows "Everyday" on a
coffee cup; [IMAGE2] displays "Notes" on a small journal; [IMAGE3] has "Live Simply" on a white pillow; [IMAGE4]
shows "Good Vibes" on a cozy blanket, perfect for lifestyle and home decor branding.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN018C4tG51fL0uZTfx1T_!!6000000003989-0-tps-2706-452.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four image displays a tech-inspired sans serif font in minimalist designs. [IMAGE1] features "Tech
Flow" in silver on a circuit board; [IMAGE2] shows "Future World" in neon on a digital background; [IMAGE3] has
"Virtual Space" in blue on a sleek black setting; [IMAGE4] displays "AI Vision" in holographic font, ideal for
technology branding.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01fCb9ik1EAwZrSYZEM_!!6000000000312-0-tps-2706-452.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images presents a stylized font for travel themes. [IMAGE1] displays "Wanderlust" over a
mountain scene; [IMAGE2] features "Explore" on a beach background; [IMAGE3] shows "Adventure" with a compass
illustration; [IMAGE4] has "Journey" on a vintage suitcase, perfect for travel branding.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN017glKoo1rwMbC8OPvr_!!6000000005695-0-tps-2706-452.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images highlights a serif font with Victorian-style details. [IMAGE1] displays "Vintage Charm"
on an old book cover; [IMAGE2] shows "Elegance" on a dark lace background; [IMAGE3] features "Old Times" on a
vintage clock; [IMAGE4] presents "Antique" on an ornate mirror, perfect for historical themes.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01B4VUF21f4WeWvknRj_!!6000000003953-0-tps-2706-452.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images showcases a playful bubble font in a vibrant pop-art style. [IMAGE1] displays "Pop
Candy" in bright pink with a polka dot background; [IMAGE2] shows "Sweet Treat" in purple, surrounded by candy
illustrations; [IMAGE3] has "Yum!" in a mix of bright colors; [IMAGE4] shows "Delicious" against a striped
background, perfect for fun, kid-friendly products.”
</em>
</p>
</div>
<div class="content">
<h2>Home Decoration</h2>
<p style="font-size: 18px">
Each set of four images is generated concurrently using In-Context LoRA, aiming to maintain a consistent
decorative style across images within each set.
</p><br>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01AGzFhU25ZpjiND3I2_!!6000000007541-0-tps-2665-844.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images captures a colorful, nature-inspired living space with touches of green and earthy
textures; [IMAGE1] features a cozy nook with a woven chair draped in green blankets, surrounded by potted plants
and botanical prints on the wall; [IMAGE2] highlights a rustic wooden shelf adorned with small planters,
candles, and woven baskets; [IMAGE3] displays a serene bedroom with a bed made up in white linens, a natural
wood nightstand, and a forest-themed mural; [IMAGE4] shows a close-up of a large plant pot with unique textures
beside a patterned area rug.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01weu6qL1af6CYLu3Mn_!!6000000003356-0-tps-2685-671.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This vibrant set of four image captures a lively home decor scene filled with color and eclectic charm;
[IMAGE1] the first image showcases a cozy living area with pastel-colored walls, a soft blue sofa, wooden
storage units displaying colorful accents, and a unique layered pendant light, [IMAGE2] the second image
features a kitchen setup with open shelves holding assorted kitchenware, a wire grid for organizing mugs above a
white sink, and warm sunlight streaming onto the countertop, [IMAGE3] the third image highlights a bold art wall
with an array of colorful, abstract paintings above a sage green sofa adorned with bright cushions, and [IMAGE4]
the fourth image shows a cheerful dining nook with a blue table, vividly striped cushions, framed artwork on the
sunny yellow wall, and a distinctive green pendant lamp casting a soft glow over the space.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01gvmJUM1H6JQ2geK7Q_!!6000000000708-0-tps-2665-841.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images showcases a rustic living room with warm wood tones and cozy decor elements; [IMAGE1]
features a large stone fireplace with wooden shelves filled with books and candles; [IMAGE2] shows a vintage
leather sofa draped in plaid blankets, complemented by a mix of textured cushions; [IMAGE3] displays a corner
with a wooden armchair beside a side table holding a steaming mug and a classic book; [IMAGE4] captures a cozy
reading nook with a window seat, a soft fur throw, and decorative logs stacked neatly.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN018E4Wlt1giCZxPr1yY_!!6000000004175-0-tps-2665-853.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images showcases a vibrant and cozy kitchen with eclectic decor and warm tones; [IMAGE1]
reveals a colorful countertop with an assortment of spices in glass jars, a vintage kettle, and potted herbs;
[IMAGE2] displays a kitchen island with high chairs, bright red cabinets, and a hanging pot rack; [IMAGE3] shows
an inviting breakfast nook with a patterned bench, floral cushions, and a small round table; [IMAGE4] highlights
a section of open shelving with eclectic dinnerware, vibrant mugs, and unique artwork, creating a warm and
lively ambiance.”
</em>
</p>
</div>
<div class="content">
<h2>PowerPoint Template Design</h2>
<p style="font-size: 18px">
Each set of four images is generated concurrently with In-Context LoRA, aiming to create a cohesive and unified
presentation style across slides within each set.
</p><br>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01YzQ66o1OHUOa4EODe_!!6000000001680-0-tps-6000-865.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images showcases a rustic-themed PowerPoint template for a culinary workshop; [IMAGE1]
introduces "Farm to Table Cooking" in warm, earthy tones; [IMAGE2] organizes workshop sections like
"Ingredients," "Preparation," and "Serving"; [IMAGE3] displays ingredient lists for seasonal produce; [IMAGE4]
includes chef profiles with short bios.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01tmFhGD1IMd99xh74b_!!6000000000879-0-tps-6000-868.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images presents a PowerPoint template designed for a charity fundraiser; [IMAGE1] introduces
"Help Make a Difference" in large, bold text over a background of hands reaching out; [IMAGE2] lists causes like
“Education,” “Healthcare,” and “Water Access” with heart icons; [IMAGE3] displays donation statistics; [IMAGE4]
includes a call-to-action slide with links to donate and volunteer.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN013rS3k81nZE81Qnd0t_!!6000000005103-0-tps-6000-867.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images presents a PowerPoint template for an art history class on surrealism; [IMAGE1] shows
“Exploring Surrealism” over a Dali-inspired background; [IMAGE2] lists iconic surrealist artists like “Dali,”
“Magritte,” and “Ernst”; [IMAGE3] includes a timeline of the surrealist movement; [IMAGE4] showcases famous
artworks with short interpretations.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01WJURXs1nRROa3n0zy_!!6000000005086-0-tps-6000-866.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“This set of four images depicts a colorful and engaging PowerPoint template for a “Food Science” educational
presentation; [IMAGE1] features a cover slide with “Understanding Nutrition” in bold typography and vegetable
illustrations; [IMAGE2] presents topics like “Macronutrients,” “Vitamins,” and “Minerals”; [IMAGE3] includes a
pie chart displaying daily nutrient intake recommendations; [IMAGE4] shows recipe ideas with images and
nutritional benefits.”
</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01OGlYDt1h9DwlTZc1f_!!6000000004234-0-tps-6000-845.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>
“The set of four images displays a vibrant template for a fashion branding presentation; [IMAGE1] introduces the
title “New Collection 2024” with a runway-inspired background; [IMAGE2] lists fashion sections like
“Streetwear,” “Formal,” and “Accessories” with icons; [IMAGE3] includes a color palette guide for the season;
[IMAGE4] presents a trend forecast with illustrated outfit ideas.”
</em>
</p>
</div>
<div class="content">
<h2>Couple Profile Generation</h2>
<p style="font-size: 18px">
Each image pair is generated concurrently with In-Context LoRA, aiming to maintain a consistent style and identity
features across both images in each set.
</p><br>
<div style="display: flex; flex-wrap: wrap; gap: 20px;">
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01Zd4r301vlblFhxo9i_!!6000000006213-0-tps-2116-1055.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This pair of images features a couple as cartoon characters in medieval attire; [IMAGE1] shows a knight
with a plumed helmet and a determined look, holding a small shield, while [IMAGE2] displays a character
dressed as a princess with a crown, smiling as they hold a flower, both against a castle background.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01Qx3E3C1Kp92KlAcdR_!!6000000001212-0-tps-2116-1055.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The pair of images captures a whimsical depiction of a couple in cartoon dragon costumes; [IMAGE1] a
character in a green dragon onesie with pointed ears and a toothy smile peeks towards the right, while
[IMAGE2] shows a character in a purple dragon suit with matching horns, displaying a playful wink, both set
against a cloudy sky background.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01BDI5UY1qEZ6mJeuQA_!!6000000005464-0-tps-2116-1056.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This pair of images portrays a couple of cartoon cats in detective attire; [IMAGE1] a black cat in a
trench coat and fedora holds a magnifying glass and peers to the right, while [IMAGE2] a white cat with a
bow tie and matching hat raises an eyebrow in curiosity, creating a fun, noir-inspired scene against a dimly
lit background.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN012rpfja1aDcR83RVkp_!!6000000003296-0-tps-2116-1056.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The pair of images depicts cartoon characters enjoying music together; [IMAGE1] features a character with
a spiky mohawk and wide headphones, bobbing their head with closed eyes, while [IMAGE2] presents a character
with a ponytail, holding a guitar and also wearing headphones, both set against a dark blue background with
musical notes scattered around.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01hWZwhz1rfsLBbbXMb_!!6000000005659-0-tps-2116-1055.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The pair of images depicts a couple in a cartoon-style grocery shopping scene; [IMAGE1] one character
reaches for a snack on a high shelf with a playful grin, while [IMAGE2] the other character with wide eyes
and a towering cart of food holds a grocery list, all set in a colorful grocery aisle.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01uos6eF1cwABrBL8z7_!!6000000003664-0-tps-2116-1055.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This pair of images capture a couple in a pillow fight; [IMAGE1] a character with tousled hair and a
mischievous grin winds up to swing a fluffy pillow, while [IMAGE2] another character, already hit with
feathers flying around them, has a playful look of shock, both in a cozy bedroom with fluffy bedding.”</em>
</p>
</div>
</div>
</div>
<div class="content">
<h2>Visual Identity Design</h2>
<p style="font-size: 18px">
Each image pair is generated concurrently with In-Context LoRA, aiming to achieve a cohesive and consistent visual
identity across both images in each pair.
</p><br>
<div style="display: flex; flex-wrap: wrap; gap: 20px;">
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01a8LhKZ1I8tGQCxJOA_!!6000000000849-0-tps-2309-1602.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The pair of images highlights a logo and its real-world use for a rustic coffee brand; [IMAGE1] a
striking teal background showcases a logo with a stylized, perched bird in black and white, titled “Bluebird
Roast” in an elegant serif font, with a leafy branch detail underneath; [IMAGE2] this logo is applied to a
coffee mug sitting atop a woven coaster on a dark mahogany table, with a blurred background that emphasizes
the warm tones and classic aesthetic of the branding in a cozy setting.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01oVt0qC1HkcGy63cbJ_!!6000000000796-0-tps-2310-1602.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The pair of images showcases the joyful identity of a produce brand, [IMAGE1] showing a smiling pineapple
graphic and the brand name “Fresh Tropic” in a fun, casual font on a light aqua background; while [IMAGE2]
translates the design onto a reusable shopping tote with the pineapple logo in black, held by a person in a
market setting, emphasizing the brand’s approachable and eco-friendly vibe.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01La2bjz1liGiQBvvKd_!!6000000004852-0-tps-2310-1603.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This pair of images presents an artisan soap brand inspired by botanical elements. [IMAGE1] On a rich
sage green background, delicate gold-foil leaves and flower motifs intertwine around the brand name “Herbal
Haven” in an elegant, serif font, conveying a sophisticated, earthy aesthetic. [IMAGE2] The design is
applied to a set of organic soaps wrapped in handmade paper and twine, placed with real herbs and flowers on
a wooden board, radiating the brand’s commitment to natural beauty and luxury through a warm, inviting
setting.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01kVVKTG257tZo2n7NA_!!6000000007480-0-tps-2310-1602.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This pair of images introduces a sophisticated confectionery brand identity blending elegance and whimsy.
[IMAGE1] The first image resents a whimsical, Art Nouveau-inspired design, featuring a pattern of golden
leaves intertwined with pastel-colored candy shapes on a deep plum background. The brand name "Golden
Garden" appears in a flowing, decorative font, surrounded by delicate floral filigree. [IMAGE2] The design
is applied to a set of artisanal chocolate boxes, displayed with gold-foil accents and delicate paper
flowers, conveying the brand’s high-end and enchanting quality through luxurious textures and intricate
details.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01GIKAYi1bqNZXafCUj_!!6000000003516-0-tps-2315-1606.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“In this set of two images, a bold animal-themed logo is introduced and adapted to a lifestyle product;
[IMAGE1] a simplistic black logo featuring a bear face and the brand name “Bear Lane” on a sky blue
background; [IMAGE2] the design is printed on a gray gym bag and water bottle, with both items positioned on
a wooden gym bench.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 48%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01ue8QQQ1wT7O7RbWD7_!!6000000006308-0-tps-2310-1608.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“In this set of two images, a modern mystical brand identity comes to life. [IMAGE1] Against a deep navy
background, intricate star and moon motifs in metallic silver and soft blush pink shimmer in various sizes,
creating a cosmic, dreamlike atmosphere. The brand name “Celestial Glow” is displayed in a sleek, geometric
font that radiates a mystical yet minimalist vibe. [IMAGE2] The design is adapted onto a glowing glass
misting bottle and a crystal-infused body lotion bottle, arranged on a soft, cloud-like velvet fabric with
crystals and candles, showing the brand’s ethereal charm in self-care products.”</em>
</p>
</div>
</div>
</div>
<div class="content">
<h2>Portrait Illustration</h2>
<p style="font-size: 18px">
Each pair of images is generated with In-Context LoRA, aiming to maintain consistent identity, clothing,
expression, similar pose, and atmosphere between the ‘before’ and ‘after’ illustration versions. Instead of
directly replicating the original photo, the illustration enhances key features with added expressive emphasis.
</p><br>
<div style="display: flex; flex-wrap: wrap; gap: 20px;">
<div class="item" style="flex: 1 1 30%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01X2XFvm1U3e7MPSkNj_!!6000000002462-0-tps-2058-2054.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This image pair presents a transformation from a realistic portrait to a playful illustration, capturing
both detail and artistic flair; [IMAGE1] the photograph shows a woman standing in a bustling marketplace,
wearing a wide-brimmed hat, a flowing bohemian dress, and a leather crossbody bag; [IMAGE2] the illustration
version exaggerates her accessories and features, with the bohemian dress depicted in vibrant patterns and
bold colors, while the background is simplified into abstract market stalls, giving the scene an animated
and lively feel.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 30%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01fX8HPH1DCUIQvnMQa_!!6000000000180-0-tps-2058-2054.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The image pair highlights a transformation from a high-fashion portrait to an artistic interpretation,
capturing elegance in both styles; [IMAGE1] the photo shows a woman wearing a sleek black dress with lace
details, posing against a white studio backdrop, her hair styled in an intricate updo; [IMAGE2] the
illustration reimagines her as a stylized figure, with the lace details transformed into bold, intricate
patterns and her hair exaggerated into voluminous curls, while the background is simplified into a gradient
of soft, muted colors, enhancing the contrast between her formal attire and the artistic rendering.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 30%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01oTYYP01bqNZT0Tw88_!!6000000003516-0-tps-2058-2054.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The image pair showcases the transformation from reality to a stylized interpretation; [IMAGE1] the photo
shows a person with a topknot, wearing a cozy yellow sweater and plaid scarf, standing in front of a shop
window, while [IMAGE2] the illustrated version highlights the warm tones, adding playful, oversized shapes
and bright hues, creating an animated feel with a soft, inviting background.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 30%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01wwlMFR26Mq7F4geJa_!!6000000007648-0-tps-2058-2010.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The image pair illustrates a transformation from a candid photograph to a dynamic illustration, each
capturing distinct artistic qualities; [IMAGE1] the original photo features a man with a beard, wearing a
denim jacket over a graphic tee and black jeans, seated on a staircase with a skateboard beside him, while
[IMAGE2] the illustrated version amplifies his outfit with bold colors, adding stylized graffiti on the
steps and vibrant motion lines around the skateboard.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 30%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01dBfpBt1uADoaHIW2O_!!6000000005996-0-tps-2058-2010.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This image pair captures a transformation from a street-style photograph to a dynamic digital
illustration; [IMAGE1] the photo shows a person wearing a colorful windbreaker jacket, ripped jeans, and
white sneakers, walking along a busy city street with a skateboard tucked under their arm; [IMAGE2] the
illustration simplifies the background into bold, abstract shapes, while the figure’s outfit is brightened
with more vibrant colors and their pose is exaggerated, giving the image a sense of movement and energy that
contrasts with the stillness of the photograph.”</em>
</p>
</div>
<div class="item" style="flex: 1 1 30%; box-sizing: border-box;">
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01f38pcY1H3Z2aneSMz_!!6000000000702-0-tps-2058-2010.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The image pair contrasts a photographic portrait with its illustrated counterpart, showcasing an artistic
reinterpretation; [IMAGE1] the initial photo shows a woman with a high bun, dressed in a classic black
trench coat, holding a bright yellow umbrella, standing on a rainy street, while [IMAGE2] the illustration
accentuates her pose with exaggerated features, making the umbrella the focal point with vivid yellows and
reds, transforming the rain into playful, curving lines.”</em>
</p>
</div>
</div>
</div>
<div class="content">
<h2>Sandstorm Visual Effect</h2>
<p style="font-size: 18px">
Each image pair is generated using In-Context LoRA, aiming to demonstrate strong consistency between the ‘before’
and ‘after’ sandstorm effect images.
</p><br>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01ALVWcr1NnGFrmY9lx_!!6000000001614-0-tps-2689-758.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This image pair showcases the transformation of a cyclist through a sandstorm visual effect; [IMAGE1]
features a cyclist in vibrant gear pedaling steadily on a clear, open road with a serene sky in the background,
highlighting focus and determination, [IMAGE2] transforms the scene as the cyclist becomes enveloped in a fierce
sandstorm, with sand particles swirling intensely around the bike and rider against a stormy, darkened backdrop,
emphasizing chaos and power.”</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01RWlQE81xXWohf4GA2_!!6000000006453-0-tps-2689-753.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“The image pair illustrates the metamorphosis of a musician enhanced by a sandstorm effect; [IMAGE1] the first
image depicts a guitarist playing calmly on a minimalist stage with soft lighting, capturing the essence of
tranquility and artistry, [IMAGE2] the second image erupts into a dynamic sandstorm with sand and debris
swirling around the musician and instrument, set against a tumultuous background, conveying an intense and
electrifying performance.”</em>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01j3B7Ny1HGqWpFcVUY_!!6000000000731-0-tps-2689-760.jpg"
style="width:100%;">
<p style="text-align: left; font-size: 14px; margin-top: 8px; margin-bottom: 28px;">
<strong>Prompt:</strong>
<em>“This pair of images highlights a stunning transformation with a sandstorm visual effect, balancing calm and
intensity; [IMAGE1] features a man in a meditative pose, seated cross-legged in a black outfit against a white
backdrop, eyes closed, [IMAGE2] shows the man shrouded in a fierce explosion of swirling sand particles mixed
with streaks of electric light, against a deeper background, creating a captivating display of serenity
overtaken by chaos.”</em>
</p>
</div>
<div class="content">
<h2>Image-Conditional Generation</h2>
<p style="font-size: 18px">
Examples of image-conditional generation using In-Context LoRA across multiple tasks with training-free SDEdit.
</p>
<p style="font-size: 18px; margin-top: 32px;">
<strong>Portrait Identity Transfer.</strong>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01ncZRGl22GeIbH9YIG_!!6000000007093-0-tps-11275-3459.jpg"
style="width:100%;"><br>
<p style="font-size: 18px">
<strong>Font Style Transfer.</strong>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN01XBeEY61qVVlJ9kUjm_!!6000000005501-0-tps-11275-1788.jpg"
style="width:100%;"><br>
<p style="font-size: 18px">
<strong>Application of Visual Identity.</strong>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i1/O1CN01MrrnTU1a407UQgxp9_!!6000000003275-0-tps-11275-3073.jpg"
style="width:100%;"><br>
<div style="display: flex; flex-wrap: wrap; gap: 20px;">
<div class="item" style="flex: 1 1 36%; box-sizing: border-box;">
<p style="font-size: 18px">
<strong>Portrait to Illustration.</strong>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i4/O1CN01y85XZ71iQST0KUe5O_!!6000000004407-0-tps-4013-3601.jpg"
style="width:100%;">
</div>
<div class="item" style="flex: 1 1 58.2%; box-sizing: border-box;">
<p style="font-size: 18px">
<strong><span style="color: #ff6f61;">Failure case</span> of Sandstorm Visual Effect Application.</strong>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i2/O1CN019FA7xz1CGmOORU6Ng_!!6000000000054-0-tps-6402-3601.jpg"
style="width:100%;">
</div>
</div><br>
<p style="font-size: 18px; margin-top: 32px;">
<strong><span style="color: #ff6f61;">Failure cases</span> of Portrait Identity Transfer.</strong>
</p>
<img class="summary-img"
src="https://img.alicdn.com/imgextra/i3/O1CN01qjJpGW1icMlPtc0VD_!!6000000004433-0-tps-4643-4214.jpg"
style="width:80%;"><br>
<p style="font-size: 18px">
We observe that SDEdit for In-Context LoRA tends to be unstable, often failing to preserve identity. Addressing this issue is left for future work.
</p>
</div>
<div class="content">
<h2>BibTex</h2>
<code> @article{lhhuang2024iclora,<br>
title={In-Context LoRA for Diffusion Transformers},<br>
author={Huang, Lianghua and Wang, Wei and Wu, Zhi-Fan and Shi, Yupeng and Dou, Huanzhang and Liang, Chen and Feng, Yutong and Liu, Yu and Zhou, Jingren},<br>
booktitle={arXiv preprint arxiv:2410.23775},<br>
year={2024}<br>
} </code>
</div>
<br><br>
<footer class="footer">
<div class="container">
<div class="columns is-centered">
<!-- <div class="content"> -->
website template from <a href="https://ali-vilab.github.io/composer-page/" target="_blank">composer</a>
<!-- </div> -->
</div>
</div>
</footer>
<br><br>
</body>
</html>