[Alpha-VLLM Team] Add Lumina-T2X to diffusers #8652
base: main
Conversation
Co-authored-by: YiYi Xu <[email protected]>
Thank you! I did another round of review. I think we are close to merging once these are addressed.
- Removed extra input parameter
- Fixed typo on Feed_Forward

Co-authored-by: YiYi Xu <[email protected]>
Yes, this is called Grouped Query Attention, proposed in this paper, which improves training and inference efficiency.
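For readers unfamiliar with the technique being discussed: grouped-query attention shares each key/value head across a group of query heads, shrinking the KV cache while keeping the full count of query heads. This is a minimal NumPy sketch, not the PR's implementation; the function name and shapes are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def grouped_query_attention(q, k, v, num_kv_heads):
    """Illustrative GQA. q: (num_heads, seq, dim); k, v: (num_kv_heads, seq, dim)."""
    num_heads, seq, dim = q.shape
    group = num_heads // num_kv_heads
    # Each group of `group` query heads shares one key/value head,
    # so we expand k and v along the head axis before standard attention.
    k = np.repeat(k, group, axis=0)
    v = np.repeat(v, group, axis=0)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(dim)
    return softmax(scores, axis=-1) @ v
```

With `num_kv_heads == num_heads` this degenerates to ordinary multi-head attention; with `num_kv_heads == 1` it is multi-query attention.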
thanks! I think we can merge this soon
I have one question here that needs your input: #8652 (comment). Can you look into refactoring with get_1d_rotary_pos_embed, or create a similar method for Lumina?
The rest of the changes are just to make sure the code structure and naming conventions are consistent with other models - we can certainly help with these to finish it up!
return Transformer2DModelOutput(sample=output)

def precompute_freqs_cis(
I think we can use this instead?

diffusers/src/diffusers/models/embeddings.py (line 277 at 0bae6e4):
def get_1d_rotary_pos_embed(dim: int, pos: Union[np.ndarray, int], theta: float = 10000.0, use_real=False):
Yes! I have refactored this by adding a new function called get_2d_rotary_pos_embed_lumina that uses get_1d_rotary_pos_embed inside. I also added a few lines to get_1d_rotary_pos_embed to enable the context extrapolation proposed in our paper. (Note that this is a universal method for any model that uses RoPE, such as Lumina and Hunyuan. The added arguments are disabled by default, so they will not affect existing pipelines.)
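As background on what a 1D rotary embedding computes: each pair of channels gets a frequency, positions become angles, and hidden states are rotated pairwise. This is a hedged sketch of the general technique, not the diffusers implementation or the PR's extrapolation changes; function names here are hypothetical.

```python
import numpy as np

def rotary_cos_sin(dim, positions, theta=10000.0):
    """Return (cos, sin) of shape (len(positions), dim // 2), as in standard RoPE."""
    # one frequency per channel pair, geometrically spaced by `theta`
    freqs = 1.0 / (theta ** (np.arange(0, dim, 2)[: dim // 2] / dim))
    angles = np.outer(positions, freqs)
    return np.cos(angles), np.sin(angles)

def apply_rotary(x, cos, sin):
    """Rotate each consecutive channel pair of x (seq, dim) by its position angle."""
    x1, x2 = x[..., 0::2], x[..., 1::2]
    out = np.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out
```

Because each pair is a pure rotation, the transform preserves vector norms; context extrapolation schemes typically rescale the positions or frequencies before this step.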
scale_msa, gate_msa, scale_mlp, gate_mlp = self.adaLN_modulation(adaln_input).chunk(4, dim=1)

# Self-attention
hidden_states = modulate(self.attn_norm1(hidden_states), scale_msa)
Can we make a LuminaLayerNormZero, similar to AdaLayerNormZero, and keep it in this file since it's specific to Lumina?
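To illustrate the AdaLayerNormZero pattern being requested: a conditioning embedding is projected into per-sample scale and gate vectors that modulate the normalized hidden states. This is a hypothetical NumPy sketch of the pattern only; the class name, shapes, and initialization are assumptions, not Lumina's actual module.

```python
import numpy as np

def layer_norm(x, eps=1e-6):
    mu = x.mean(-1, keepdims=True)
    var = x.var(-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

class AdaLNZeroSketch:
    """Toy adaLN-Zero: project a conditioning embedding into four
    modulation vectors (scale/gate for attention and MLP branches)."""
    def __init__(self, dim, rng=None):
        rng = rng or np.random.default_rng(0)
        # small-init linear map: cond (batch, dim) -> 4 vectors of size dim
        self.w = rng.standard_normal((dim, 4 * dim)) * 0.02

    def __call__(self, x, cond):
        scale_msa, gate_msa, scale_mlp, gate_mlp = np.split(cond @ self.w, 4, axis=-1)
        # scale the normed hidden states for the attention branch
        h = layer_norm(x) * (1 + scale_msa[:, None, :])
        return h, gate_msa, scale_mlp, gate_mlp
```

Wrapping the norm and modulation in one module, as the reviewer suggests, keeps the transformer block's forward pass free of raw `chunk`/`modulate` plumbing.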
What does this PR do?
Add Lumina-T2X to diffusers
Fixes #8652
Before submitting
- See the documentation guidelines; here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.