Major overhaul / better native integration #92

city96 · 2024-12-11T02:51:52Z

The plan is to do a full rewrite/refactor of this repo to have better integration with most of the native comfy code.
This should make things less fragile (and less cumbersome in general).

Progress/steps:

Major changes:

Text encoders return comfy compatible CLIP objects instead of custom types
Auto config detection wherever possible to minimize user error
Single node with dropdown for all models instead of separate node sets per model
PixArt/DiT/etc models are loaded from unet (diffusion_models) folder instead of checkpoints folder

Other possible ideas/plans:

GGUF support if ComfyUI-GGUF is installed
Generic resolution select node with dropdown and slider/float input
Change to native comfy attention for PixArt/Sana
Add proper ControlNet for PixArt (current version never worked correctly)

don't initialize base Sana model for SanaMS

@SaiZyca

#86 - patch provided by @SaiZyca

frutiemax92 · 2024-12-26T19:14:22Z

If you want to test out the diffusers format of SANA, I just finetuned this model.
https://huggingface.co/frutiemax/themoviedb_1600M_1024px

patientx · 2025-01-01T21:42:44Z

If you want to test out the diffusers format of SANA, I just finetuned this model. https://huggingface.co/frutiemax/themoviedb_1600M_1024px

gives the error :

# ComfyUI Error Report
## Error Details
- **Node ID:** 181
- **Node Type:** EXMUnetLoader
- **Exception Type:** KeyError
- **Exception Message:** 'hidden_size'
## Stack Trace
  File "D:\c2\execution.py", line 327, in execute
    output_data, output_ui, has_subgraph = get_output_data(obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb)
  File "D:\c2\execution.py", line 202, in get_output_data
    return_values = _map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb)
  File "D:\c2\execution.py", line 174, in _map_node_over_list
    process_inputs(input_dict, i)
  File "D:\c2\execution.py", line 163, in process_inputs
    results.append(getattr(obj, func)(**inputs))
  File "D:\c2\custom_nodes\ComfyUI_ExtraModels\nodes.py", line 33, in load_unet
    return (loader_fn(sd),)
  File "D:\c2\custom_nodes\ComfyUI_ExtraModels\Sana\loader.py", line 57, in load_sana_state_dict
    model_config = model_config_from_unet(sd)
  File "D:\c2\custom_nodes\ComfyUI_ExtraModels\Sana\loader.py", line 92, in model_config_from_unet
    if config["hidden_size"] == 1152:

RandomGuyWithIssues · 2025-01-29T05:01:43Z

Any update on this?

arcum42 · 2025-02-08T00:57:04Z

It's worth noting that when Lumina Image 2 support got added to ComfyUI, that also uses Gemma for a text encoder, so you may be able to use the builtin Gemma support now.

frutiemax92 · 2025-02-27T00:09:52Z

Any news on this?

future-knowin · 2025-03-14T03:23:12Z

Any news on this?

I am getting the black output as well.... hopefully this get fix ASAP. any one?

city96 added 13 commits December 10, 2024 22:19

PixArt initial rewrite

3f844ca

Remove HyDiT - supported in main

45423d9

Use native ops for PixArt

d5a47fa

Unused

6a88140

Consolidate loader logic

06a9368

Sana loader logic

5505ab4

Use native ops for sana

7e19ac5

Faster model initialization

2beff5a

don't initialize base Sana model for SanaMS

Patch size logic from common dit

d1f7516

Make mlp args match the one from timm

ca2bc8b

Remove timm as a dependency

ee982c5

Add switch for new xformers

4a11543

#86 - patch provided by @SaiZyca

Add fast rgb preview to Sana

7b3f2ce

city96 mentioned this pull request Dec 12, 2024

Add ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer Support comfyanonymous/ComfyUI#5785

Open

Rename/move

c52d827

city96 mentioned this pull request Dec 12, 2024

Sana - not work - grey image #93

Closed

Gemma first implementation

0f567fc

city96 mentioned this pull request Dec 14, 2024

GEMMA models are not downloading for SANA workflow. #94

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Major overhaul / better native integration #92

Major overhaul / better native integration #92

Uh oh!

city96 commented Dec 11, 2024 •

edited

Loading

Uh oh!

frutiemax92 commented Dec 26, 2024

Uh oh!

patientx commented Jan 1, 2025 •

edited

Loading

Uh oh!

RandomGuyWithIssues commented Jan 29, 2025

Uh oh!

arcum42 commented Feb 8, 2025

Uh oh!

frutiemax92 commented Feb 27, 2025

Uh oh!

future-knowin commented Mar 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Major overhaul / better native integration #92

Are you sure you want to change the base?

Major overhaul / better native integration #92

Uh oh!

Conversation

city96 commented Dec 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

frutiemax92 commented Dec 26, 2024

Uh oh!

patientx commented Jan 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RandomGuyWithIssues commented Jan 29, 2025

Uh oh!

arcum42 commented Feb 8, 2025

Uh oh!

frutiemax92 commented Feb 27, 2025

Uh oh!

future-knowin commented Mar 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

city96 commented Dec 11, 2024 •

edited

Loading

patientx commented Jan 1, 2025 •

edited

Loading