Unexpected validation image on webdataset SDXL controlnet training script 

### Describe the bug

I am using the [webdataset sdxl controlnet training script](https://github.com/huggingface/diffusers/blob/main/examples/research_projects/controlnet/train_controlnet_webdataset.py).

I ran the script two times with 2 different learning rates in which both runs gave unexpected generated validation images:
### LR = 5e-5
Step 1050
![image](https://github.com/huggingface/diffusers/assets/16372429/6ead9a94-20d7-4030-b383-bff0b25a9023)

Step 1100
![image](https://github.com/huggingface/diffusers/assets/16372429/d2dc4e7b-bd45-40e0-85a4-879d963f8dc5)

### LR = 4e-4
Step 50
![image](https://github.com/huggingface/diffusers/assets/16372429/999902f5-b4a8-4ac1-ad48-b409fab5dd10)

Step 100
![image](https://github.com/huggingface/diffusers/assets/16372429/612d4aa4-5b5a-4592-b464-8d3f680eab81)


I understand this [webdataset script](https://github.com/huggingface/diffusers/blob/main/examples/research_projects/controlnet/train_controlnet_webdataset.py) differs from the [example script](https://github.com/huggingface/diffusers/blob/main/examples/controlnet/train_controlnet_sdxl.py) in 2 ways:
1.  The webdataset script uses the EDM formulation
2. The webdataset script also takes the image cropping top left coordinates into consideration when computing the embeddings whereas the [example script](https://github.com/huggingface/diffusers/blob/main/examples/controlnet/train_controlnet_sdxl.py) takes a [pre-defined `crops_coords_top_left_h` and `crops_coords_top_left_w`](https://github.com/huggingface/diffusers/blob/150142c5374d4a7b8a391b3922f2567d352aa593/examples/controlnet/train_controlnet_sdxl.py#L328)

Therefore, is the generated validation images expected as part of the training process?

### Reproduction

These are my parameters:
```
! accelerate launch diffusers/examples/research_projects/controlnet/train_controlnet_webdataset.py \
--pretrained_model_name_or_path="stabilityai/stable-diffusion-xl-base-1.0" \
 --pretrained_vae_model_name_or_path="madebyollin/sdxl-vae-fp16-fix" \
 --output_dir="/workspace/output" \
 --train_shards_path_or_url="/workspace/data/dataset.tar" \
 --eval_shards_path_or_url="/workspace/data/valdiation_dataset.tar" \
 --control_type="seg" \
 --dataloader_num_workers=2 \
 --max_train_samples=20210 \
 --max_eval_samples=2000 \
 --mixed_precision="bf16" \
 --resolution=1024 \
 --learning_rate=5e-5 \
 --lr_scheduler="constant" \
 --max_train_steps=33690 \
 --validation_image "/workspace/segs/image1.png" \
 --validation_prompt "a living room with a red carpet and white walls"\
 --validation_steps=50 \
 --train_batch_size=6 \
 --gradient_accumulation_steps=1 \
 --report_to="wandb" \
 --tracker_project_name="sdxl-controlnet-run-02" \
 --seed=42 \
 --hub_model_id="helloHearthXHome/cn" \
 --checkpointing_steps=1685 \
```

### Logs

_No response_

### System Info

Diffusers v0.29.1

### Who can help?

@sayakpaul @DN

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Unexpected validation image on webdataset SDXL controlnet training script #8729

Describe the bug

LR = 5e-5

LR = 4e-4

Reproduction

Logs

System Info

Who can help?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Unexpected validation image on webdataset SDXL controlnet training script #8729

Description

Describe the bug

LR = 5e-5

LR = 4e-4

Reproduction

Logs

System Info

Who can help?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions