Skip to content

Expand AMD ROCm Tips readme section#12349

Open
alexheretic wants to merge 1 commit intoComfy-Org:masterfrom
alexheretic:rocm-tips
Open

Expand AMD ROCm Tips readme section#12349
alexheretic wants to merge 1 commit intoComfy-Org:masterfrom
alexheretic:rocm-tips

Conversation

@alexheretic
Copy link
Contributor

@alexheretic alexheretic commented Feb 7, 2026

  • Add suggestion to disable online tuning
  • Add miopen info
  • Add flash attention info
  • Add vram oom suggestion

I have verified all these points to be effective, testing with a gfx1100 on Linux. I suspect they are generally worth at least noting for all AMD users, and this readme is one of the first things amd users might read.

For example I recently tested flash-attention on a 360x640 wan2.2 workflow:

  • --use-flash-attention: 26.06s/it, 26.02s/it
  • --use-flash-attention (tuned attn_fwd): 15.56s/it, 15.12s/it
  • --pytorch-cross-attention: 30.69s/it, 30.97s/it

Add suggestion to disable online tuning
Add miopen info
Add flash attention info
Add vram oom suggestion
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant