Abstract
We introduce Patch Explorer, an interactive interface for visualizing and manipulating the patches as they are processed by cross-attention heads. Built on interventions via NNsight, our interface lets users inspect and manipulate individual attention heads over layers and timesteps. Interaction via the interface reveals that attention heads independently capture semantics, like a unicorn’s horn, in diffusion models. Next to offering a way to analyze its behavior, users can also intervene with Patch Explorer to edit semantic associations within diffusion models, like adding a unicorn horn to a horse. Our interface also helps understand the role of a diffusion timestep through precise interventions. By providing a visualization tool with interactivity based on attention heads, we aim to shed light on their role in generative processes.
| Original language | English |
|---|---|
| Publication date | 2025 |
| Publication status | Published - 2025 |
| Event | CVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision - Nashville, TN, United States Duration: 12 Jun 2020 → … |
Conference
| Conference | CVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision |
|---|---|
| Location | Nashville |
| Country/Territory | United States |
| City | TN |
| Period | 12/06/2020 → … |
Keywords
- Diffusion models
- Interface
- Interpretability
Fingerprint
Dive into the research topics of 'Patch Explorer: Interpreting Diffusion Models through Interaction'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver