Abstract
We introduce Patch Explorer, an interactive interface for visualizing and manipulating patches as they are processed by cross-attention heads. Built on interventions via NNsight, our interface lets users inspect and manipulate individual attention heads across layers and timesteps. Interaction with the interface reveals that attention heads in diffusion models independently capture semantics, such as a unicorn’s horn. Beyond analyzing a model's behavior, users can also intervene with Patch Explorer to edit semantic associations within diffusion models, for example by adding a unicorn horn to a horse. Our interface also helps users understand the role of a diffusion timestep through precise interventions. By providing an interactive visualization tool based on attention heads, we aim to shed light on their role in generative processes.
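To make the idea of a head-level intervention concrete, the following is a minimal sketch of toy multi-head cross-attention in plain Python, where a single head's contribution can be rescaled or switched off. It is an illustration under stated assumptions, not the authors' implementation, and it does not use NNsight's API: the function `cross_attention`, the `head_scale` parameter, and all input values are hypothetical, and the output projection that normally follows attention is omitted.

```python
import math


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values, head_scale=None):
    """Toy multi-head cross-attention (hypothetical helper, for illustration).

    queries: [head][patch][dim], derived from image patches
    keys, values: [head][token][dim], derived from prompt tokens
    head_scale: optional {head_index: factor}; a factor of 0.0 ablates a head
    Returns (output, maps): output[patch] concatenates all head outputs,
    maps[head][patch][token] holds the attention weights.
    """
    head_scale = head_scale or {}
    n_patches = len(queries[0])
    dim = len(queries[0][0])
    output = [[] for _ in range(n_patches)]
    maps = []
    for h in range(len(queries)):
        scale = head_scale.get(h, 1.0)
        head_map = []
        for p in range(n_patches):
            # Scaled dot product between this patch's query and every token key.
            scores = [
                sum(q * k for q, k in zip(queries[h][p], key)) / math.sqrt(dim)
                for key in keys[h]
            ]
            weights = softmax(scores)
            head_map.append(weights)
            # Weighted sum of token values, then the per-head intervention.
            mixed = [
                sum(w * v[i] for w, v in zip(weights, values[h]))
                for i in range(dim)
            ]
            output[p].extend(scale * x for x in mixed)
        maps.append(head_map)
    return output, maps


# Made-up inputs: 2 heads, 2 patches, 3 prompt tokens, head dimension 2.
queries = [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [1.0, -1.0]]]
keys = [[[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
        [[0.2, 0.1], [0.4, 0.3], [0.0, 1.0]]]
values = [[[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
          [[0.5, 0.5], [1.0, 0.0], [0.0, 1.0]]]

baseline, maps = cross_attention(queries, keys, values)
ablated, _ = cross_attention(queries, keys, values, head_scale={1: 0.0})
```

Comparing `baseline` with `ablated` shows which part of each patch's representation came from head 1, while `maps` is the kind of per-head, per-patch attention data that a visualization could display. In a real diffusion model the same intervention would be repeated for a chosen layer and timestep.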
| Original language | English |
|---|---|
| Publication date | 2025 |
| Status | Published - 2025 |
| Event | CVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision - Nashville, TN, USA Duration: 12 Jun 2020 → … |
Conference
| Conference | CVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision |
|---|---|
| Location | Nashville |
| Country/Territory | USA |
| City | TN |
| Period | 12/06/2020 → … |
Fingerprint
Dive into the research topics of 'Patch Explorer: Interpreting Diffusion Models through Interaction'. Together they form a unique fingerprint.