Spring til hovednavigation Spring til søgning Spring til hovedindhold

Patch Explorer: Interpreting Diffusion Models through Interaction

  • Imke Grabe
  • , Jaden Fiotto-Kaufman
  • , Rohit Gandikota
  • , David Bau
  • Northeastern University

Publikation: Konferencebidrag - EJ publiceret i proceeding eller tidsskriftPaperForskningpeer review

Abstract

We introduce Patch Explorer, an interactive interface for visualizing and manipulating the patches as they are processed by cross-attention heads. Built on interventions via NNsight, our interface lets users inspect and manipulate individual attention heads over layers and timesteps. Interaction via the interface reveals that attention heads independently capture semantics, like a unicorn’s horn, in diffusion models. Next to offering a way to analyze its behavior, users can also intervene with Patch Explorer to edit semantic associations within diffusion models, like adding a unicorn horn to a horse. Our interface also helps understand the role of a diffusion timestep through precise interventions. By providing a visualization tool with interactivity based on attention heads, we aim to shed light on their role in generative processes.
OriginalsprogEngelsk
Publikationsdato2025
StatusUdgivet - 2025
BegivenhedCVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision - Nashville, TN, USA
Varighed: 12 jun. 2020 → …

Konference

KonferenceCVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision
LokationNashville
Land/OmrådeUSA
ByTN
Periode12/06/2020 → …

Fingeraftryk

Dyk ned i forskningsemnerne om 'Patch Explorer: Interpreting Diffusion Models through Interaction'. Sammen danner de et unikt fingeraftryk.

Citationsformater