Skip to main navigation Skip to search Skip to main content

Patch Explorer: Interpreting Diffusion Models through Interaction

  • Imke Grabe
  • , Jaden Fiotto-Kaufman
  • , Rohit Gandikota
  • , David Bau

Research output: Contribution to conference - NOT published in proceeding or journalPaperResearchpeer-review

Abstract

We introduce Patch Explorer, an interactive interface for visualizing and manipulating the patches as they are processed by cross-attention heads. Built on interventions via NNsight, our interface lets users inspect and manipulate individual attention heads over layers and timesteps. Interaction via the interface reveals that attention heads independently capture semantics, like a unicorn’s horn, in diffusion models. Next to offering a way to analyze its behavior, users can also intervene with Patch Explorer to edit semantic associations within diffusion models, like adding a unicorn horn to a horse. Our interface also helps understand the role of a diffusion timestep through precise interventions. By providing a visualization tool with interactivity based on attention heads, we aim to shed light on their role in generative processes.
Original languageEnglish
Publication date2025
Publication statusPublished - 2025
EventCVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision - Nashville, TN, United States
Duration: 12 Jun 2020 → …

Conference

ConferenceCVPR 2025 - 1st Workshop on Mechanistic Interpretability in Computer Vision
LocationNashville
Country/TerritoryUnited States
CityTN
Period12/06/2020 → …

Keywords

  • Diffusion models
  • Interface
  • Interpretability

Fingerprint

Dive into the research topics of 'Patch Explorer: Interpreting Diffusion Models through Interaction'. Together they form a unique fingerprint.

Cite this