Understand

Inspection Dialogue

causal reasoninguser agencyinteractive inquiryaudience adaptation

User question

Can I ask the system follow-up questions?

Consulting signal

Relevant when users need to probe a decision beyond what the static explanation shows: particularly useful in expert-facing tools where caseworkers, clinicians, or analysts need to investigate edge cases.

Overview

Why this pattern exists

Static explanations answer the question the designer anticipated. Inspection Dialogue answers the question the user actually has.

A user reviewing a declined loan may understand the top factors but want to ask: "Would it have changed the result if I'd submitted the income statement from my second job?" A clinician reviewing a diagnosis may want to ask: "Why did you weight this symptom so heavily given the patient's age?" A caseworker may want to ask: "What would this score be if I corrected the employment dates?"

These are follow-up questions, and static attribution panels, bar charts, and factor lists cannot answer them. They require interactive inspection: a mechanism for the user to probe the explanation, test assumptions, and get specific answers to their specific questions.

As AI interfaces increasingly incorporate conversational elements (chat interfaces, copilots, AI assistants), the inspection dialogue pattern becomes both more natural and more important. It bridges the structured display patterns of traditional XAI with the conversational interaction model of LLM-powered systems.

Design goal

Provide an interactive mechanism through which users can ask follow-up questions about a decision or output, probe specific factors, test hypothetical changes, and receive direct answers: without requiring technical knowledge.

Usage guidance

When to use

Users have specific, varied follow-up questions that cannot all be anticipated in a static explanation
The decision involves multiple factors and users need to explore the interaction between them
The interface already incorporates conversational or chat-based interaction
Expert users (caseworkers, analysts, clinicians) need to investigate edge cases or anomalies
The explanation display alone is insufficient to support the user's decision or action

When not to use

Users need speed over depth: interactive inspection adds time and may not be appropriate in high-volume workflows
The system cannot reliably answer follow-up questions without hallucinating or misleading (see Grounding & Hallucination Indicators, 22)
The domain has strict interpretive boundaries where ad hoc explanations risk being misleading or misused
The user's question requires a different pattern: "what can I change?" is better answered by Actionable Recourse than by open-ended inspection

Design

UI primitives

to be added soon

Interactive Control / Question Set

Guided question set

A set of pre-defined, contextually relevant follow-up questions presented as quick-select options. Research on how users question context-aware systems identifies four recurring question types that inspection dialogue must support: - Why: "Why did employment history matter so much?" (explaining the factors) - Why not: "Why wasn't my income enough on its own?" (contrastive, ruling out alternatives) - What if: "Would a higher income have changed the result?" (hypothetical / counterfactual) - How to: "What would I need to do to get a different outcome?" (recourse-seeking) Generating contextually relevant questions from each of these types, based on the actual factors in the decision, provides structured coverage of what users genuinely need to understand.

to be added soon

Interactive Control / Input Field

Free-text question input

An open text field allowing users to ask their own questions. Most powerful but requires the system to understand and answer reliably. Should be paired with grounding indicators to signal when the system is uncertain about its answer.

to be added soon

Interactive Control / Input

What if input

A structured form allowing users to change one variable and see the hypothetical impact: "What if my income were €X?" This is inspection dialogue structured as a counterfactual query: a bridge between inspection and Counterfactual Explanation.

to be added soon

Navigation & Flow / Chat Panel

Explanation chat panel

A side panel with a conversational interface specifically scoped to the current decision. The system can only answer questions about this decision, not general questions about the AI. Scoping prevents the conversation from drifting outside what the system can reliably answer.

to be added soon

Content Block / Card

Follow-up answer card

When a user asks a question, the answer appears as a new card anchored to the relevant part of the explanation, not as a free-floating response. This keeps the inspection grounded in the specific decision rather than abstractly conversational.

to be added soon

Content Block / History Log

Inspection history

A record of the questions asked during an inspection session: useful for audit purposes and for caseworkers who need to document the reasoning behind an override decision.

How to use

Scope the dialogue tightly.

An inspection interface that can answer any question is also an interface that can produce unreliable answers. Scope the system to the decision at hand. "I can answer questions about this specific recommendation" is safer and more trustworthy than a general-purpose AI chat window.

Lead with guided questions, not a blank input.

Most users don't know what questions to ask of an AI system. Pre-defined question options that are contextually generated (based on the factors that appeared in the attribution) remove this burden and guide users toward questions the system can actually answer well.

Show the basis of every answer.

Inspection dialogue answers should surface their reasoning, for example "this is based on the employment factor, which contributed −0.34 to the score", not standalone assertions. Answers without visible basis cannot be evaluated or trusted.

Indicate when a question is outside scope.

If a user asks a question the system cannot reliably answer, say so explicitly rather than generating a plausible-sounding but unreliable answer. This is especially critical in domains with legal or safety implications.

Log inspection sessions for audit.

Questions asked during inspection are part of the decision record. A caseworker who asks "what if I adjust the employment dates?" and then overrides based on the answer has made a decision that should be auditable.

Use cases

flow a

Applicant follow-up question

1. Applicant sees attribution showing "employment gaps" as a top negative factor.
2. A suggested question appears: "Why does employment history matter for this decision?"
3. Applicant selects it.
4. System responds: "Consistent employment history reduces repayment risk. Gaps in the past 24 months were flagged because they suggest income instability." Linked to the attribution factor for context.
5. Applicant understands the reasoning and moves to recourse options.

flow b

Caseworker hypothetical investigation

1. Caseworker suspects the employment data was entered incorrectly.
2. Opens inspection panel and types: "What would the score be if the employment gap were 3 months instead of 12?"
3. System calculates and responds: "With a 3-month gap, the score would increase from 42 to 61: above the referral threshold."
4. Caseworker requests corrected documentation and updates the case.

flow c

Out-of-scope question

1. Applicant asks: "Is this system biased against people like me?"
2. System responds: "I can only answer questions about the specific factors that contributed to your decision. For information about how the system was validated for fairness, see [Model Transparency Card]."
3. User is redirected to the appropriate resource rather than receiving an unreliable generative answer.

Design trade-offs

Interactivity vs. reliability

The more flexible the inspection dialogue, the more risk that the system will produce unreliable answers. Constrain what can be asked to what can be answered reliably.

Depth vs. efficiency

Inspection dialogue is for users who need depth. In high-volume workflows, it may slow down experienced users who don't need it. Make it accessible but not mandatory: a panel that can be opened, not an inline interruption.

Transparency vs. gaming

"What if I change this variable?" is the first step toward gaming the system. In some contexts (loan applications, risk scoring) detailed counterfactual inspection may need to be limited or scoped to authorized users only.

Connections

Relation to other patterns

Explanation Layering

How much detail do I need?

Organize explanatory content into meaningful depth layers: so the right amount of information is available to each user at the right moment, without overwhelming those who need less or frustrating those who need more.

Layering offers depth through structure. Inspection dialogue offers depth through interaction. For most users, layering is sufficient; inspection dialogue is for users who need to go further.

Counterfactual Explanation

What would have changed the outcome?

Show users the minimum or most relevant changes to their situation that would have produced a different outcome: making the decision boundary visible without requiring users to understand the model.

"What if" inspection is functionally a counterfactual query. The patterns share ground and can be combined: counterfactuals are a structured form of inspection dialogue.

Attribution

Why did the system make this decision?

Surface the key factors that influenced an AI decision in a way that is readable, appropriately qualified, and supports the user's ability to evaluate, challenge, or act on the outcome.

Attribution provides the content that inspection dialogue allows users to probe. The two patterns work together: attribution answers the first question, inspection dialogue answers the follow-up.

Human-in-the-Loop

Who can review or override this decision?

Provide authorized users with clear, accessible mechanisms to review, modify, and override AI recommendations, and ensure that all interventions are documented in a way that supports accountability and audit.

For caseworkers, inspection dialogue is part of the override workflow. The investigation a caseworker conducts before an override should be logged as part of the decision record.

Grounding & Hallucination Indicators

Can I trust what the system generated?

Give users visible signals about the factual grounding of AI-generated content: distinguishing between responses that are well-supported by retrievable evidence and those that may contain fabricated, outdated, or unverifiable claims.

In LLM-powered inspection interfaces, grounding indicators are essential. Users need to know when the system's answer to a follow-up question is well-supported vs. potentially fabricated.

Sources

Lim & Dey (2009) — Why-Oriented Explanations in Intelligent Systems

studies how users ask "why" questions of context-aware systems and what kinds of answers satisfy them. Foundational for understanding the natural follow-up question structure that inspection dialogue must support

Miller (2019) — Explanation in Artificial Intelligence: Insights from the Social Sciences

establishes that explanation is inherently interactive and conversational, not a static transfer of information. Directly motivates inspection dialogue as a pattern

Ehsan et al. (2021) — Human-Centered Explainable AI

proposes that explanations must be co-constructed through dialogue between user and system, not delivered as pre-packaged outputs