MiddlebrookData & AI Governance← All insights
Master class · Data & AI governance

Data Governance for AI Reporting

By Barry Middlebrook · Middlebrook Data & AI Governance  ·  Download the PDF →

Building the trusted data foundation that AI-driven reporting depends on — the Why, What, Where, Connect, and How, plus a clear-eyed look at the reporting of the future. This is the full blueprint for making AI reporting accurate, explainable, and compliant.

What's inside

  1. Why: the stakes just changed
  2. What: defining the discipline
  3. Where: the points of control
  4. Connect: how AI talks to your data
  5. How: the practitioner's framework
  6. The future of AI reporting
  7. Glossary — 55 terms

Part 1 — Why: the stakes just changed

For thirty years, a flawed report had a human circuit-breaker. An analyst built it, a manager reviewed it, and a wrong number usually got caught before it reached a decision. Reporting was slow — but it was supervised.

AI removes the circuit-breaker. When AI generates a report — or answers "what was Q3 margin in the Western region?" in plain English — it does so instantly, confidently, and at scale, with no analyst in the loop. That is both a gift and a risk:

AI reporting is only as trustworthy as the data governance beneath it. AI doesn't reduce the need for governance — it makes governance the foundation everything stands on.

Part 2 — What: defining data governance for AI reporting

Data governance, classically, is the system of people, policies, processes, and standards that keep data accurate, consistent, secure, discoverable, and trusted — with clear ownership and accountability. What's new for AI reporting is that you must now govern three surfaces, not one:

  1. The data that feeds the AI — the inputs (sources, pipelines, the warehouse/lakehouse). Classic governance, raised stakes.
  2. The interpretation layer — the meaning the AI uses: metric definitions, business glossary, the semantic/metrics layer, master data. This is where AI most often goes wrong: not bad math, but the wrong definition of "revenue" or "active customer."
  3. The AI's outputs — the generated reports, answers, and narratives, which now need their own quality, provenance, and assurance controls.

Core building blocks (the vocabulary)

TermWhat it meansWhy AI reporting needs it
Data qualityAccuracy, completeness, consistency, timeliness, validity, uniquenessAI can't sanity-check; bad data → bad answer, silently
Data lineageThe traceable path of every number from source to outputMakes AI answers explainable & auditable
Metadata & catalogData about the data; a searchable inventorySo AI (and people) find the right, governed asset
Business glossary / semantic layerOne canonical definition of each metric and entityStops the AI inventing its own "revenue"
Master & reference data (MDM)The single trusted version of core entities (customer, product, account)The nouns the AI reasons over must be unambiguous
Data contractsEnforceable agreements on a dataset's schema, meaning, and SLAsCatch breakage before it reaches the AI
Access governanceWho — and which AI agents — may touch whatLeast privilege; PII protection; AI can't leak what it can't see
Output governanceValidation, provenance, and assurance on AI-generated resultsTrust, compliance, and a human check where it matters

The seven forms of AI reporting

"AI reporting" is not one capability — it shows up across the stack in seven forms, each a different way AI touches a number that reaches a decision-maker or a regulator. Every one depends on the same governed foundation:

You don't govern seven things — you govern one foundation, and all seven get safer at once.

Part 3 — Where: the points of control

Governance for AI reporting must live at every stage of the data's journey. Map the lifecycle, then place a control at each handoff:

The Points of Control — a governed data pipeline A left-to-right data pipeline of eight stages — Source, Ingestion, Storage, Modeling, the Semantic / Metrics Layer (the keystone control), AI Access, Output, and Consumption — each with a governance control beneath it. Data flows left to right and a control sits at every handoff. The semantic layer is the single source of truth that AI should be pointed at instead of raw tables. 01 Source owner & contract 02 Ingestion data contracts 03 Storage quality & validation 04 Modeling lineage & metadata KEYSTONE · SOURCE OF TRUTH 05 Semantic / Metrics Layer canonical metrics & glossary 06 AI Access least-privilege access 07 Output validation & provenance 08 Consumption audit & feedback governance checkpoint at every handoff keystone — point the AI here, never raw tables Data flows left → right — a control sits at every handoff.

Where it breaks without governance: the AI reaches past the semantic layer into raw data, blends two sources that define "customer" differently, uses a stale extract, and produces a confident, un-traceable, wrong number — with no one able to say why. Governance is what closes each of those gaps.

Part 4 — Connect: how AI actually talks to your data

There are a few ways to connect an AI to your data — and the right one, for trustworthy reporting, is exactly where a governance background becomes the value-add. From simplest to most governed:

  1. Text-to-SQL. The AI translates a plain-English question ("Q3 margin by region") into a SQL query, runs it, and narrates the result. The LLM is given the schema, writes SQL, an app executes it, and results go back to the LLM to summarize. Risk: it can write wrong joins, pick the wrong table, or invent a definition. Powerful — but it needs guardrails.
  2. RAG (Retrieval-Augmented Generation). For unstructured data (docs, PDFs, tickets). Content is chunked and stored in a vector database; the AI retrieves the most relevant chunks and answers from them. Great for "what does our policy say," weaker for precise numbers.
  3. Semantic / metrics layer in front (the governed way). Instead of letting the AI touch raw tables, you put a semantic layer between them (dbt Semantic Layer, Cube, LookML, or a metrics API). The AI queries defined metrics ("revenue," "active customer") — not raw SQL — so it physically cannot invent a definition. This is the architecture you want for trustworthy AI reporting.
  4. Tool calling / MCP (the modern standard). The AI is given tools (functions) it can call — e.g., a run_query tool, or increasingly an MCP server (Model Context Protocol — the open standard for connecting AI to data sources). The tool executes the query with all your guardrails baked in. This is how production AI-to-data connections are built in 2026.
  5. AI agents. Orchestrate multiple tool calls plus reasoning for multi-step questions ("compare this quarter to last, and explain the drivers").

The reference architecture

How a governed AI-reporting request flows at runtime. Data flows down, results return up, and the highlighted layers are the governance controls you own — the difference between a demo and a system you can put in front of an auditor.

User questionasked in plain English
OrchestratorLLM + agent · MCP client
▼ enforces
Policy & guardrail layerleast privilege · PII masking · query allow-list · row / column security
ConnectorMCP server / guarded text-to-SQL / semantic-layer API
Semantic / metrics layercanonical metric definitions — the single source of truth
Read replica / governed viewsread-only · least privilege · no access to raw production
▲ results return
Output governancevalidation · provenance · confidence signal · human-in-the-loop
Governed answer → user+ audit log: every question, query & answer recorded

Where you come in — the governance guardrails. Anyone can wire an LLM to a database in an afternoon. The reason most organizations can't staff this is that almost nobody knows how to do it safely and correctly. That is the entire value-add: least privilege & read-only (the AI connects through a service account that can only SELECT from governed views — never write, never see raw everything, pointed at a read replica, never prod); the semantic layer, not raw tables, so definitions are canonical; row/column security + PII masking enforced at the connection; query guardrails (allow-lists, validation, row/cost limits, timeouts); provenance + lineage returned with every answer; and a human-in-the-loop for high-stakes outputs. The connection is easy; the governed connection is the skill.

Get the PDF + occasional governance notes

The printable version of this master class, plus the odd practical note on governing AI reporting. No spam; unsubscribe anytime.

Part 5 — How: the practitioner's framework

This is the work. Ten disciplines, in build order.

  1. Ownership & stewardship. Name a data owner for every critical domain and data stewards who maintain it day to day. Stand up a lightweight governance council to set policy and adjudicate definitions. Governance fails as a technology project and succeeds as an accountability structure.
  2. The canonical metrics / semantic layer. Define each business metric and entity once, in writing, with the formula and the owner. Make that the only thing the AI is allowed to query for reporting. This single move eliminates the most common class of AI-reporting error.
  3. Data quality controls. Profile, measure, and monitor against the six dimensions — accuracy, completeness, consistency, timeliness, validity, uniqueness. Publish data-quality scorecards. Where practical, enforce data contracts so upstream breakage is caught automatically, not discovered in a board deck.
  4. Lineage & metadata. Capture end-to-end lineage and maintain a data catalog + business glossary. This is what lets an AI answer cite its sources — and what lets an auditor trace a number from the report back to the system of record. No lineage, no trustworthy AI reporting.
  5. Master & reference data (MDM). Establish the golden record for core entities. The AI must reason over one customer, one product hierarchy, one chart of accounts — not three conflicting versions.
  6. Access & security governance. Apply least privilege to people and AI agents. Classify and protect sensitive/PII data; the safest way to keep AI from leaking data is to ensure it can't access what it shouldn't see in the first place.
  7. Governing the AI layer. Ground the AI in governed data — retrieval over certified sources and the semantic layer (the disciplined version of "RAG"). Add guardrails: the model may only use approved datasets/definitions, must refuse when data is missing rather than guess, and cannot fabricate metrics.
  8. Output governance & assurance. Treat AI outputs as governed artifacts: automated validation against business rules, provenance on every answer (sources + definitions + as-of date), confidence/uncertainty signaling, human-in-the-loop for high-stakes (financial, regulatory, safety) outputs, and an audit trail of every question and answer.
  9. Monitoring & observability. Watch for data drift (inputs changing) and model drift (behavior changing). Set quality SLAs, alert on breaches, and run an incident-response process for bad answers — the same rigor you'd apply to a production outage.
  10. Compliance & auditability. Map your controls to the frameworks: NIST AI RMF (Govern/Map/Measure/Manage), ISO/IEC 42001 (AI management system), the EU AI Act (risk tiers, transparency, human oversight), and DORA (AI as ICT / operational-resilience risk) — plus existing mandates: SOX, sector rules, and your model-risk program (SR 11-7), which must now be extended to cover the generative and agentic AI it originally excluded. Build the evidence trail as you operate, not in a fire drill before an audit.

A maturity model

LevelStateAI reporting you can trust?
1 — Ad hocNo ownership; definitions vary by spreadsheetNo — AI amplifies the chaos
2 — ReactiveGovernance only after something breaksRisky — you find errors after the decision
3 — DefinedOwners, glossary, quality rules documentedPartially — for governed domains
4 — ManagedQuality measured, lineage captured, semantic layer liveYes — for certified data
5 — AI-trustedOutput governance, monitoring, audit-ready, self-serviceYes — trusted, explainable, at scale

A 90-day starter playbook

Part 6 — The future: AI reporting, done right

Here is the end state this all builds toward. Reporting becomes conversational and self-serve. A leader asks, in plain English, "How did margin trend by region last quarter, and what drove the change?" — and gets an immediate, correct, narrated answer. No ticket to the BI team. No week of waiting. And — because it's governed — that answer is trustworthy:

The human role shifts — and rises. People stop hand-building reports. Their job becomes governing the system that generates them: owning the definitions, curating the trusted data, setting the guardrails, monitoring quality, and standing behind the numbers. The data governance professional moves from back-office to mission-critical — because in a world where AI generates the insights, the scarcest, most valuable thing is the person who can guarantee the insights are right.

Trust becomes the product. Two companies will have the same AI models. The one whose AI reporting is governed — accurate, explainable, compliant — will out-decide the one whose AI confidently makes things up. Governance is the moat.

AI did not make data governance obsolete. It made it the foundation everything else stands on. The organizations — and the professionals — who master Data Governance for AI Reporting won't just survive the shift to AI. They'll be the ones everyone else has to trust.

Want this assessed for your institution?

Take the free 4-minute readiness assessment to see where your AI-reporting gaps are — or book a call to scope a full, expert-led review.

Take the free assessment See pricing & engagement →

Glossary — tech & business terms

Plain-language definitions of every technical and business term used in this master class.

Access governance
Rules controlling who — and which systems or AI agents — may read or use which data; enforced through least privilege.
Agentic AI
AI that plans and takes multi-step actions autonomously — querying, computing, assembling, sometimes acting — rather than answering once. The most powerful and hardest-to-govern form of AI reporting.
AI agent
An AI system that plans and takes multi-step actions by calling tools (e.g., querying data) to reach a goal, rather than answering once.
Audit trail
A time-stamped, logged record of who asked what, which data was used, and what was produced — required for compliance and investigations.
Business glossary
The agreed, plain-language definitions of business terms and metrics (e.g., what "active customer" means), owned by the business.
CFPB
U.S. Consumer Financial Protection Bureau; its consumer-finance reporting and data rules govern mortgage and lending data.
Confidence signal
An indication of how certain an AI is in an answer (and when it lacks governed data), so users know when to trust or verify it.
Connector
The component linking the AI to the data — an MCP server, a guarded text-to-SQL engine, or a semantic-layer API.
Data catalog
A searchable inventory of an organization's data assets and their metadata, so people and AI can find the right, governed data.
Data contract
An enforceable agreement defining a dataset's schema, meaning, quality, and delivery — so upstream changes don't silently break downstream use.
Data drift
A change over time in input-data patterns or distribution that can quietly degrade the accuracy of reports and AI outputs.
Data governance
The people, policies, processes, and standards that keep data accurate, consistent, secure, discoverable, and trusted, with clear ownership.
Data lineage
The traceable path of data from source through every transformation to final use; how any number can be traced back to its origin.
Data owner
The person accountable for a data domain — sets policy and standards and answers for its quality and proper use.
Data quality
The fitness of data for use, measured across six dimensions: accuracy, completeness, consistency, timeliness, validity, and uniqueness.
Data steward
The person who maintains a data domain day to day — applying standards, resolving issues, and curating definitions.
DAMA-DMBOK
The Data Management Body of Knowledge from DAMA International; the industry-standard framework of data-management disciplines.
DORA
The EU Digital Operational Resilience Act; brings ICT — and now AI — risk, third-party dependencies, incident reporting, and resilience testing under one operational-resilience regime for financial entities.
ETL / ELT
Extract-Transform-Load (or Extract-Load-Transform); the pipelines that move and reshape data from sources into a warehouse or lakehouse.
EU AI Act
European Union regulation that classifies AI systems by risk and imposes transparency, human-oversight, and accountability obligations.
Golden record
The single, authoritative, de-duplicated version of a core entity (e.g., one true customer record) produced by master data management.
Governance council
A cross-functional group that sets data and AI governance policy, approves definitions, and resolves disputes.
Governed views
Curated, permission-controlled database views the AI is allowed to read — instead of raw tables.
Hallucination
When an AI produces confident but false or fabricated information — e.g., inventing a metric or a number unsupported by the data.
Human-in-the-loop
A control requiring a person to review or approve an AI output before it is acted on; used for high-stakes decisions.
ISO/IEC 42001
The international management-system standard for governing artificial intelligence responsibly across its lifecycle.
KPI
Key Performance Indicator — a defined metric used to measure progress against a business objective.
Lakehouse / warehouse
Central, governed stores for analytical data; a warehouse is structured, a lakehouse blends that structure with data-lake flexibility.
Least privilege
The security principle of granting the minimum access needed — applied to people and to AI / service accounts alike.
LLM
Large Language Model — the AI (e.g., Claude, GPT) that understands and generates language and powers conversational reporting.
Master data
The core, shared business entities — customers, products, accounts — that must be consistent across every system.
MDM
Master Data Management — the discipline and tooling that create and maintain one trusted golden record per master-data entity.
MCP
Model Context Protocol — an open standard for securely connecting AI models to data sources and tools through governed "servers."
Metadata
Data about data — names, definitions, formats, owners, lineage — that makes data findable and understandable.
Model drift
A decline in an AI model's performance over time as the world or the data changes from what it was built on.
Model risk / SR 11-7
The discipline — and U.S. supervisory guidance — for validating, monitoring, and documenting models. Its revised guidance excludes generative and agentic AI, leaving a gap firms must close by extending model risk to cover them.
NIST AI RMF
The U.S. NIST AI Risk Management Framework; a voluntary structure (Govern, Map, Measure, Manage) for managing AI risk.
Observability
Continuous monitoring of data and AI systems — quality, freshness, drift, errors — so problems are caught early.
Orchestration
The layer that coordinates the AI's steps and tool calls (the application, agent, or MCP client) between the user and the data.
Output governance
Controls applied to AI-generated results: validation, provenance, confidence signaling, audit logging, and human review.
PII
Personally Identifiable Information — data that can identify an individual; subject to privacy law and strict access controls.
Provenance
The record of where a result came from — which sources, definitions, and as-of date — attached to an output so it can be verified.
RAG
Retrieval-Augmented Generation — a pattern where the AI retrieves relevant data or documents and uses them as grounding context.
Read replica
A read-only copy of a production database used for queries and reporting, so the live system isn't risked or slowed.
Reference data
Standardized lookup values (country codes, status codes, categories) used consistently across systems.
Row/column-level security
Access rules restricting which rows or columns a user or AI can see within a dataset (e.g., only their region; mask SSNs).
Schema
The structure of a database: its tables, columns, data types, and relationships.
Semantic / metrics layer
A governed layer that defines business metrics and entities once, so every query — human or AI — uses the same canonical definitions.
Single source of truth
One authoritative, agreed place for a given piece of data or metric, so everyone (and every AI) gets the same answer.
SLA
Service-Level Agreement — a committed standard (e.g., data freshness or quality threshold) a data product must meet, with alerts on breach.
SOX
Sarbanes-Oxley — U.S. law requiring accurate, controlled, auditable financial reporting; a major driver of strong data controls in finance.
SQL
Structured Query Language — the standard language for querying and manipulating data in relational databases.
Text-to-SQL
A pattern where the AI converts a plain-English question into a SQL query, runs it, and returns or narrates the result.
Tool calling
The mechanism by which an AI invokes a defined function (e.g., "run this query") to act, rather than only generating text.
Vector database
A database that stores data as numeric embeddings for similarity search; the retrieval engine behind RAG.