See also (wiki): wiki/model-risk-management.md, wiki/ai-vendor-contracts.md, wiki/vendor-security-questionnaires.md
Executive Summary
- Banks deploying AI must satisfy SR 11-7 model risk management requirements. The framework contains no standard questionnaire — each institution builds its own, typically 50–150 questions for a material AI model, drawing from the OCC Comptroller’s Handbook and now the U.S. Treasury’s new Financial Services AI Risk Management Framework (230 control objectives across seven categories, published February 19, 2026).
- Insurers face a parallel regime. The NAIC launched a 12-state AI Systems Evaluation Tool pilot in March 2026 (running through September 2026) with four structured exhibits covering AI usage inventory, governance, high-risk system detail, and data lineage. Adoption is expected at the NAIC fall meeting in November 2026.
- The common vendor failure mode is identical in both regimes: training data lineage. When an examiner or regulator asks “where did the data come from that trained this model, and do you have legal rights to use it?” — most AI vendors cannot produce a complete answer. Explainability documentation and bias-testing evidence are close seconds.
- The FS-ISAC (Financial Services Information Sharing and Analysis Center) published a two-tier generative AI vendor evaluation template: Level 1 (basic, ~15 questions for R&D/low-risk use) and Level 2 (comprehensive, covering legal, regulatory, model validation, and vendor moderation for production deployments).
- Time-to-complete for a full SR 11-7 model validation cycle on a material AI system runs 3–6 months at large banks and 6–12 months at mid-market institutions with smaller model risk management teams. The NAIC pilot is asking insurers to respond within the March–September 2026 window, effectively a six-month timeline for first-time AI governance documentation.
The SR 11-7 Questionnaire Problem
SR 11-7 does not prescribe a standard questionnaire. It prescribes three pillars — model development documentation, independent validation, and governance — and leaves implementation to each institution. The result: every bank builds its own model risk questionnaire, and every AI vendor answers a different version at every customer.
For traditional models (credit scorecards, interest rate models), this worked. The models were stable, the math was documented, and a competent model risk management (MRM) team could validate in weeks.
AI breaks this in five ways that the existing SR 11-7 corpus file (Pass 93) documents in detail. What matters for procurement is the questionnaire consequence: each of those five failure modes generates a new category of questions that vendors must answer, and most cannot.
What Examiners Actually Ask About AI
Based on the OCC Comptroller’s Handbook (August 2021) and the February 2026 Treasury FS AI Risk Management Framework, the questions cluster into seven categories:
| Category | Typical Questions | Common Vendor Failure |
|---|---|---|
| Training data lineage | Source, legal rights, consent, PII handling, refresh cadence | Cannot produce end-to-end lineage for foundation models |
| Model explainability | Decision path documentation, feature importance, counterfactual analysis | “It’s a transformer” is not an answer; examiners want decision-path documentation for individual outputs |
| Bias and fairness testing | Protected-class impact analysis, disparate impact metrics, fair lending compliance | Testing against internal benchmarks only — no third-party audit |
| Drift monitoring | Performance degradation thresholds, retraining triggers, concept drift detection | No continuous monitoring; quarterly spot checks at best |
| Third-party concentration | Foundation model provider dependencies, single-vendor exposure, API availability SLAs | 80%+ of enterprise AI deployments depend on 3–4 foundation model providers |
| Incident response | AI-specific incident definition, escalation paths, model rollback procedures | AI incidents lumped into general IT incident management |
| Governance and oversight | Board reporting, MRM committee sign-off, model inventory inclusion | AI tools deployed outside model inventory (“shadow models”) |
The 230-Control-Objective Framework
The U.S. Treasury’s Financial Services AI Risk Management Framework, published February 19, 2026, is the most comprehensive AI questionnaire framework in U.S. financial regulation. Developed with 100+ financial institutions, the Financial Services Sector Coordinating Council (FSSCC), and the Cyber Risk Institute (CRI), it includes:
- AI Adoption Stage Questionnaire — a maturity self-assessment that determines which controls apply
- Risk and Control Matrix — 230 mapped control objectives across governance, data, model development, validation, monitoring, third-party risk, and consumer protection
- Guidebook — implementation guidance
- Control Objective Reference Guide — a 400+ page document with evidence examples for each control
The framework integrates with NIST Cybersecurity Framework, enterprise risk management, and SOC 2. For a mid-market bank ($1B–$50B assets) deploying its first production AI system, the 230 controls are the new floor — and most institutions have mapped fewer than 50 of them today.
Who Signs Off
At a bank, the sign-off chain for a material AI model typically runs: model developer → independent validation team (or third-party validator) → MRM committee → Chief Risk Officer → Board risk committee (for material models). The MRM committee meets monthly or quarterly. A single AI model that misses a committee cycle adds 30–90 days to the deployment timeline.
The NAIC Insurance Regime
The NAIC Model Bulletin on Use of AI by Insurers (adopted December 2023) established principles-based expectations for AI governance. In 2025–2026, the NAIC’s Big Data and Artificial Intelligence Working Group operationalized those principles through the AI Systems Evaluation Tool — a structured questionnaire regulators use during market conduct and financial examinations.
The Evaluation Tool Structure
| Exhibit | Focus | What Regulators Want |
|---|---|---|
| A | AI usage inventory | Quantify every AI system in use, including material financial impact and solvency implications |
| B | Governance framework | Risk assessment structure, committee oversight, policy documentation |
| C | High-risk AI systems | Detailed documentation for systems that could cause serious consumer or financial harm |
| D | Data details | Data lineage, quality checks, source documentation, reasonable accommodations |
The 12-State Pilot
The pilot launched March 2, 2026, across California, Colorado, Connecticut, Florida, Iowa, Louisiana, Maryland, Pennsylvania, Rhode Island, Vermont, Virginia, and Wisconsin. Monthly coordination calls among states will inform tool updates in September–October 2026, with formal adoption expected at the November 2026 NAIC fall meeting.
The NAIC is also developing a Third-Party Data and Models Working Group with a broad definition of “third party” encompassing any nongovernmental entity providing data, models, or outputs for insurance activities. A model law on third-party oversight is anticipated in 2026, potentially including vendor licensing requirements.
What Trips Up Insurers
The Monitaur analysis of the pilot identifies four friction points:
- Data hygiene documentation complexity — Exhibit D requires detail on lineage and quality checks that most insurers have never assembled for their AI vendors
- Vendor evaluation authority — Ambiguity about how far an insurer can audit its AI vendor’s underlying model
- Jurisdictional inconsistency — States can modify questions, creating a patchwork compliance burden
- AI vs. general data governance — Distinguishing AI-specific evaluation from existing data governance is harder than it sounds
The FS-ISAC Vendor Evaluation Template
The Financial Services Information Sharing and Analysis Center (FS-ISAC) published a Generative AI Vendor Evaluation and Qualitative Risk Assessment tool — the closest thing to a standardized AI vendor questionnaire in financial services. It uses a two-tier model:
Level 1 (lower-risk engagements — R&D, educational, sandboxed): Basic questions covering GenAI use scope, data privacy notices, foundation model identification, and information security. Approximately 15 core questions.
Level 2 (production deployments with business-process integration or confidential data): Comprehensive assessment adding legal and regulatory compliance, vulnerability management, model validation procedures, and vendor content moderation policies.
The Atlas Systems framework (2026) proposes a similar 15-question core across seven categories: AI usage and scope, data handling, model governance and explainability, security and access controls, compliance, operational resilience, and third-party dependencies.
Key Data Points
| Metric | Value | Source | Date |
|---|---|---|---|
| Control objectives in Treasury FS AI RMF | 230 across 7 categories | U.S. Treasury / FSSCC / CRI | Feb 2026 |
| Reference guide length | 400+ pages | U.S. Treasury FS AI RMF | Feb 2026 |
| NAIC pilot states | 12 | NAIC / Fenwick | Mar 2026 |
| NAIC pilot duration | Mar–Sep 2026 | NAIC | Mar 2026 |
| NAIC evaluation tool exhibits | 4 (A–D) | NAIC | 2026 |
| FS-ISAC Level 1 core questions | ~15 | FS-ISAC | 2024 |
| Organizations reporting AI-related breaches | 13% | IBM | 2025 |
| Breached orgs lacking AI governance policies | 63% | IBM | 2025 |
| AI breaches where orgs lacked AI access controls | 97% | IBM | 2025 |
| Typical SR 11-7 AI validation cycle (large bank) | 3–6 months | Industry practice | 2025–2026 |
| Typical SR 11-7 AI validation cycle (mid-market) | 6–12 months | Industry practice | 2025–2026 |
What This Means for Your Organization
The questionnaire burden is now the binding constraint on AI vendor deployment timelines in regulated industries — not the technology evaluation, not the budget approval, not even the security review. A bank or insurer that starts an AI vendor engagement without understanding the model risk questionnaire requirements will discover them 3–6 months into the process, after the business case has been approved and the vendor has been selected.
Three things to do before the next AI vendor conversation:
-
Map your questionnaire exposure. If you are a bank or bank service provider, the Treasury FS AI RMF’s 230 control objectives are your preparation checklist. If you are an insurer in one of the 12 pilot states, request a copy of the NAIC AI Systems Evaluation Tool exhibits and begin populating them now — before an examiner asks.
-
Pre-build your vendor’s answers. The most common failure mode is sending the questionnaire to the vendor and waiting. Most AI vendors have never completed a model risk questionnaire. Draft the answers yourself based on the vendor’s published documentation, then ask the vendor to confirm or correct. This cuts months from the cycle.
-
Budget for independent validation. SR 11-7 requires independent validation of any model used for material decisions. For AI, “independent” increasingly means a third-party firm with AI-specific validation expertise — not an internal team that validated credit scorecards last year. Budget $50K–$200K per material model depending on complexity.
If the questionnaire layer is where your next AI deployment is stuck — or where you suspect it will get stuck — that is a conversation worth having. Reach out at brandon@brandonsneider.com.
Sources
-
U.S. Federal Reserve, “Supervisory Letter SR 11-7: Guidance on Model Risk Management,” April 4, 2011. https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm. Credibility: HIGH — primary regulatory source.
-
U.S. Department of Treasury / FSSCC / Cyber Risk Institute, “Financial Services AI Risk Management Framework,” February 19, 2026. Via Lowenstein Sandler analysis: https://www.lowenstein.com/news-insights/publications/client-alerts/financial-services-ai-risk-management-framework-operationalizing-the-230-control-objectives-before-the-market-wakes-up-data-privacy. Credibility: HIGH — federal framework developed with 100+ institutions.
-
NAIC, “Model Bulletin on the Use of Artificial Intelligence Systems by Insurers,” December 2023. https://content.naic.org/sites/default/files/cmte-h-big-data-artificial-intelligence-wg-ai-model-bulletin.pdf.pdf. Credibility: HIGH — primary regulatory source.
-
Fenwick & West LLP, “NAIC Expands AI Systems Evaluation Tool Pilot Program to 12 States,” 2026. https://www.fenwick.com/insights/publications/naic-expands-ai-systems-evaluation-tool-pilot-program-to-12-states-key-updates-for-insurers-and-ai-vendors-supporting-insurers. Credibility: HIGH — Am Law 100 firm analysis of primary source.
-
GARP, “SR 11-7 in the Age of Agentic AI: Where the Framework Holds – and Where It Strains,” February 2026. https://www.garp.org/risk-intelligence/operational/sr-11-7-age-agentic-ai-260227. Credibility: HIGH — independent risk management professional association.
-
FS-ISAC, “Generative AI Vendor Evaluation & Qualitative Risk Assessment,” 2024. https://www.fsisac.com/hubfs/Knowledge/AI/FSISAC_GenerativeAI-VendorEvaluation&QualitativeRiskAssessment.pdf. Credibility: HIGH — financial services industry consortium.
-
Monitaur, “NAIC AI Systems Evaluation Tool Pilot: A Guide for Insurers,” 2026. https://www.monitaur.ai/blog-posts/naic-ai-systems-evaluation-tool-pilot-a-guide-for-insurers. Credibility: MEDIUM — AI governance vendor, but substantive analysis.
-
Plante Moran, “How the NAIC AI Model Bulletin Is Evolving,” March 2026. https://www.plantemoran.com/explore-our-thinking/insight/2026/03/how-the-naic-ai-model-bulletin-is-evolving. Credibility: MEDIUM — accounting/advisory firm analysis.
-
Atlas Systems, “AI Vendor Risk Assessment Questionnaire for Compliance (2026),” 2026. https://www.atlassystems.com/blog/ai-vendor-risk-questionnaire. Credibility: MEDIUM — vendor-published template, but representative of industry practice.
-
IBM, “Cost of a Data Breach Report,” 2025. Via Atlas Systems. Credibility: HIGH — annual benchmark study (vendor-published but widely cited as independent).
Brandon Sneider | brandon@brandonsneider.com April 2026