EAAPLEnterprise AI Architecture Pattern Library
EAAPLLibraryHuman-in-the-LoopEAAPL-HIL001
EAAPL-HIL001Proven
⇄ Compare

Human Approval Gateway

👁️ Human-in-the-LoopEU AI ActISO/IEC 42001🏭 Field-tested in AU

[EAAPL-HIL001] Human Approval Gateway

Category: Human-in-the-Loop
Sub-category: Oversight / Approval Control
Version: 1.3
Maturity: Mature
Tags: human-oversight, approval-workflow, high-stakes-AI, risk-scoring, audit-trail, EU-AI-Act-Art14
Regulatory Relevance: EU AI Act Articles 14 & 29, APRA CPS 230, ISO 42001 §8.5, NIST AI RMF MANAGE-1.3, GDPR Article 22


1. Executive Summary

High-stakes AI systems — those authorising transactions, making clinical recommendations, or influencing consequential administrative decisions — must not operate with full autonomy. The Human Approval Gateway pattern establishes a mandatory checkpoint between AI recommendation and consequential action, ensuring a qualified human reviews, approves, or rejects AI-proposed actions before they take effect.

This pattern is the foundational implementation of EU AI Act Article 14's human oversight requirement for high-risk AI systems. It defines the approval trigger logic, approval queue design, timeout handling (with configurable responses: escalate, cancel, or time-limited auto-approve with full audit), approval authority levels, and the immutable audit trail required by regulators.

Organisations that implement this pattern have passed EU AI Act conformity assessments, satisfied APRA supervisory expectations for AI in credit decisioning, and reduced AI-driven operational incidents by 40–70% by catching model errors before they produce irreversible consequences.

Target audience: Chief Risk Officers, Chief Compliance Officers, AI Governance leads, Enterprise Architects.


2. Problem Statement

Business Problem

AI systems in consequential domains can cause significant harm when they err without human review. A loan declined by a miscalibrated model, a clinical alert suppressed by a misconfigured AI, or a fraud block applied to a legitimate high-value customer — these outcomes damage customer relationships, create legal liability, and attract regulatory action.

Technical Problem

  • Many AI systems are designed as straight-through processors: AI decision → action, with no human in the loop.
  • Where manual review exists, it is often implemented as an afterthought — a separate queue tool with no integration to the AI decision context, poor UX, and no audit trail.
  • Timeout handling is undefined: if no human responds within SLA, the system either blocks indefinitely or silently auto-approves.
  • Approval authority levels are not enforced: any available agent may approve high-risk decisions regardless of qualifications.
  • The audit trail captures only the final approval but not the AI recommendation, confidence score, or supporting rationale reviewed by the approver.

Symptoms

  • AI errors causing irreversible customer harm that would have been caught by human review.
  • Regulatory audit finds no evidence of human oversight for automated consequential decisions.
  • Approval queue backlogs causing SLA breaches and frustrated customers.
  • Post-incident investigation cannot reconstruct what the human reviewer saw at the time of approval.
  • Different approvers applying inconsistent standards because approval guidelines are not embedded in the review interface.

Cost of Inaction

Dimension Impact
Regulatory EU AI Act Art. 14 violation; APRA enforcement; GDPR Art. 22 breach
Legal Class action from wrongful AI decisions without human oversight
Customer Irreversible harm from AI errors; trust destruction
Operational Uncontrolled AI driving consequential actions without audit trail

3. Context

When to Apply

  • AI systems classified as high-risk under EU AI Act Annex III (credit, employment, law enforcement, clinical, essential services).
  • AI actions that are consequential, difficult to reverse, or have significant financial/health/safety impact.
  • AI systems where model confidence is insufficient for full automation (confidence below 90th percentile threshold).
  • Regulatory or policy mandate for human-in-the-loop (APRA prudential requirement, clinical governance).
  • AI systems operating in domains with active judicial review (immigration, benefits, credit).

When NOT to Apply

  • Trivial, easily reversible AI actions (content formatting, auto-tagging, draft generation).
  • AI systems with demonstrated near-perfect accuracy over a large sample (≥99.5% accuracy) for specific low-risk subtask categories, with formal risk-based exemption documented.
  • Situations where human review latency would cause harm greater than the AI error rate (emergency medical AI where delay kills — use audit-based oversight instead).

Prerequisites

Prerequisite Minimum Viable Preferred
AI confidence scoring Binary pass/fail Calibrated probability score
Approval workflow tooling Email + manual queue Purpose-built approval UI + queue management
Authority level definition Informal Role-based authority matrix
Audit log Basic event log Immutable, structured audit log
Escalation path Informal supervisor escalation Automated escalation with SLA tracking

Industry Applicability

Industry Applicability Driver
Financial Services (credit) Mandatory EU AI Act Annex III; APRA; GDPR Art. 22
Healthcare (clinical AI) Mandatory EU AI Act Annex III; clinical governance; liability
Government (benefits, immigration) Mandatory EU AI Act Annex III; public law; judicial review
Insurance (claims) High Consumer protection; claims dispute risk
Employment / HR High EU AI Act Annex III; employment law
Law Enforcement Mandatory (with restrictions) EU AI Act Annex III; civil liberties

4. Architecture Overview

Design Philosophy

The Human Approval Gateway is designed on four principles: no autonomous action without approved authority, full context for the approver, defined timeout handling, and immutable comprehensive audit.

Approval Trigger Logic. Not every AI output requires human approval — that would be unworkable and provide no benefit for low-risk routine outputs. The gateway applies trigger logic based on three dimensions: (1) Risk score: AI assigns a risk score to each proposed action; actions above the configured risk threshold are routed to the approval queue. The threshold is calibrated to balance oversight burden (approver workload) against risk coverage. (2) Action type: Certain action types always require approval regardless of confidence (e.g., any adverse decision under GDPR Art. 22; any action with regulatory notification obligation). (3) Confidence threshold: If the AI model's calibrated confidence score falls below a configured threshold (e.g., <85% confidence), the action is routed for human review even if the risk score is moderate.

Context Package Design. The approver sees a structured Context Package — not a raw model output. The Context Package contains: the AI recommendation and confidence score; the key factors driving the recommendation (model explanation, e.g., SHAP values in plain language); the entity profile (customer, patient, applicant); relevant historical context; applicable policy and guidelines; the action to be taken if approved; and the consequences of approval vs. rejection. This design prevents approver over-reliance on the AI recommendation by ensuring the approver has sufficient context to form an independent judgment.

Approval Queue Management. The queue implements priority-based routing: high-risk and time-sensitive cases are prioritised; within priority bands, FIFO ordering ensures fairness. Approvers are matched to cases by authority level (junior agents handle standard cases; senior agents handle high-value or complex cases). The queue tracks SLA countdown per item; approaching SLA triggers escalation.

Timeout Handling. Three configurable timeout responses, selected by action type and risk level: (a) Escalate: When the primary approver SLA expires, the case escalates to a senior approver. This is the preferred response for high-risk cases. (b) Cancel: The proposed action is cancelled; the entity is notified that the request is under review. Used when the action is not time-critical and the consequences of auto-approval are severe. (c) Time-limited auto-approve with audit: For time-critical, lower-risk actions, the system auto-approves after SLA expiry with an explicit audit log entry noting the timeout auto-approval. This must be explicitly enabled; it is not the default.

Audit Trail. Every approval event generates an immutable audit record containing: action ID, AI recommendation, confidence score, context package snapshot (what the approver saw), approver identity and authority level, decision (approve/reject/modify), decision rationale (free text + structured reason code), decision timestamp, elapsed review time, and whether the decision aligned with or overrode the AI recommendation. Override decisions are flagged for pattern analysis (systematic overrides → model retraining signal).


5. Architecture Diagram

ARCHITECTURE DIAGRAM
flowchart TD subgraph AISystem["AI System"] AI1[AI Model + Risk Scorer] AI2[Context Packager] end subgraph Queue["Approval Queue"] AQ1{Trigger Check} AQ2[Priority + Authority Routing] AQ3[SLA Monitor] end subgraph Human["Human Review"] UI1[Approver Interface] TH1[Timeout Handler] end subgraph Outcome["Outcome and Audit"] AE1[Action Executor] AL1[(Immutable Audit Log)] end AI1 --> AI2 AI2 --> AQ1 AQ1 -->|low risk auto| AE1 AQ1 -->|requires approval| AQ2 AQ2 --> UI1 AQ3 -->|SLA breach| TH1 UI1 -->|decision| AE1 TH1 -->|escalate or auto| AE1 AE1 --> AL1 style AI1 fill:#dbeafe,stroke:#3b82f6 style AI2 fill:#dbeafe,stroke:#3b82f6 style AQ1 fill:#f3e8ff,stroke:#a855f7 style AQ2 fill:#f0fdf4,stroke:#22c55e style AQ3 fill:#fee2e2,stroke:#ef4444 style UI1 fill:#f0fdf4,stroke:#22c55e style TH1 fill:#fee2e2,stroke:#ef4444 style AE1 fill:#d1fae5,stroke:#10b981 style AL1 fill:#fef9c3,stroke:#eab308

6. Components

Component Type Responsibility Technology Options Criticality
Risk Scorer Processing Assigns risk score to each AI action based on action type, entity risk profile, model confidence Custom rule engine, ML risk scorer, FICO-style rules Critical
Context Packager Processing Assembles context package: SHAP explanation, entity profile, guidelines, action summary Custom Python; SHAP/LIME library; templated rendering High
Approval Queue Messaging + Storage Holds pending approvals; priority ordering; SLA tracking; authority routing AWS SQS, Azure Service Bus, custom PostgreSQL queue Critical
SLA Monitor Processing Tracks approval SLA per queue item; fires escalation or timeout actions Custom scheduler, AWS EventBridge rules, Celery beat High
Approver UI Application Displays context package; captures decision and rationale; enforces authority levels Custom React/Angular app; ServiceNow; Salesforce Flow Critical
Authority Enforcer Middleware Validates that the approving user has required authority level for the action RBAC integration; IAM policy evaluation Critical
Action Executor Service Executes the approved (or timeout-auto-approved) action in downstream systems Custom integration layer; webhook; API call Critical
Immutable Audit Log Storage Stores complete approval event records; tamper-evident AWS S3 Object Lock, Azure Immutable Blob, custom append-only DB Critical
Override Analytics Processing Detects systematic override patterns; generates model retraining signals Custom Python analytics; MLflow experiment tracking High
Escalation Manager Processing Routes escalated cases to appropriate senior approver; tracks escalation chain Workflow engine (Camunda, ServiceNow), custom High

7. Data Flow

Primary Flow

Step Actor Action Output
1 AI System Generates recommendation with confidence score and risk score AI output with metadata
2 Trigger Logic Evaluates risk score, action type, confidence against configured thresholds Route to queue or auto-execute
3 Context Packager Assembles approver context package: explanation, entity profile, policy Structured context package
4 Approval Queue Enqueues case with priority and SLA; routes to appropriate approver by authority level Queued approval item with SLA countdown
5 Approver Reviews context package; forms independent judgment Mental approval decision
6 Approver UI Approver selects decision; enters rationale; submits Decision record
7 Authority Enforcer Validates approver authority level for this action type Authorised decision
8 Action Executor Executes approved action in downstream system Outcome (transaction, decision, notification)
9 Audit Log Records complete approval event: AI rec + context + decision + rationale + timing Immutable audit record
10 Override Analytics Analyses decisions; flags systematic overrides as model retraining signals Retraining signal or operational insight

Error Flow

Error Condition Trigger Response Recovery
Approver queue unavailable Service failure AI actions blocked until queue restored; alert fired Queue HA deployment; failover queue; no auto-execution during outage
SLA breach before timeout handler fires Monitor missed Audit trail records SLA breach; manual investigation Monitor SLA with redundant alert mechanisms
Approver attempts action outside authority level Authority enforcer rejects Action blocked; approver notified; escalation offered Review authority matrix; ensure appropriate approvers available
Action Executor fails after approval Downstream system failure Approval record exists; action not executed; alert fired Retry with idempotency key; reconciliation process

8. Security Considerations

Authentication & Authorisation

  • Approvers authenticate via SSO/MFA; session tokens expire every 4 hours.
  • Authority level enforcement: RBAC roles mapped to action types; enforced by Authority Enforcer component.
  • Separation of duties: the AI system cannot approve its own recommendations; approvers cannot modify the AI context package.

Secrets Management

  • Downstream system integration credentials managed in secrets manager; not exposed to approver UI.

Data Classification

  • Context packages contain entity personal and financial data; classified as Confidential.
  • Audit log contains sensitive decision data; classified as Confidential, retained per regulatory schedule.

Encryption

  • Approval queue messages encrypted in transit (TLS 1.3) and at rest.
  • Audit log encrypted at rest (AES-256); immutable storage (Object Lock).

Auditability

  • Every action from submission to approval to execution is logged with actor, timestamp, and decision.
  • Audit log tamper-evident (hash chaining or Object Lock).
  • All timeout auto-approvals explicitly logged with reason code "TIMEOUT_AUTO_APPROVE".

OWASP LLM Top 10 Mapping

OWASP LLM Risk Relevance Mitigation
LLM09: Overreliance Approvers rubber-stamp AI recommendations without independent review Context package design forces approvers to see entity data directly; approval rationale mandatory
LLM01: Prompt Injection Malicious entity data in context package could manipulate approver presentation Context package renders structured data, not free text; HTML/prompt injection sanitised
LLM02: Insecure Output Handling AI recommendation acted upon without validation Gateway is the validation gate; no action executes without approval
LLM06: Sensitive Information Disclosure Context package exposes entity PII to approvers Access control on approval queue; approver role must be authorised for entity data class

9. Governance Considerations

Responsible AI and EU AI Act Article 14 Compliance

EU AI Act Article 14 requires that high-risk AI systems are "designed and developed in such a way, including with appropriate human-machine interface tools, that they can be effectively overseen by natural persons during the period in which the AI system is in use." This pattern is the direct implementation of Article 14.

Specifically, Article 14 requires: (a) ability to understand AI capabilities and limitations — addressed by the context package confidence score and explanation; (b) ability to disregard, override, or intervene — addressed by the approve/reject/modify decision panel; (c) ability to interrupt operation via stop buttons — addressed by the cancel and defer actions; (d) assign oversight tasks to persons with competence and authority — addressed by the authority level enforcement.

Model Risk Management

  • Systematic override rate (>20% of decisions in a category) triggers mandatory model review.
  • Approval SLA breach rate (>5%) triggers capacity and workflow review.

Human Approval Checkpoints

  • Gateway design (thresholds, authority matrix, timeout policy) approved by AI Risk Committee.
  • Authority matrix reviewed quarterly; updated when action types change.
  • Threshold adjustments require risk committee sign-off.

Governance Artefacts

Artefact Owner Cadence Purpose
Approval Audit Log Compliance Continuous Immutable record of all approval decisions; regulatory submission
Override Rate Report AI Governance Monthly Proportion of AI recommendations overridden; by action type
Authority Matrix Risk Committee Quarterly Documents which roles may approve which action types
Timeout Policy AI Risk Committee Annual Defines timeout responses per action type; signed off by CRO
SLA Compliance Report Operations Weekly Approval SLA achievement; queue backlog; escalation rate
EU AI Act Conformity Documentation Compliance Annual Documents Art. 14 implementation; oversight mechanism description

10. Operational Considerations

Monitoring

Metric Alert Threshold Tooling
Queue depth >100 items (standard); >20 items (high-risk) Queue metrics + Grafana
Approval SLA achievement <90% SLA monitor + PagerDuty
Escalation rate >10% Workflow metrics
Auto-approve (timeout) rate >5% Audit log analytics
Override rate per AI model >20% for any action category Override analytics
Approver throughput (decisions/hour) Below capacity threshold Workforce management system

SLOs

SLO Target Measurement
Standard case approval SLA ≤4 business hours Queue SLA countdown
High-priority case approval SLA ≤30 minutes Queue SLA countdown
Context package assembly latency <2 seconds Context Packager metrics
Approval queue availability 99.95% Health check

Logging

  • All approval events logged in structured JSON; retained 10 years (EU AI Act Art. 12 minimum for high-risk AI).
  • Logs include full context package snapshot; approver identity; decision; rationale; timing.

Incident Management

  • Approval queue outage → P1; AI system operates in hold mode (no actions until queue restored).
  • Override rate spike (>30% in 1 hour) → P2; AI Governance lead notified; model review triggered.

Disaster Recovery

Component RTO RPO Strategy
Approval Queue 15 minutes 0 Active-passive queue replica; dead-letter queue for in-flight items
Approver UI 30 minutes N/A Multi-AZ deployment; read-only fallback shows pending items
Audit Log 4 hours 0 Synchronous write to append-only store; cross-region replication

11. Cost Considerations

Cost Drivers

Cost Driver Typical Range Notes
Approval queue infrastructure $200–$2,000/month Managed queue service; scales with throughput
Approver UI development $50,000–$200,000 one-time Custom build; significant UX investment warranted
Audit log storage $50–$1,000/month 10-year retention; scales with decision volume
Approver workforce cost Dominant cost $40–$150/hour per approver; scales with queue volume
Override analytics $200–$1,000/month Custom analytics pipeline

Optimisations

  • Reduce approver volume by tuning risk threshold (fewer items in queue); validate with risk committee.
  • Automate low-complexity context package assembly to reduce approver cognitive load and review time.
  • Batch similar cases for a single approver session to reduce context-switching overhead.
  • Use confidence calibration to minimise mid-confidence routing; most items should be high-confidence-auto or clearly-routed-to-human.

Indicative Cost Range

Scale Monthly Cost (infrastructure only) Notes
Small (<500 approvals/day) $1,000–$5,000 Approver workforce cost dominates
Medium (500–5,000 approvals/day) $5,000–$20,000 Automation of low-risk sub-decisions reduces workforce
Large (>5,000 approvals/day) $20,000–$100,000 Queue autoscaling; multi-approver workforce management

12. Trade-Off Analysis

Option Comparison

Option Pros Cons Recommended When
A: Human Approval Gateway (this pattern) Full regulatory compliance; prevents AI errors from causing harm; complete audit trail Adds latency and workforce cost; queue backlog risk High-risk AI (EU AI Act Annex III); consequential decisions
B: Post-hoc human review (human reviews after action) Zero latency on AI action Cannot prevent harm from AI errors; not compliant with EU AI Act Art. 14 Low-risk, reversible AI actions only
C: Periodic batch audit (human reviews sample) Low overhead Not real-time; sample may miss errors; fails Art. 14 individual oversight requirement Statistical quality monitoring only, not individual decision oversight
D: Full automation with exception handling Lowest cost; highest throughput No proactive human oversight; regulatory non-compliance for high-risk AI Low-risk, reversible, non-regulated AI only

Architectural Tensions

Tension Trade-Off Resolution
Oversight thoroughness vs. throughput Full review for every item creates backlog Risk-tiered routing: only high-risk items require full review
Approver independence vs. efficiency Context package should not bias approver; but too little info → bad decisions Present AI recommendation clearly as a recommendation, not a directive; ensure approver has independent access to entity data
Audit completeness vs. privacy Full context package in audit log contains PII Encrypt audit log; restrict access; retain PII only as long as required

13. Failure Modes

Failure Likelihood Impact Detection Recovery
Approver rubber-stamping (approving without review) High High — defeats oversight purpose Review time analytics (very short review time = rubber stamp signal); override rate monitoring Mandatory minimum review time; rationale quality scoring; approver calibration training
Queue backlog causes SLA breach Medium High — customer harm from delay; regulatory breach Queue depth monitoring; SLA countdown alerts Surge capacity (on-call approvers); escalation path; emergency threshold adjustment
Context package manipulation (adversarial input) Low High — approver misled Context package rendering uses structured data; sanitisation Strict input validation; structured rendering; not rendering free text from AI system
Timeout auto-approval without oversight Medium Medium — approval without review Timeout auto-approve rate monitoring Restrict auto-approve to pre-approved action types; CDO/CRO sign-off required
Authority enforcement bypass Very Low Critical — unapproved approver authorises high-risk action Authority enforcement in every approval submission Authority enforcer is synchronous gate; no approval executes without authority check

14. Regulatory Considerations

Regulation Article/Clause Requirement Pattern Response
EU AI Act Article 14(1) High-risk AI systems designed to be effectively overseen by natural persons Approval gateway IS the oversight mechanism
EU AI Act Article 14(4)(a) Understand AI system capabilities and limitations Context package includes confidence score and plain-language explanation
EU AI Act Article 14(4)(b) Disregard, override or intervene Reject/modify/defer actions in approver UI
EU AI Act Article 14(4)(c) Interrupt via stop buttons Cancel action available at any approval stage
EU AI Act Article 14(5) Assign oversight to persons with competence and authority Authority matrix enforced by Authority Enforcer
EU AI Act Article 29 Human oversight obligations of deployers Governance artefacts document deployer oversight responsibilities
GDPR Article 22(2)(b) Right not to be subject to purely automated decision making — human oversight required Gateway ensures human reviews consequential decisions
APRA CPS 230 §32 Operational risk management Human approval reduces AI operational risk; DR targets defined
ISO 42001 §8.5 AI system operation controls Human approval gateway is documented operational control
NIST AI RMF MANAGE-1.3 Risk treatment includes human oversight mechanisms Gateway is the MANAGE-1.3 control

15. Reference Implementations

AWS

Component AWS Service
Approval queue Amazon SQS FIFO + DynamoDB for SLA tracking
Approver UI React app on CloudFront + API Gateway + Lambda
Context packager Lambda + SHAP inference on SageMaker
Action executor Step Functions
Audit log S3 Object Lock + CloudWatch Logs Insights
Override analytics Athena + QuickSight

Azure

Component Azure Service
Approval queue Azure Service Bus + Azure Cosmos DB (SLA state)
Approver UI React on Azure Static Web Apps + Azure Functions
Audit log Azure Immutable Blob Storage + Azure Monitor
Override analytics Azure Synapse Analytics

GCP

Component GCP Service
Approval queue Cloud Tasks + Cloud Firestore
Approver UI React on Firebase Hosting + Cloud Functions
Audit log Cloud Storage (retention lock) + BigQuery
Override analytics BigQuery + Looker Studio

On-Premises

Component Technology
Approval queue RabbitMQ / Kafka + PostgreSQL (SLA state)
Approver UI React + FastAPI
Action executor Camunda BPM workflow engine
Audit log PostgreSQL append-only table + MinIO (Object Lock)

Pattern ID Relationship Notes
AI Confidence Threshold Routing EAAPL-HIL005 Depends on Confidence threshold is one trigger for gateway routing
Human Override Pattern EAAPL-HIL006 Specialises Override is a decision type within the approval gateway
Human Escalation Pattern EAAPL-HIL003 Complements Escalation is the timeout response for high-risk cases
Collaborative AI Decision EAAPL-HIL004 Overlaps Gateway is the formal implementation of collaborative decision structure
Active Learning Loop EAAPL-HIL002 Complements Override decisions feed active learning pipeline
Model Rollback EAAPL-MDL004 Triggered by High override rate triggers model rollback consideration

17. Maturity Assessment

Overall Maturity: Mature — Human approval workflows are a well-established operational pattern. EU AI Act Article 14 has accelerated standardisation. Production implementations exist across financial services, healthcare, and government sectors globally.

Dimension Score (1–5) Notes
Architectural clarity 5 Well-defined components; clear ownership model
Tooling maturity 4 Queue, workflow, and audit tools mature; purpose-built approver UX remains custom
Regulatory alignment 5 Best-in-class EU AI Act Art. 14 implementation
Operational complexity 3 Queue management and workforce sizing require ongoing attention
Cost efficiency 3 Approver workforce cost dominant; threshold tuning manages volume
Security 5 Immutable audit, authority enforcement, separation of duties

18. Revision History

Version Date Author Changes
1.0 2023-06-01 EAAPL Working Group Initial publication
1.1 2024-02-15 EAAPL Working Group Added EU AI Act Article 14 detailed mapping; timeout policy section
1.2 2024-09-01 EAAPL Working Group Added Article 14(5) authority competence; override analytics
1.3 2025-03-01 EAAPL Working Group Updated NIST AI RMF MANAGE mapping; EU AI Act enforcement timeline
← Back to LibraryMore Human-in-the-Loop