EAAPL-HIL001Proven

8 signals→

Human Approval Gateway

Human-in-the-LoopEU AI ActISO/IEC 42001Field-tested in AU

[EAAPL-HIL001] Human Approval Gateway

Category: Human-in-the-Loop
Sub-category: Oversight / Approval Control
Version: 1.3
Maturity: Mature
Tags: human-oversight, approval-workflow, high-stakes-AI, risk-scoring, audit-trail, EU-AI-Act-Art14
Regulatory Relevance: EU AI Act Articles 14 & 29, APRA CPS 230, ISO 42001 §8.5, NIST AI RMF MANAGE-1.3, GDPR Article 22

1. Executive Summary

High-stakes AI systems — those authorising transactions, making clinical recommendations, or influencing consequential administrative decisions — must not operate with full autonomy. The Human Approval Gateway pattern establishes a mandatory checkpoint between AI recommendation and consequential action, ensuring a qualified human reviews, approves, or rejects AI-proposed actions before they take effect.

This pattern is the foundational implementation of EU AI Act Article 14's human oversight requirement for high-risk AI systems. It defines the approval trigger logic, approval queue design, timeout handling (with configurable responses: escalate, cancel, or time-limited auto-approve with full audit), approval authority levels, and the immutable audit trail required by regulators.

Organisations that implement this pattern have passed EU AI Act conformity assessments, satisfied APRA supervisory expectations for AI in credit decisioning, and reduced AI-driven operational incidents by 40–70% by catching model errors before they produce irreversible consequences.

Target audience: Chief Risk Officers, Chief Compliance Officers, AI Governance leads, Enterprise Architects.

2. Problem Statement

Business Problem

AI systems in consequential domains can cause significant harm when they err without human review. A loan declined by a miscalibrated model, a clinical alert suppressed by a misconfigured AI, or a fraud block applied to a legitimate high-value customer — these outcomes damage customer relationships, create legal liability, and attract regulatory action.

Technical Problem

Many AI systems are designed as straight-through processors: AI decision → action, with no human in the loop.
Where manual review exists, it is often implemented as an afterthought — a separate queue tool with no integration to the AI decision context, poor UX, and no audit trail.
Timeout handling is undefined: if no human responds within SLA, the system either blocks indefinitely or silently auto-approves.
Approval authority levels are not enforced: any available agent may approve high-risk decisions regardless of qualifications.
The audit trail captures only the final approval but not the AI recommendation, confidence score, or supporting rationale reviewed by the approver.

Symptoms

AI errors causing irreversible customer harm that would have been caught by human review.
Regulatory audit finds no evidence of human oversight for automated consequential decisions.
Approval queue backlogs causing SLA breaches and frustrated customers.
Post-incident investigation cannot reconstruct what the human reviewer saw at the time of approval.
Different approvers applying inconsistent standards because approval guidelines are not embedded in the review interface.

Cost of Inaction

Dimension	Impact
Regulatory	EU AI Act Art. 14 violation; APRA enforcement; GDPR Art. 22 breach
Legal	Class action from wrongful AI decisions without human oversight
Customer	Irreversible harm from AI errors; trust destruction
Operational	Uncontrolled AI driving consequential actions without audit trail

3. Context

When to Apply

AI systems classified as high-risk under EU AI Act Annex III (credit, employment, law enforcement, clinical, essential services).
AI actions that are consequential, difficult to reverse, or have significant financial/health/safety impact.
AI systems where model confidence is insufficient for full automation (confidence below 90th percentile threshold).
Regulatory or policy mandate for human-in-the-loop (APRA prudential requirement, clinical governance).
AI systems operating in domains with active judicial review (immigration, benefits, credit).

When NOT to Apply

Trivial, easily reversible AI actions (content formatting, auto-tagging, draft generation).
AI systems with demonstrated near-perfect accuracy over a large sample (≥99.5% accuracy) for specific low-risk subtask categories, with formal risk-based exemption documented.
Situations where human review latency would cause harm greater than the AI error rate (emergency medical AI where delay kills — use audit-based oversight instead).

Prerequisites

Prerequisite	Minimum Viable	Preferred
AI confidence scoring	Binary pass/fail	Calibrated probability score
Approval workflow tooling	Email + manual queue	Purpose-built approval UI + queue management
Authority level definition	Informal	Role-based authority matrix
Audit log	Basic event log	Immutable, structured audit log
Escalation path	Informal supervisor escalation	Automated escalation with SLA tracking

Industry Applicability

Industry	Applicability	Driver
Financial Services (credit)	Mandatory	EU AI Act Annex III; APRA; GDPR Art. 22
Healthcare (clinical AI)	Mandatory	EU AI Act Annex III; clinical governance; liability
Government (benefits, immigration)	Mandatory	EU AI Act Annex III; public law; judicial review
Insurance (claims)	High	Consumer protection; claims dispute risk
Employment / HR	High	EU AI Act Annex III; employment law
Law Enforcement	Mandatory (with restrictions)	EU AI Act Annex III; civil liberties

4. Architecture Overview

Design Philosophy

The Human Approval Gateway is designed on four principles: no autonomous action without approved authority, full context for the approver, defined timeout handling, and immutable comprehensive audit.

Approval Trigger Logic. Not every AI output requires human approval — that would be unworkable and provide no benefit for low-risk routine outputs. The gateway applies trigger logic based on three dimensions: (1) Risk score: AI assigns a risk score to each proposed action; actions above the configured risk threshold are routed to the approval queue. The threshold is calibrated to balance oversight burden (approver workload) against risk coverage. (2) Action type: Certain action types always require approval regardless of confidence (e.g., any adverse decision under GDPR Art. 22; any action with regulatory notification obligation). (3) Confidence threshold: If the AI model's calibrated confidence score falls below a configured threshold (e.g., <85% confidence), the action is routed for human review even if the risk score is moderate.

Context Package Design. The approver sees a structured Context Package — not a raw model output. The Context Package contains: the AI recommendation and confidence score; the key factors driving the recommendation (model explanation, e.g., SHAP values in plain language); the entity profile (customer, patient, applicant); relevant historical context; applicable policy and guidelines; the action to be taken if approved; and the consequences of approval vs. rejection. This design prevents approver over-reliance on the AI recommendation by ensuring the approver has sufficient context to form an independent judgment.

Approval Queue Management. The queue implements priority-based routing: high-risk and time-sensitive cases are prioritised; within priority bands, FIFO ordering ensures fairness. Approvers are matched to cases by authority level (junior agents handle standard cases; senior agents handle high-value or complex cases). The queue tracks SLA countdown per item; approaching SLA triggers escalation.

Timeout Handling. Three configurable timeout responses, selected by action type and risk level: (a) Escalate: When the primary approver SLA expires, the case escalates to a senior approver. This is the preferred response for high-risk cases. (b) Cancel: The proposed action is cancelled; the entity is notified that the request is under review. Used when the action is not time-critical and the consequences of auto-approval are severe. (c) Time-limited auto-approve with audit: For time-critical, lower-risk actions, the system auto-approves after SLA expiry with an explicit audit log entry noting the timeout auto-approval. This must be explicitly enabled; it is not the default.

Audit Trail. Every approval event generates an immutable audit record containing: action ID, AI recommendation, confidence score, context package snapshot (what the approver saw), approver identity and authority level, decision (approve/reject/modify), decision rationale (free text + structured reason code), decision timestamp, elapsed review time, and whether the decision aligned with or overrode the AI recommendation. Override decisions are flagged for pattern analysis (systematic overrides → model retraining signal).

5. Architecture Diagram

ARCHITECTURE DIAGRAM

flowchart TD subgraph AISystem["AI System"] AI1[AI Model + Risk Scorer] AI2[Context Packager] end subgraph Queue["Approval Queue"] AQ1{Trigger Check} AQ2[Priority + Authority Routing] AQ3[SLA Monitor] end subgraph Human["Human Review"] UI1[Approver Interface] TH1[Timeout Handler] end subgraph Outcome["Outcome and Audit"] AE1[Action Executor] AL1[(Immutable Audit Log)] end AI1 --> AI2 AI2 --> AQ1 AQ1 -->|low risk auto| AE1 AQ1 -->|requires approval| AQ2 AQ2 --> UI1 AQ3 -->|SLA breach| TH1 UI1 -->|decision| AE1 TH1 -->|escalate or auto| AE1 AE1 --> AL1 style AI1 fill:#dbeafe,stroke:#3b82f6 style AI2 fill:#dbeafe,stroke:#3b82f6 style AQ1 fill:#f3e8ff,stroke:#a855f7 style AQ2 fill:#f0fdf4,stroke:#22c55e style AQ3 fill:#fee2e2,stroke:#ef4444 style UI1 fill:#f0fdf4,stroke:#22c55e style TH1 fill:#fee2e2,stroke:#ef4444 style AE1 fill:#d1fae5,stroke:#10b981 style AL1 fill:#fef9c3,stroke:#eab308

6. Components

Component	Type	Responsibility	Technology Options	Criticality
Risk Scorer	Processing	Assigns risk score to each AI action based on action type, entity risk profile, model confidence	Custom rule engine, ML risk scorer, FICO-style rules	Critical
Context Packager	Processing	Assembles context package: SHAP explanation, entity profile, guidelines, action summary	Custom Python; SHAP/LIME library; templated rendering	High
Approval Queue	Messaging + Storage	Holds pending approvals; priority ordering; SLA tracking; authority routing	AWS SQS, Azure Service Bus, custom PostgreSQL queue	Critical
SLA Monitor	Processing	Tracks approval SLA per queue item; fires escalation or timeout actions	Custom scheduler, AWS EventBridge rules, Celery beat	High
Approver UI	Application	Displays context package; captures decision and rationale; enforces authority levels	Custom React/Angular app; ServiceNow; Salesforce Flow	Critical
Authority Enforcer	Middleware	Validates that the approving user has required authority level for the action	RBAC integration; IAM policy evaluation	Critical
Action Executor	Service	Executes the approved (or timeout-auto-approved) action in downstream systems	Custom integration layer; webhook; API call	Critical
Immutable Audit Log	Storage	Stores complete approval event records; tamper-evident	AWS S3 Object Lock, Azure Immutable Blob, custom append-only DB	Critical
Override Analytics	Processing	Detects systematic override patterns; generates model retraining signals	Custom Python analytics; MLflow experiment tracking	High
Escalation Manager	Processing	Routes escalated cases to appropriate senior approver; tracks escalation chain	Workflow engine (Camunda, ServiceNow), custom	High

7. Data Flow

Primary Flow

Step	Actor	Action	Output
1	AI System	Generates recommendation with confidence score and risk score	AI output with metadata
2	Trigger Logic	Evaluates risk score, action type, confidence against configured thresholds	Route to queue or auto-execute
3	Context Packager	Assembles approver context package: explanation, entity profile, policy	Structured context package
4	Approval Queue	Enqueues case with priority and SLA; routes to appropriate approver by authority level	Queued approval item with SLA countdown
5	Approver	Reviews context package; forms independent judgment	Mental approval decision
6	Approver UI	Approver selects decision; enters rationale; submits	Decision record
7	Authority Enforcer	Validates approver authority level for this action type	Authorised decision
8	Action Executor	Executes approved action in downstream system	Outcome (transaction, decision, notification)
9	Audit Log	Records complete approval event: AI rec + context + decision + rationale + timing	Immutable audit record
10	Override Analytics	Analyses decisions; flags systematic overrides as model retraining signals	Retraining signal or operational insight

Error Flow

Error Condition	Trigger	Response	Recovery
Approver queue unavailable	Service failure	AI actions blocked until queue restored; alert fired	Queue HA deployment; failover queue; no auto-execution during outage
SLA breach before timeout handler fires	Monitor missed	Audit trail records SLA breach; manual investigation	Monitor SLA with redundant alert mechanisms
Approver attempts action outside authority level	Authority enforcer rejects	Action blocked; approver notified; escalation offered	Review authority matrix; ensure appropriate approvers available
Action Executor fails after approval	Downstream system failure	Approval record exists; action not executed; alert fired	Retry with idempotency key; reconciliation process

8. Security Considerations

Authentication & Authorisation

Approvers authenticate via SSO/MFA; session tokens expire every 4 hours.
Authority level enforcement: RBAC roles mapped to action types; enforced by Authority Enforcer component.
Separation of duties: the AI system cannot approve its own recommendations; approvers cannot modify the AI context package.

Secrets Management

Downstream system integration credentials managed in secrets manager; not exposed to approver UI.

Data Classification

Context packages contain entity personal and financial data; classified as Confidential.
Audit log contains sensitive decision data; classified as Confidential, retained per regulatory schedule.

Encryption

Approval queue messages encrypted in transit (TLS 1.3) and at rest.
Audit log encrypted at rest (AES-256); immutable storage (Object Lock).

Auditability

Every action from submission to approval to execution is logged with actor, timestamp, and decision.
Audit log tamper-evident (hash chaining or Object Lock).
All timeout auto-approvals explicitly logged with reason code "TIMEOUT_AUTO_APPROVE".

OWASP LLM Top 10 Mapping

OWASP LLM Risk	Relevance	Mitigation
LLM09: Overreliance	Approvers rubber-stamp AI recommendations without independent review	Context package design forces approvers to see entity data directly; approval rationale mandatory
LLM01: Prompt Injection	Malicious entity data in context package could manipulate approver presentation	Context package renders structured data, not free text; HTML/prompt injection sanitised
LLM02: Insecure Output Handling	AI recommendation acted upon without validation	Gateway is the validation gate; no action executes without approval
LLM06: Sensitive Information Disclosure	Context package exposes entity PII to approvers	Access control on approval queue; approver role must be authorised for entity data class

9. Governance Considerations

Responsible AI and EU AI Act Article 14 Compliance

EU AI Act Article 14 requires that high-risk AI systems are "designed and developed in such a way, including with appropriate human-machine interface tools, that they can be effectively overseen by natural persons during the period in which the AI system is in use." This pattern is the direct implementation of Article 14.

Specifically, Article 14 requires: (a) ability to understand AI capabilities and limitations — addressed by the context package confidence score and explanation; (b) ability to disregard, override, or intervene — addressed by the approve/reject/modify decision panel; (c) ability to interrupt operation via stop buttons — addressed by the cancel and defer actions; (d) assign oversight tasks to persons with competence and authority — addressed by the authority level enforcement.

Model Risk Management

Systematic override rate (>20% of decisions in a category) triggers mandatory model review.
Approval SLA breach rate (>5%) triggers capacity and workflow review.

Human Approval Checkpoints

Gateway design (thresholds, authority matrix, timeout policy) approved by AI Risk Committee.
Authority matrix reviewed quarterly; updated when action types change.
Threshold adjustments require risk committee sign-off.

Governance Artefacts

Artefact	Owner	Cadence	Purpose
Approval Audit Log	Compliance	Continuous	Immutable record of all approval decisions; regulatory submission
Override Rate Report	AI Governance	Monthly	Proportion of AI recommendations overridden; by action type
Authority Matrix	Risk Committee	Quarterly	Documents which roles may approve which action types
Timeout Policy	AI Risk Committee	Annual	Defines timeout responses per action type; signed off by CRO
SLA Compliance Report	Operations	Weekly	Approval SLA achievement; queue backlog; escalation rate
EU AI Act Conformity Documentation	Compliance	Annual	Documents Art. 14 implementation; oversight mechanism description

10. Operational Considerations

Monitoring

Metric	Alert Threshold	Tooling
Queue depth	>100 items (standard); >20 items (high-risk)	Queue metrics + Grafana
Approval SLA achievement	<90%	SLA monitor + PagerDuty
Escalation rate	>10%	Workflow metrics
Auto-approve (timeout) rate	>5%	Audit log analytics
Override rate per AI model	>20% for any action category	Override analytics
Approver throughput (decisions/hour)	Below capacity threshold	Workforce management system

SLOs

SLO	Target	Measurement
Standard case approval SLA	≤4 business hours	Queue SLA countdown
High-priority case approval SLA	≤30 minutes	Queue SLA countdown
Context package assembly latency	<2 seconds	Context Packager metrics
Approval queue availability	99.95%	Health check

Logging

All approval events logged in structured JSON; retained 10 years (EU AI Act Art. 12 minimum for high-risk AI).
Logs include full context package snapshot; approver identity; decision; rationale; timing.

Incident Management

Approval queue outage → P1; AI system operates in hold mode (no actions until queue restored).
Override rate spike (>30% in 1 hour) → P2; AI Governance lead notified; model review triggered.

Disaster Recovery

Component	RTO	RPO	Strategy
Approval Queue	15 minutes	0	Active-passive queue replica; dead-letter queue for in-flight items
Approver UI	30 minutes	N/A	Multi-AZ deployment; read-only fallback shows pending items
Audit Log	4 hours	0	Synchronous write to append-only store; cross-region replication

11. Cost Considerations

Cost Drivers

Cost Driver	Typical Range	Notes
Approval queue infrastructure	$200–$2,000/month	Managed queue service; scales with throughput
Approver UI development	$50,000–$200,000 one-time	Custom build; significant UX investment warranted
Audit log storage	$50–$1,000/month	10-year retention; scales with decision volume
Approver workforce cost	Dominant cost	$40–$150/hour per approver; scales with queue volume
Override analytics	$200–$1,000/month	Custom analytics pipeline

Optimisations

Reduce approver volume by tuning risk threshold (fewer items in queue); validate with risk committee.
Automate low-complexity context package assembly to reduce approver cognitive load and review time.
Batch similar cases for a single approver session to reduce context-switching overhead.
Use confidence calibration to minimise mid-confidence routing; most items should be high-confidence-auto or clearly-routed-to-human.

Indicative Cost Range

Scale	Monthly Cost (infrastructure only)	Notes
Small (<500 approvals/day)	$1,000–$5,000	Approver workforce cost dominates
Medium (500–5,000 approvals/day)	$5,000–$20,000	Automation of low-risk sub-decisions reduces workforce
Large (>5,000 approvals/day)	$20,000–$100,000	Queue autoscaling; multi-approver workforce management

12. Trade-Off Analysis

Option Comparison

Option	Pros	Cons	Recommended When
A: Human Approval Gateway (this pattern)	Full regulatory compliance; prevents AI errors from causing harm; complete audit trail	Adds latency and workforce cost; queue backlog risk	High-risk AI (EU AI Act Annex III); consequential decisions
B: Post-hoc human review (human reviews after action)	Zero latency on AI action	Cannot prevent harm from AI errors; not compliant with EU AI Act Art. 14	Low-risk, reversible AI actions only
C: Periodic batch audit (human reviews sample)	Low overhead	Not real-time; sample may miss errors; fails Art. 14 individual oversight requirement	Statistical quality monitoring only, not individual decision oversight
D: Full automation with exception handling	Lowest cost; highest throughput	No proactive human oversight; regulatory non-compliance for high-risk AI	Low-risk, reversible, non-regulated AI only

Architectural Tensions

Tension	Trade-Off	Resolution
Oversight thoroughness vs. throughput	Full review for every item creates backlog	Risk-tiered routing: only high-risk items require full review
Approver independence vs. efficiency	Context package should not bias approver; but too little info → bad decisions	Present AI recommendation clearly as a recommendation, not a directive; ensure approver has independent access to entity data
Audit completeness vs. privacy	Full context package in audit log contains PII	Encrypt audit log; restrict access; retain PII only as long as required

13. Failure Modes

Failure	Likelihood	Impact	Detection	Recovery
Approver rubber-stamping (approving without review)	High	High — defeats oversight purpose	Review time analytics (very short review time = rubber stamp signal); override rate monitoring	Mandatory minimum review time; rationale quality scoring; approver calibration training
Queue backlog causes SLA breach	Medium	High — customer harm from delay; regulatory breach	Queue depth monitoring; SLA countdown alerts	Surge capacity (on-call approvers); escalation path; emergency threshold adjustment
Context package manipulation (adversarial input)	Low	High — approver misled	Context package rendering uses structured data; sanitisation	Strict input validation; structured rendering; not rendering free text from AI system
Timeout auto-approval without oversight	Medium	Medium — approval without review	Timeout auto-approve rate monitoring	Restrict auto-approve to pre-approved action types; CDO/CRO sign-off required
Authority enforcement bypass	Very Low	Critical — unapproved approver authorises high-risk action	Authority enforcement in every approval submission	Authority enforcer is synchronous gate; no approval executes without authority check

14. Regulatory Considerations

Regulation	Article/Clause	Requirement	Pattern Response
EU AI Act	Article 14(1)	High-risk AI systems designed to be effectively overseen by natural persons	Approval gateway IS the oversight mechanism
EU AI Act	Article 14(4)(a)	Understand AI system capabilities and limitations	Context package includes confidence score and plain-language explanation
EU AI Act	Article 14(4)(b)	Disregard, override or intervene	Reject/modify/defer actions in approver UI
EU AI Act	Article 14(4)(c)	Interrupt via stop buttons	Cancel action available at any approval stage
EU AI Act	Article 14(5)	Assign oversight to persons with competence and authority	Authority matrix enforced by Authority Enforcer
EU AI Act	Article 29	Human oversight obligations of deployers	Governance artefacts document deployer oversight responsibilities
GDPR	Article 22(2)(b)	Right not to be subject to purely automated decision making — human oversight required	Gateway ensures human reviews consequential decisions
APRA CPS 230	§32	Operational risk management	Human approval reduces AI operational risk; DR targets defined
ISO 42001	§8.5	AI system operation controls	Human approval gateway is documented operational control
NIST AI RMF	MANAGE-1.3	Risk treatment includes human oversight mechanisms	Gateway is the MANAGE-1.3 control

15. Reference Implementations

AWS

Component	AWS Service
Approval queue	Amazon SQS FIFO + DynamoDB for SLA tracking
Approver UI	React app on CloudFront + API Gateway + Lambda
Context packager	Lambda + SHAP inference on SageMaker
Action executor	Step Functions
Audit log	S3 Object Lock + CloudWatch Logs Insights
Override analytics	Athena + QuickSight

Azure

Component	Azure Service
Approval queue	Azure Service Bus + Azure Cosmos DB (SLA state)
Approver UI	React on Azure Static Web Apps + Azure Functions
Audit log	Azure Immutable Blob Storage + Azure Monitor
Override analytics	Azure Synapse Analytics

GCP

Component	GCP Service
Approval queue	Cloud Tasks + Cloud Firestore
Approver UI	React on Firebase Hosting + Cloud Functions
Audit log	Cloud Storage (retention lock) + BigQuery
Override analytics	BigQuery + Looker Studio

On-Premises

Component	Technology
Approval queue	RabbitMQ / Kafka + PostgreSQL (SLA state)
Approver UI	React + FastAPI
Action executor	Camunda BPM workflow engine
Audit log	PostgreSQL append-only table + MinIO (Object Lock)

Pattern	ID	Relationship	Notes
AI Confidence Threshold Routing	EAAPL-HIL005	Depends on	Confidence threshold is one trigger for gateway routing
Human Override Pattern	EAAPL-HIL006	Specialises	Override is a decision type within the approval gateway
Human Escalation Pattern	EAAPL-HIL003	Complements	Escalation is the timeout response for high-risk cases
Collaborative AI Decision	EAAPL-HIL004	Overlaps	Gateway is the formal implementation of collaborative decision structure
Active Learning Loop	EAAPL-HIL002	Complements	Override decisions feed active learning pipeline
Model Rollback	EAAPL-MDL004	Triggered by	High override rate triggers model rollback consideration

17. Maturity Assessment

Overall Maturity: Mature — Human approval workflows are a well-established operational pattern. EU AI Act Article 14 has accelerated standardisation. Production implementations exist across financial services, healthcare, and government sectors globally.

Dimension	Score (1–5)	Notes
Architectural clarity	5	Well-defined components; clear ownership model
Tooling maturity	4	Queue, workflow, and audit tools mature; purpose-built approver UX remains custom
Regulatory alignment	5	Best-in-class EU AI Act Art. 14 implementation
Operational complexity	3	Queue management and workforce sizing require ongoing attention
Cost efficiency	3	Approver workforce cost dominant; threshold tuning manages volume
Security	5	Immutable audit, authority enforcement, separation of duties

18. Revision History

Version	Date	Author	Changes
1.0	2023-06-01	EAAPL Working Group	Initial publication
1.1	2024-02-15	EAAPL Working Group	Added EU AI Act Article 14 detailed mapping; timeout policy section
1.2	2024-09-01	EAAPL Working Group	Added Article 14(5) authority competence; override analytics
1.3	2025-03-01	EAAPL Working Group	Updated NIST AI RMF MANAGE mapping; EU AI Act enforcement timeline

Track this pattern for APRA/ASIC review

← Back to Library More Human-in-the-Loop →