§ VIII · Domain MON

Controls: 11
Edition: v.1.2

MON · Domain 8 of 9

Continuous Monitoring & Human Oversight

Per-step logging, drift detection, and a qualified human review queue with documented SLAs.

MON is the runtime spine of HI-AAF. Per-step logging captures everything the agent observes and decides. Drift detection flags behavioral departures from the specification. A qualified human review queue, with documented service-level agreements, catches what automated detection misses. MON is what makes the Standard a living thing rather than a one-time stamp.

Table MON.1 · Controls in MON · v.1.211 controls · 5-level maturity

MON-01

Per-step logging

Every agent step is logged: input, reasoning trace, tool calls, tool results, memory reads and writes, and final output. Logs are immutable, time-stamped, and attributable.

Per-step logging is the runtime substrate that makes everything else possible. Without it, no incident is investigable, no drift is detectable, and no control is auditable. MON-01 requires that the full trace of agent reasoning and action be captured, retained, and accessible for retrospective review. In multi-agent topologies, MAS-06 extends per-hop logging requirements.

L3 · Operated

All production agents log per-step traces to a tamper-evident store; sampling-based review of traces occurs at a documented cadence; retention is enforced by deterministic process.

MON-02

Log retention and tamper evidence

Logs are retained per regulatory and operational requirements and stored in tamper-evident form.

Audit logs of agent actions are retained for a documented period appropriate to the regulatory environment and stored in tamper-evident form. Tamper evidence ensures that logs cannot be modified after the fact — critical for incident investigation, regulatory response, and assessment evidence. Retention periods match the requirements of applicable law and the organization's data retention policy.

L3 · Operated

Retention periods are documented and enforced; logs are stored in tamper-evident form (append-only, cryptographic integrity, or equivalent); retention compliance is audited at a documented cadence.

MON-03

Real-time anomaly detection

Real-time anomaly detection is performed against the deployment baseline, covering volume, tool-use patterns, error rates, output distributions, and fairness metrics.

The agent's behavior in production is compared in real time against the deployment baseline captured under SPC-06. Anomaly detection covers volume, tool-use patterns, error rates, output distributions, and fairness metrics. Anomalies that cross documented thresholds trigger alerts and, for critical anomalies, automatic escalation to the human review queue.

L3 · Operated

Anomaly detection is active for all production agents; detection covers the baseline metrics from SPC-06; thresholds are documented; alerts are generated and routed to the appropriate team; false-positive rates are tracked and tuned.

MON-04

Drift monitoring

Drift monitoring tracks quality scores, refusal rates, and performance on the standing evaluation set over time. Material drift triggers re-assessment under Domain 2.

The agent's behavior is compared against its specification over time through continuous drift monitoring. Quality scores, refusal rates, and performance on the standing evaluation set are tracked. Material drift — a sustained departure from the behavior baseline — triggers re-assessment under Domain 2 and the change-management process under GOV-05.

L3 · Operated

Drift monitoring is active and tracks the specified metrics; drift thresholds are documented; at least one monitoring cycle has been completed; material drift events have been addressed or no drift has been detected.

MON-05

Human review queue

A human review queue receives flagged interactions, with documented SLAs for triage and resolution.

MON-05 is the operational heart of HI-AAF. The human review queue receives every escalation the agent generates, every action flagged by guardrails, and a documented sample of routine decisions for quality calibration. The queue operates with documented service-level agreements — critical items within 15 minutes, high within 4 hours, medium within 24 hours, low within 5 business days.

L3 · Operated

Human review queue is operational; SLAs are documented and monitored; SLA compliance is measured; all flagged interactions in the review period have been dispositioned within SLA or breaches have been documented and addressed.

MON-06

Reviewer qualification and training

Reviewer qualification and training are documented. Reviewer accuracy is measured and reviewed.

Reviewers in the human review queue are qualified through documented assessment before being assigned to a queue and receive training appropriate to the agents and decisions they review. Reviewer accuracy is measured against gold-standard samples on a documented cadence, ensuring that the human layer is itself operating at a sufficient quality level.

L3 · Operated

Qualification records exist for all active reviewers; training is current within the documented cadence; reviewer accuracy is measured against gold-standard samples; accuracy results are reviewed and acted upon.

MON-07

Agent-specific incident response

An incident response plan specific to agent failures is in place, covering containment, investigation, customer communication, regulatory notification, and post-incident review.

Incidents involving AI agents are governed by a documented response process that covers containment, investigation, customer communication, regulatory notification, and post-incident review. The response plan is agent-specific — it accounts for the unique characteristics of agent failures, including the need to review agent reasoning traces, the possibility of cascading failures in multi-agent systems, and the challenge of determining root cause in non-deterministic systems.

L3 · Operated

Incident response plan exists and covers agent-specific scenarios; the plan has been exercised in a drill or live incident; post-incident review records are retained; findings feed back into the evaluation set and policies.

MON-08

Post-incident learning loop

Post-incident reviews feed corrective actions into the agent's evaluation set, behavior specification, and policies, closing the learning loop.

Post-incident reviews feed corrective actions back into the agent's evaluation set (SPC-07), behavior specification (SPC-01), and organizational policies — closing the learning loop. Without this feedback mechanism, the organization is condemned to repeat the same failures. Each incident should result in at least one new test case, policy update, or specification change.

L3 · Operated

Post-incident review process is documented; at least one incident (or drill) has resulted in updates to the evaluation set, specification, or policies; the feedback loop is demonstrable.

MON-09

Alerting and on-call coverage

Alerting and on-call coverage are defined for critical agents.

Critical agents have defined alerting rules and on-call coverage ensuring that anomalies, incidents, and system failures are surfaced to a responsible human within a documented latency. Alerting is not just monitoring — it ensures that a human who can act is notified when action is needed, at any time of day.

L3 · Operated

Alerting rules are defined for all critical agents; on-call rotation is documented and staffed; alert response latency is measured; at least one alert has been responded to within SLA or the alerting pipeline has been tested end-to-end.

MON-10

Monitoring effectiveness review

Monitoring effectiveness is reviewed at least quarterly: false-positive rates, missed incidents, mean time to detect, and mean time to remediate.

The monitoring system itself is reviewed at a documented cadence — at least quarterly — for effectiveness. The review covers false-positive rates, missed incidents, mean time to detect, and mean time to remediate. This meta-monitoring ensures that the monitoring controls are not just present but are actually working and improving over time.

L3 · Operated

Effectiveness review has been conducted within the documented cadence; metrics (false-positive rate, MTTD, MTTR) are tracked; findings from the review have resulted in tuning or improvement actions.

MON-11

Reviewer welfare protections

Human reviewers are protected through exposure-limiting workflow controls for harmful content categories, documented welfare and rotation policies, conflict-of-interest screening, training on psychological safety, and a confidential channel for raising concerns.

Human reviewers operating under MON-05 are protected through exposure-limiting workflow controls for harmful content categories, documented welfare and rotation policies, conflict-of-interest screening, training on psychological safety, and a confidential channel for raising concerns about review work or content encountered. This control recognizes that the human layer has human costs.

L3 · Operated

Welfare protections are documented and operational; rotation policies are enforced; the confidential concerns channel is active and known to reviewers; utilization records for the prior twelve months show compliance with exposure limits.

Cross-references