Skip to main content

Governance Decisions

When an agent operation is evaluated, OpenBox returns one of four governance decisions.

Decision Types

DecisionEffectTrust Impact
HALTTerminates entire agent sessionSignificant negative
BLOCKAction rejected, agent continuesNegative
REQUIRE_APPROVALOperation paused for human reviewNeutral (pending)
ALLOWOperation proceeds normallyPositive (compliance recorded)

ALLOW

The operation is permitted to proceed.

When returned:

  • Operation matches allowed patterns
  • Agent trust tier permits the action
  • No policy violations detected

Effect:

  • Operation executes normally
  • Event logged for audit
  • Behavioral score slightly improves

REQUIRE_APPROVAL

OpenBox pauses the operation pending human approval.

When returned:

  • Policy explicitly requires HITL
  • Operation crosses risk threshold
  • Agent trust tier mandates review

Effect:

  • Request appears in the Approvals queue
  • Session Replay shows the operation context and decision timeline
  • Once a reviewer approves or rejects, the operation proceeds or stops

Approval flow:

1. Operation triggers REQUIRE_APPROVAL
2. Request appears in dashboard queue
3a. Approved → Operation proceeds
3b. Rejected → Operation blocked
3c. Timeout → Operation expires

BLOCK

OpenBox blocks the specific operation.

When returned:

  • Policy explicitly blocks this operation
  • Trust tier prohibits the action
  • Behavioral rule violation detected

Effect:

  • Operation does not execute
  • Event logged with denial reason
  • Behavioral score decreases

HALT

The entire agent session is terminated.

When returned:

  • Critical policy violation
  • Multi-step threat pattern detected
  • Agent trust score critically low
  • Explicit termination rule triggered

Effect:

  • Current activity fails
  • Workflow is canceled
  • All pending operations abandoned
  • Agent may be blocked from further execution
  • Significant trust score decrease
  • Alert generated

Decision Precedence

When multiple policies apply, decisions follow precedence:

HALT > BLOCK > REQUIRE_APPROVAL > ALLOW

If any policy returns HALT, the agent session is terminated regardless of other policies.

Decision in Session Replay

Session Replay shows decisions at each operation:

09:14:32.001  DATABASE_READ     customers.find    ✓ ALLOW
09:14:32.045 LLM_CALL gpt-4 ✓ ALLOW
09:14:32.892 EXTERNAL_API_CALL stripe.com ⏸ REQUIRE_APPROVAL
09:14:45.002 APPROVAL_GRANTED user: john@co ✓ APPROVED
09:14:45.123 EXTERNAL_API_CALL stripe.com ✓ ALLOW (resumed)
09:14:46.001 DATABASE_WRITE audit.log ✓ ALLOW

Customizing Decisions

You can tune how the Authorize phase produces decisions:

  1. Policies (OPA/Rego) - Return allow, deny, or require_approval for specific operations and conditions.
  2. Behavioral Rules - Detect multi-step patterns and escalate to BLOCK, REQUIRE_APPROVAL, or HALT.
  3. Trust-tier conditions - Apply stricter decisions for lower-tier agents and relax controls for higher-tier agents.
  4. Approval timeout settings - Configure how long REQUIRE_APPROVAL requests can remain pending before expiring.

Use policy and behavioral-rule testing before rollout to confirm expected outcomes.