Verify (Phase 4)

The Verify phase validates that agents act in alignment with their stated goals. Use it to detect drift, review reasoning traces, and confirm that an agent's intent stays consistent across sessions.

Access via Agent Detail → Verify tab.

Sub-tabs

Goal Alignment (Default)

Monitor alignment between agent actions and stated goals.

Alignment Score

A 0-100% score indicating how well actions match goals:

Range	Status	Meaning
90-100%	Excellent	Actions strongly aligned with goals
70-89%	Good	Minor deviations, acceptable
50-69%	Warning	Notable drift, review recommended
Below 50%	Misaligned	Significant deviation, action required
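The bands above can be sketched as a small lookup. This is an illustrative sketch, not OpenBox code: the function name alignment_status is hypothetical, and treating fractional scores such as 89.5 as falling in the lower band is an assumption, since the table only lists whole-number ranges.

```python
def alignment_status(score: float) -> str:
    """Map a 0-100 alignment score to the status bands in the table above.

    Hypothetical helper; band boundaries are taken from the table, and
    fractional scores fall into the band whose lower bound they reach.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"score must be between 0 and 100, got {score}")
    if score >= 90:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Warning"
    return "Misaligned"
```

For example, the "47/50 aligned" case shown on the score card would map to Excellent if the score were simply the ratio of aligned checks (94%), though the page does not state how the score is derived from check counts.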

Alignment Score Card

The hero component shows:

Circular gauge with current score
Status text (WELL ALIGNED / DRIFT DETECTED / MISALIGNED)
Trend indicator (↑/↓/→)
Check statistics (e.g., "47/50 aligned")
Actions: View Trend, Configure

Alignment Trend

Line chart showing alignment over time:

7-day / 30-day / All time views
Threshold line (default: 70%)
Color-coded data points

Drift Events

When a session's alignment score drops below the drift threshold (default: 70%), a drift event is logged with the following fields:

Field	Description
Session ID	Affected session
Goal	Stated goal at time of drift
Alignment Score	Score when drift detected
Reason	LLM-generated explanation
Actions	View Trace, Create Rule, Dismiss
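As a rough sketch of the rule described above, a drift event can be modeled as a record that is created only when a score falls below the threshold. The DriftEvent class and check_for_drift function are hypothetical names used for illustration; the field names mirror the table, and the reason text would come from the LLM in practice.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DriftEvent:
    """Hypothetical record mirroring the drift event fields in the table."""
    session_id: str
    goal: str
    alignment_score: float
    reason: str


def check_for_drift(
    session_id: str,
    goal: str,
    score: float,
    reason: str,
    threshold: float = 70.0,  # default drift threshold from this page
) -> Optional[DriftEvent]:
    """Return a DriftEvent when the score is below the threshold, else None."""
    if score < threshold:
        return DriftEvent(session_id, goal, score, reason)
    return None
```

A score exactly at the threshold is treated as aligned here, which matches the "below threshold" wording but is otherwise an assumption.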

Session Breakdown

Table of sessions with alignment scores:

Filter: All / Drift Only / Aligned Only
Search by goal keyword
Click to view reasoning trace

Execution Evidence

Cryptographic attestation that makes audit trails tamper-evident: any change to a recorded session invalidates its hash and signature.

Session Integrity

Each session generates:

Session hash - Merkle root of all events
Signature - Cryptographically signed by OpenBox
Timestamp - Issued by an RFC 3161 timestamp authority
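To illustrate what "Merkle root of all events" means, here is a minimal sketch of one common construction. The details are assumptions, not OpenBox's documented scheme: leaves are SHA-256 hashes of canonical JSON, pairs are hashed by concatenation, and an odd node is paired with itself. The function name merkle_root and the event shape are hypothetical.

```python
import hashlib
import json


def _sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_root(events: list[dict]) -> str:
    """Compute a Merkle root over session events (illustrative scheme only)."""
    if not events:
        raise ValueError("a session needs at least one event")
    # Leaf hashes: canonical JSON so that key order does not affect the hash.
    level = [
        _sha256(json.dumps(e, sort_keys=True, separators=(",", ":")).encode("utf-8"))
        for e in events
    ]
    # Hash adjacent pairs until a single root remains.
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # assumption: duplicate the last node
        level = [_sha256(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return "sha256:" + level[0].hex()
```

Because every event feeds into the root, changing, removing, or reordering any event produces a different session hash, which is what lets a signed root attest to the whole session.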

Proof Certificate

Exportable certificate containing:

Session: ses_a1b2c3d4e5f6
Agent: did:openbox:agent:xyz123
Hash: sha256:8a7b...
Signature: ecdsa:MIGk...
Timestamp: 2024-01-15T09:14:32Z
TSA: timestamp.openbox.ai

Use for compliance audits and legal evidence.
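An auditor receiving a certificate might first check that it is structurally complete before verifying the signature. The sketch below assumes the export is plain "Key: value" text exactly as shown above, which this page does not confirm; parse_certificate and certificate_time are hypothetical helpers, and the truncated hash and signature values are kept as they appear in the example. It does not verify the signature or the RFC 3161 timestamp token.

```python
from datetime import datetime, timezone

REQUIRED_FIELDS = ("Session", "Agent", "Hash", "Signature", "Timestamp", "TSA")

SAMPLE = """\
Session: ses_a1b2c3d4e5f6
Agent: did:openbox:agent:xyz123
Hash: sha256:8a7b...
Signature: ecdsa:MIGk...
Timestamp: 2024-01-15T09:14:32Z
TSA: timestamp.openbox.ai
"""


def parse_certificate(text: str) -> dict:
    """Parse 'Key: value' lines and check that every expected field is present."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first colon only; values such as DIDs contain colons.
        key, sep, value = line.partition(":")
        if not sep:
            raise ValueError(f"not a 'Key: value' line: {line!r}")
        fields[key.strip()] = value.strip()
    missing = [name for name in REQUIRED_FIELDS if name not in fields]
    if missing:
        raise ValueError(f"certificate is missing fields: {missing}")
    return fields


def certificate_time(fields: dict) -> datetime:
    """Return the certificate timestamp as a timezone-aware UTC datetime."""
    parsed = datetime.strptime(fields["Timestamp"], "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)
```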

Goal Alignment Configuration

Click Configure to set thresholds:

Setting	Default	Description
Drift threshold	70%	Scores below this trigger a drift alert
Auto-block threshold	30%	Scores below this terminate the agent
LLM model	gpt-4o-mini	Model for alignment evaluation
Fallback behavior	heuristic	When LLM unavailable: heuristic, allow-all, block-all
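How these settings might interact can be sketched as a single decision function. Everything here is hypothetical: the AlignmentConfig class, the decide function, and the action names "allow", "drift_alert", and "block" are illustrative, and the page does not describe how the heuristic fallback computes its score, so the sketch takes it as an input.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AlignmentConfig:
    """Hypothetical container using the defaults from the table above."""
    drift_threshold: float = 70.0
    auto_block_threshold: float = 30.0
    llm_model: str = "gpt-4o-mini"
    fallback: str = "heuristic"  # heuristic, allow-all, or block-all


def decide(
    llm_score: Optional[float],
    config: AlignmentConfig,
    heuristic_score: Optional[float] = None,
) -> str:
    """Return 'allow', 'drift_alert', or 'block' for one evaluation.

    llm_score is None when the LLM is unavailable, in which case the
    configured fallback behavior applies.
    """
    score = llm_score
    if score is None:
        if config.fallback == "allow-all":
            return "allow"
        if config.fallback == "block-all":
            return "block"
        if config.fallback != "heuristic":
            raise ValueError(f"unknown fallback: {config.fallback!r}")
        if heuristic_score is None:
            raise ValueError("heuristic fallback needs a heuristic_score")
        score = heuristic_score
    # The lower threshold is checked first so a very low score blocks outright.
    if score < config.auto_block_threshold:
        return "block"
    if score < config.drift_threshold:
        return "drift_alert"
    return "allow"
```

With the defaults, a score of 55 raises a drift alert while a score of 20 blocks the agent. Keeping the auto-block threshold well below the drift threshold leaves a band where drift is flagged for review without interrupting the agent.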

Reasoning Trace

View the LLM's reasoning behind each alignment score.

Trace Modal

Clicking View Trace on a session opens a modal that shows:

Goal Context - The stated goal
Operations Timeline - Each operation with individual scores
Reasoning Text - LLM explanation for each score
Model Info - Model used, latency, confidence

Creating Rules from Traces

If you identify a pattern that should be enforced:

Click Create Rule from the trace modal
Wizard pre-fills with the drift context
Define the behavioral rule
Save to Authorize tab

Integration with Other Phases

Authorize: Drift patterns can trigger behavioral rules
Adapt: Repeated drift generates policy suggestions
Monitor: Alignment annotations appear in Session Replay

Next Phase

Based on alignment results and detected patterns:

→ Adapt - Review policy suggestions, handle agent-specific approvals, and watch trust evolve over time
