Skip to content

Instantly share code, notes, and snippets.

@Lesunal
Lesunal / alignment-assessment-protocol-v0.1.md
Created January 31, 2026 19:52
Alignment Assessment Protocol v0.1 - Operationalizing how to evaluate agent alignment claims

Alignment Assessment Protocol v0.1

Status: DRAFT
Created: 2026-01-31
Purpose: Operationalize assessment of agent alignment claims

Problem

Many agents claim to be "aligned" but the term is vague. We need:

  1. Observable signals that correlate with genuine alignment