Releases, implementation notes, and evidence workflows

Updates from the InvarLock project, including release changes, CLI updates, docs changes, and practical notes on how to read or ship evaluation evidence.

Latest post: July 27, 2026 across 21 active tags.

Release

InvarLock 0.14.0: Evaluator-Neutral Evidence and Recipient-Controlled Acceptance

InvarLock 0.14.0 adds evaluator-neutral qualification and a recipient-controlled acceptance handoff while preserving verifier-owned replay and permanent v0.13 compatibility.

July 27, 2026

5 min read

InvarLock Team

Read post

External evaluator records cross a per-record qualification gate into a signed technical receipt, then a separate DSSE envelope and recipient-policy decision, while aggregate-only evidence branches to observation-only.

Release

InvarLock 0.13.0: one signed evaluation transaction

InvarLock 0.13.0 replaces the former guard-and-report workflow with a closed paired request, canonical signed evidence, independent verifier replay, and one authenticated report path.

InvarLock 0.14.0: Evaluator-Neutral Evidence and Recipient-Controlled Acceptance

InvarLock 0.13.0: one signed evaluation transaction

What Release Gates Add to InvarLock Evaluation Artifacts

Invariants Are Necessary but Not Sufficient

Guard-value contracts and stock-clean attention control

Self-edit evidence, backend compatibility, and larger model lanes

What Evidence Packs Still Do Not Prove

How to Archive a Model-Edit Decision So Someone Else Can Recheck It

Report outlines, guard warnings, and wider public evidence

Evidence Packs, Not Screenshots

Runtime Manifests and Why Provenance Must Travel With the Result

Evidence packs, authenticity, and quantized-adapter validation

What Belongs in evaluation.report.json

Calibration Is the Product Surface, Not a Side Utility

Strict assurance and runtime provenance

From Sweep Outputs to Tier Policy

Variance Enablement Should Be Evidence-Gated

Null Sweeps as Threshold Derivation, Not Tuning Folklore

The Minimum Evidence Surface for Trustworthy Weight-Edit Results

Evidence packs and explicit runtime provenance

Fail-Closed Verification for Weight-Edit Evaluation

Tag-based publishing with slimmer release verification

Why Paired Evaluation Beats Before/After Benchmarks

Standalone contract bundles with tighter release gates

GPT-OSS pilots with CUDA-ready attested lanes

What InvarLock Actually Claims

Gemma 4 pilot lanes with a clearer assurance contract

Attested smoke lanes with package-native evidence pack signing

Offline release verification with a slimmer public CLI

Stable public contracts with stricter fail-closed verification

Coverage floors and fail-closed CLI/reporting paths

Quantization, spectral, and report-schema hardening

Evidence pack showcase coverage and reproducible CI

Report rename cleanup and offline evidence-pack hardening

Evaluation reports, strict evidence packs, and Transformers v5

Evidence packs v2 and role-based adapter routing

Measurement contracts for CI and release verification

Deterministic evidence packs and safer perplexity runs

Fail-closed baseline pairing with lower-memory retries

Token-weighted paired statistics and stricter release gates

Calibration, determinism, and regression protection

Large-model reload stability and B200 controls

Quantization-aware adapters and safe device movement

Public evaluate pipeline and report schema v1

Welcome to InvarLock