<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Formal Verification on My Hugo Project</title><link>https://ostensible-paradox.pages.dev/en/tags/formal-verification/</link><description>Recent content in Formal Verification on My Hugo Project</description><generator>Hugo</generator><language>en</language><lastBuildDate>Fri, 15 May 2026 11:13:18 +0000</lastBuildDate><atom:link href="https://ostensible-paradox.pages.dev/en/tags/formal-verification/index.xml" rel="self" type="application/rss+xml"/><item><title>Dual Certificates for Agent Audit: Separating Structural Unrecoverability from Decision Relevance</title><link>https://ostensible-paradox.pages.dev/en/posts/dual-certificates-agent-audit/</link><pubDate>Fri, 15 May 2026 11:13:18 +0000</pubDate><guid>https://ostensible-paradox.pages.dev/en/posts/dual-certificates-agent-audit/</guid><description>Auditing a deployed language-model agent requires two separable quantities: how much operative state escapes the recorded trace, and how much of that residual state drives behavior. We introduce a dual-certificate...</description><content:encoded><![CDATA[<h2 id="abstract">Abstract</h2>
<p>Auditing a deployed language-model agent requires two separable quantities: how much operative state escapes the recorded trace, and how much of that residual state drives behavior. We introduce a dual-certificate protocol. The static certificate $_state^UB$ upper-bounds residual hidden-state entropy by a min-cut on untraced channels. The dynamic certificate $_act^LB$ lower-bounds residual decision relevance through an admissible probe taxonomy—replay, intervention, proxy—under conditional data processing. The two axes are independent. In ReAct experiments, logging ablates the static bound from $16,464$ to $0$ bits; controlled replay separates dormant calculator from active planning tasks under the same topology as a soft policy shift ($0.0163$ bits, 95% CI $[0.0124,0.0208]$) with argmax tool selections unchanged. Indexing $_act^LB$ over hidden-channel coordinates produces an activation profile. On an LLaDA denoising trajectory, perturbations stay near the floor through early steps and rise at the final binding step ($0.110$ bits, 95% CI $[0.052,0.234]$). On a multi-agent communication edge, swapping a worker&rsquo;s private report gives $0.901$ bits, 95% CI $[0.873,0.928]$. A Lean 4 artifact mechanizes the autoregressive zero-cut case and proves the conditional-DPI and chain-rule reductions from Mathlib first principles, with only the cut-set capacity bound remaining as an external structural premise.</p>
<h2 id="contents">Contents</h2>
<ul>
<li><a href="/en/posts/dual-certificates-agent-audit/introduction/">Introduction</a></li>
<li><a href="/en/posts/dual-certificates-agent-audit/related-work/">Related Work</a></li>
<li><a href="/en/posts/dual-certificates-agent-audit/setup-and-audit-regime/">Setup and Audit Regime</a></li>
<li><a href="/en/posts/dual-certificates-agent-audit/static-certificate/">Static Certificate: Structural Upper Bound via Untraced-Channel Capacity</a></li>
<li><a href="/en/posts/dual-certificates-agent-audit/dynamic-certificate/">Dynamic Certificate: Decision Relevance via Conditional DPI</a></li>
<li><a href="/en/posts/dual-certificates-agent-audit/empirical-discussion/">Empirical Diagnostics</a></li>
</ul>
]]></content:encoded></item></channel></rss>