Buildability / Receipt
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness
This public receipt window renders only fields present in the canonical receipt object, deterministic fixture receipt, or canonical evidence receipt. Missing compute, demo, hash, signature, approval, telemetry, and adoption fields stay explicit.
Public buildability page receipt window
Watch and verify: Mitigating Reward Hacking in RLHF via Advantage Sign Robustness
/buildability/mitigating-reward-hacking-in-rlhf-via-advantage-sign-robustness
Subject: Mitigating Reward Hacking in RLHF via Advantage Sign Robustness
Verdict
Watch
Verdict is Watch because viability or proof quality is intermediate and should be re-evaluated before execution.
Time to first demo
Insufficient data
No first-demo timestamp, owner estimate, or elapsed demo receipt is attached to this surface.
Compute envelope
Structured compute envelope
Insufficient data
No data, compute, hardware, memory, latency, dependency, or serving requirement receipt is attached.
Evidence ids
Receipt path
/buildability/mitigating-reward-hacking-in-rlhf-via-advantage-sign-robustness
Paper ref
mitigating-reward-hacking-in-rlhf-via-advantage-sign-robustness
arXiv id
2604.02986
Freshness
Generated at
2026-04-06T20:12:49.631Z
Evidence freshness
fresh
Last verification
2026-04-06T20:12:49.631Z
Sources
0
References
0
Coverage
0%
Hash state
Lineage hash
f5427281bde9a1b805e8c18be15902b7842d06e30bbcdb3f6c6b7cd8f514cdef
Canonical opportunity-kernel lineage hash.
Signature state
External signature
unsigned_external
No founder, registry, pilot, or production-adoption signature is attached to this receipt.
Verification
not_verified
Verification is blocked until an external signature is provided.
Blockers
- Missing: paper_evidence_receipts.references_count
- Missing: paper_evidence_receipts.coverage
- Unknown: Canonical evidence receipt has not been materialized yet.
Some score or evidence fields are outside the preferred freshness window.
paper_evidence_receipts.references_count
paper_evidence_receipts.coverage
Truth Boundary
External gate remains unresolved for live deployment claims.
Buildability surfaces only report computed viability and proof receipts. They do not claim live production usage, pilot outcomes, founder sign-off, public Brier calibration, judge divergence, or external adoption unless explicitly sourced.