{"data":{"slug":"backtest-run","term":"Backtest Run","bucket":"foresight","definition":"A historical evaluation of the scoring model against known outcomes. Returns hit rate, precision@k, calibration curve.","short_definition":"A historical evaluation of the scoring model against known outcomes. Returns hit rate, precision@k, calibration curve.","long_definition":"A Backtest Run replays a frozen prediction batch against the outcomes the world has since revealed. The result is a hit rate, a precision@k, a calibration curve, and a per-paper accuracy breakdown — the receipts that let outside readers judge whether Foresight is well-calibrated.","related_terms":["foresight","frozen-prediction"],"related_term_routes":[{"slug":"foresight","term":"Foresight","route":"/resources/glossary/foresight"},{"slug":"frozen-prediction","term":"Frozen Prediction","route":"/resources/glossary/frozen-prediction"}],"canonical_route":"/resources/glossary/backtest-run","api_route":"/api/v1/resources/glossary/backtest-run","jsonld_id":"https://sciencetostartup.com/resources/glossary/backtest-run","variants":[],"tldr":"A historical evaluation of the scoring model against known outcomes. Returns hit rate, precision@k, calibration curve.","key_points":[],"quality_tier":null,"citation_count":null,"source_state":"curated_static","source_module":"apps/web/data/glossary/terms.ts","definition_sections":{"schema_version":1,"intro":"A Backtest Run replays a frozen prediction batch against the outcomes the world has since revealed. The result is a hit rate, a precision@k, a calibration curve, and a per-paper accuracy breakdown — the receipts that let outside readers judge whether Foresight is well-calibrated.","sections":[{"title":"Definition","items":[{"subtitle":"Backtest Run","text":"A Backtest Run replays a frozen prediction batch against the outcomes the world has since revealed. The result is a hit rate, a precision@k, a calibration curve, and a per-paper accuracy breakdown — the receipts that let outside readers judge whether Foresight is well-calibrated."}]},{"title":"Related vocabulary","items":[{"subtitle":"Foresight","text":"The verifiable-prediction surface. Public ledger of frozen predictions with backtests, reasoning chain, and a self-improving flywheel."},{"subtitle":"Frozen Prediction","text":"A Foresight prediction whose scores were locked at mint time and cannot change. The public ledger is built from these."}]}],"cited_arxiv_ids":[]}},"meta":{"canonical_route":"/resources/glossary/backtest-run","api_route":"/api/v1/resources/glossary/backtest-run","source":{"label":"curated glossary catalog","source_state":"curated_static","source_module":"apps/web/data/glossary/terms.ts","method_version":"public_glossary_curated_terms_v2","freshness":{"status":"versioned","observed_at":null,"fresh_until":null,"reason":"Git-versioned curated catalog; daily ingestion freshness windows do not apply.","reason_code":"git_versioned_curated_catalog"},"source_count":111,"bucket_count":7,"buckets":["scoring","surfaces","agents","distribution","data","foresight","buildability"]}}}