Skip to main content

Why Science to Startup

Three proof layers, a public methodology, and time-travel backtests. Every claim is signed, cited, and graded -- not asserted.

Signed

Every claim carries a cryptographic receipt

Every score, prediction, and enrichment on Science to Startup is backed by an Ed25519 signed receipt. The receipt locks the claim hash, the model version, and the computation timestamp into a tamper-evident envelope that anyone can verify against the public key.

Receipts are published in real time on the Proof Feed and can be verified against our JWKS endpoint or the founder PEM key.

No other research intelligence platform signs its outputs. This is not a feature -- it is the foundation the rest of the proof stack builds on.

Cited

Every signal links back to auditable sources

Signal Fusion scores are not black-box model outputs. Each component signal -- citation velocity, author lineage, GitHub traction, community buzz -- links directly to its source data: arXiv metadata, OpenAlex citation graphs, GitHub repository snapshots, and social-mention threads.

The Signal Canvas surfaces every citation as an inspectable card with the original text, page number, and extraction receipt. Reviewers can audit any claim without trusting the model.

Source provenance is not optional -- it is structurally enforced. The enrichment pipeline rejects any score that cannot cite its upstream data, and the methodology page publishes the Brier calibration across all graded sub-areas so accuracy is verifiable, not asserted.

Graded

Holdout grading with public Brier scores

Every prediction is evaluated against a holdout ground-truth set using the Brier score -- the same metric used by forecasting tournaments like Metaculus and Good Judgment Open. Lower is better; our public gate is 0.30.

The Methodology page publishes the current Brier score, the 95% Wilson confidence interval, a calibration plot, a reliability diagram, and per-sub-area small multiples -- all SSR, no client JavaScript, refreshed by a daily recompute cron.

Ground truth is Opus-4.7-graded with founder spot-check, not founder-graded. This matters: the grading model is disclosed publicly, and anyone can pull our open-source Brier helpers from apps/web/lib/methodology/brier.ts and re-run the math.

How we compare

Six dimensions that separate commercialization intelligence from academic search. Green checkmarks mean the platform has the capability today.

PlatformReal-time scoringSigned evidenceCalibrated predictionsFull-text analysisFounder-fit signalsAPI-first
ScienceToStartup
Allen Institute for AI
----------
MetaShut down 2025 (archive)
----------
Connected Papers
------------
Elicit
----------
Consensus
------------
--------
Iris.ai
----------

Last reviewed: founder-curated, weekly snapshot. Capabilities verified against public documentation and product surfaces.

Time-travel backtest

50 curated cs.AI papers scored against the snapshot tables that existed on or before each paper's publication week. No current-state lookahead -- every score is reproducible. JSON twin

Top-50 curated cs.AI papers spawned 35 companies — 14x the 5.0% base rate.

Baseline 5.0% (cited: https://www.cbinsights.com/research/startup-failure-rates/ · wayback fallback).

Headline multiplier: 14x · top_n 50 · spawned 35

  • Direct spinout: 17 rows
  • Inspired: 17 rows
  • Methodology: 16 rows
PaperPublishedScore @ pub-weekCurrent scoreLineageCompanyOutcomeEvidence
1611.045002019-06-060.940top-decile1.000Direct spinoutCharacter.AIscaledsource
1909.137192014-06-100.927top-decile0.977MethodologyAnyscalescaledsource
2204.023112017-06-220.925top-decile0.855MethodologySnorkel AIshutdownsource
2005.141652018-06-230.917top-decile0.976InspiredMistral AIscaledsource
1502.031672021-06-080.899top-decile0.950InspiredOwkinacquiredsource
1503.025312018-06-140.896top-decile1.000Direct spinoutMosaicMLacquiredsource
1409.04732014-06-100.889top-decile0.775Direct spinoutRecursionshutdownsource
1502.031672020-06-250.887top-decile0.881InspiredAdept AIactivesource
1909.137192022-06-270.868top-decile1.000InspiredPony.aiscaledsource
1706.037622017-06-040.861top-decile0.769InspiredInsilico Medicineshutdownsource
2204.023112014-06-190.854top-decile0.740InspiredRelationalAIshutdownsource
1409.04732015-06-200.845top-decile1.000InspiredNumentascaledsource
1503.025312021-06-170.824top-decile0.898MethodologyHugging Facescaledsource
1810.048052018-06-050.823top-decile0.897InspiredAtomwisescaledsource
1502.031672018-06-050.810top-decile0.959Direct spinoutAdept AIacquiredsource
1611.045002020-06-160.806top-decile0.912Direct spinoutOctoAIscaledsource
1706.037622015-06-110.804top-decile0.937Direct spinoutInsilico Medicinescaledsource
1503.025312020-06-070.799top-decile0.749InspiredMosaicMLshutdownsource
2005.141652016-06-030.792top-decile0.954Direct spinoutMistral AIacquiredsource
2005.141652017-06-130.775top-decile0.705Direct spinoutAI21 Labsshutdownsource
1611.045002014-06-190.768top-decile0.906MethodologyCharacter.AIacquiredsource
2204.023112022-06-090.755top-decile0.818Direct spinoutSnorkel AIscaledsource
1706.037622018-06-140.750top-decile0.713MethodologyCohereactivesource
1611.045002021-06-260.746top-decile0.910InspiredCharacter.AIscaledsource
1610.057552016-06-210.730top-decile0.899MethodologyDetermined AIscaledsource
1503.025312017-06-040.710top-decile0.811Direct spinoutHugging Facescaledsource
1409.04732018-06-230.707top-decile0.826MethodologyRecursionscaledsource
1610.057552021-06-080.705top-decile0.895Direct spinoutDetermined AIscaledsource
1502.031672019-06-150.703top-decile0.792Direct spinoutOwkinscaledsource
1706.037622019-06-240.701top-decile0.583MethodologyInsilico Medicineshutdownsource
1409.04732016-06-030.6960.774InspiredRecursionscaledsource
1610.057552015-06-110.6900.649MethodologyPetuumactivesource
2204.023112016-06-120.6860.565MethodologyRelationalAIshutdownsource
1810.048052019-06-150.6760.855MethodologyAnthropicscaledsource
1909.137192015-06-200.6750.690MethodologyPony.aiactivesource
1502.031672022-06-180.6700.771MethodologyAdept AIscaledsource
1610.057552014-06-010.6580.722InspiredDetermined AIscaledsource
2005.141652020-06-160.6430.727MethodologyMistral AIscaledsource
1706.037622016-06-210.6180.718InspiredCoherescaledsource
1909.137192020-06-070.6180.715Direct spinoutPony.aiscaledsource
1810.048052017-06-220.6150.814InspiredAnthropicscaledsource
1810.048052016-06-120.6090.560Direct spinoutAtomwiseshutdownsource
1909.137192021-06-170.6040.757Direct spinoutAnyscaleacquiredsource
1503.025312019-06-240.5990.523InspiredHugging Faceshutdownsource
2005.141652019-06-060.5980.737InspiredAI21 Labsscaledsource
1610.057552022-06-180.5950.662Direct spinoutPetuumacquiredsource
1611.045002022-06-090.5870.684MethodologyOctoAIacquiredsource
1409.04732017-06-130.5730.574MethodologyNumentaactivesource
1810.048052015-06-020.5620.752Direct spinoutAnthropicacquiredsource
2204.023112015-06-020.5590.652InspiredSnorkel AIscaledsource

Time-travel reproduced: every score is computed against the snapshot tables that existed on or before the paper's publication week — no current-state lookahead.

Dig deeper