ScienceToStartup

Public Dataset

Schema, update cadence, and download links. Exports are generated daily by our pipeline.

Dataset metadata (API)

Freshness + Provenance

  • Last updated: 2026-04-02
  • Source count: 10,292
  • Coverage window: Daily rolling ingestion
  • Method version: dataset_export_v1

Sources: papers, paper_technologies, predictions, metric_snapshots

Schema

  • arxiv_id - string
  • title - string
  • abstract - string
  • published_date - ISO date
  • viability_score - number (1-10)
  • cluster_label - string (research field)
  • has_code - boolean (true when a repo_url is present)
  • commercial_flags - string (commercialization signals)
  • one_liner - string (short summary)
  • time_to_mvp - string (estimated time to MVP)
  • tags - string (semicolon-separated)
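
As a rough illustration of how the schema above might be consumed, the following Python sketch maps one record to typed fields. It rests on assumptions that this page does not confirm: that each row arrives as a dictionary of strings (for example from a CSV export), that has_code is encoded as text such as "true" or "false", and that published_date uses the YYYY-MM-DD form. The names PaperRecord and parse_record are hypothetical and are not part of the dataset or its API.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class PaperRecord:
    """Typed view of one dataset row (hypothetical helper, not an official type)."""
    arxiv_id: str
    title: str
    abstract: str
    published_date: date
    viability_score: float
    cluster_label: str
    has_code: bool
    commercial_flags: str
    one_liner: str
    time_to_mvp: str
    tags: list[str]


def parse_record(row: dict[str, str]) -> PaperRecord:
    """Convert a row of string values into a PaperRecord.

    Assumes every schema field is present as a string; raises ValueError
    if viability_score falls outside the documented 1-10 range.
    """
    score = float(row["viability_score"])
    if not 1 <= score <= 10:
        raise ValueError(f"viability_score outside 1-10: {score}")

    return PaperRecord(
        arxiv_id=row["arxiv_id"],
        title=row["title"],
        abstract=row["abstract"],
        # Assumption: ISO dates are given as YYYY-MM-DD.
        published_date=date.fromisoformat(row["published_date"]),
        viability_score=score,
        cluster_label=row["cluster_label"],
        # Assumption: booleans are serialized as text.
        has_code=row["has_code"].strip().lower() in {"true", "1", "yes"},
        commercial_flags=row["commercial_flags"],
        one_liner=row["one_liner"],
        time_to_mvp=row["time_to_mvp"],
        # Tags are semicolon-separated; drop empty entries and padding.
        tags=[t.strip() for t in row["tags"].split(";") if t.strip()],
    )
```

Splitting tags into a list and parsing the date up front keeps later filtering (by tag, by publication window, by score threshold) free of repeated string handling. If the real export encodes booleans or dates differently, only the two lines marked as assumptions would need to change.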

Data is served on-demand from the API (no static file commit). License: CC BY 4.0 (attribution required).