data
The full ingestion-enrichment-publish pipeline. Daily Modal jobs at 20:05, 20:30, 21:00 UTC.
Corpus Engine is the offline brain. Daily at 20:05 UTC it ingests new arXiv papers; at 20:30 it enriches them with extraction, knowledge graph, build passport, scoring, and tier assignment; at 21:00 it publishes articles, outreach, media, SEO, and growth events. Everything else in the product reads from what the engine produces.
Source: curated glossary catalog. Freshness: git_versioned_curated_catalog.
Term API · apps/web/data/glossary/terms.ts