ScienceToStartup
Product
Proof
DevelopersTrends
Resources
Company

113 Cherry St #92768

Seattle, WA 98104-2205

Backed by Research Labs

Product, Proof, and developer surfaces share one public navigation contract.

Product

  • Daily Dashboard
  • Signal Canvas
  • Build Loop
  • Evidence
  • Workspace
  • Terminal
  • Talent Layer
  • GitHub Velocity

Proof

  • Foresight
  • Proof Layer
  • Proof Homepage
  • Freshness Hub
  • Example Paper Page
  • Topic Proof Layer
  • Benchmark Scorecard
  • Public Dataset

Developers

  • Overview
  • Start Here
  • REST API
  • MCP Server
  • SDKs
  • Examples
  • Keys
  • Docs

Trends

  • Live Desk
  • Archive
  • Entities
  • Narratives
  • Topics
  • Methodology

Resources

  • All Resources
  • Benchmark
  • Dataset
  • Database
  • Glossary
  • Directory
  • Templates
  • Topics

Company

  • Company Hub
  • About
  • Articles
  • Changelog
  • Careers
  • Enterprise
  • Scout
  • RFPs
  • FAQ
  • Legal
  • Privacy
  • Contact
ScienceToStartup

Copyright © 2026 ScienceToStartup. All rights reserved.

Privacy|Legal

Build Loop

Opened from Signal Canvas
Paper: 2604.03136

Papers

155

With code

116

Suggested Build

79

Suggested Watch

26

🔔

Preview from your Build/Watch decisions. Set up Scout for daily delivery.

IDOBE: Infectious Disease Outbreak forecasting Benchmark Ecosystem

Morning brief

High conviction build candidate

DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization

Morning brief

High conviction build candidate

Tight Auditing of Differential Privacy in MST and AIM

48h review

Needs sharper wedge before committing

Saved thesis

Find deployable ai papers with public code, proof pass, and a wedge that can ship inside 6 weeks.

🔔Run morning brief

Novelty / saturation by cluster

Uses the current paper cohort to show whether a lane looks crowded or sparse, with named comparable papers from the same slice.

  • LLM Evaluation

    Screen Before You Interpret: A Portable Validity Protocol for Benchmark-Based LLM Confidence Signals · TPS-CalcBench: A Benchmark and Diagnostic Evaluation Framework for LLM Analytical Calculation Competence in Hypersonic Thermal Protection System Engineering

    7

    Crowded

  • Agents

    Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence · WebUncertainty: Dual-Level Uncertainty Driven Planning and Reasoning For Autonomous Web Agent

    7

    Crowded

  • AI Agents

    AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation · Co-evolving Agent Architectures and Interpretable Reasoning for Automated Optimization

    5

    Crowded

  • LLM Optimization

    Evolutionary Negative Module Pruning for Better LoRA Merging · AQPIM: Breaking the PIM Capacity Wall for LLMs with In-Memory Activation Quantization

    4

    Balanced

  • LLM Safety

    Reverse Constitutional AI: A Framework for Controllable Toxic Data Generation via Probability-Clamped RLAIF · MHSafeEval: Role-Aware Interaction-Level Evaluation of Mental Health Safety in Large Language Models

    4

    Balanced

  • LLM Agents

    Negative Advantage Is a Double-Edged Sword: Calibrating Advantage in GRPO for Deep Search · SELF-EMO: Emotional Self-Evolution from Recognition to Consistent Expression

    4

    Balanced

  • Reinforcement Learning

    OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning · Bounded Ratio Reinforcement Learning

    4

    Balanced

  • Medical AI

    ProtoCLIP: Prototype-Aligned Latent Refinement for Robust Zero-Shot Chest X-Ray Classification · AI Approach for MRI-only Full-Spine Vertebral Segmentation and 3D Reconstruction in Paediatric Scoliosis

    3

    Balanced

  • Multi-Agent Systems

    CADMAS-CTX: Contextual Capability Calibration for Multi-Agent Delegation · Diversity Collapse in Multi-Agent LLM Systems: Structural Coupling and Collective Failure in Open-Ended Idea Generation

    3

    Balanced

  • LLM Reasoning

    A Control Architecture for Training-Free Memory Use · SPREG: Structured Plan Repair with Entropy-Guided Test-Time Intervention for Large Language Model Reasoning

    3

    Balanced

  • LLM Quantization

    DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization · Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition

    2

    Rarer lane

  • LLM Inference Optimization

    Stability Implies Redundancy: Delta Attention Selective Halting for Efficient Long-Context Prefilling · How Much Cache Does Reasoning Need? Depth-Cache Tradeoffs in KV-Compressed Transformers

    2

    Rarer lane

IDOBE: Infectious Disease Outbreak forecasting Benchmark Ecosystem

Epidemiological Forecasting2026-04-20Build NowPendingfreshGitHub stars n/aVelocity flatHistory 1 snapshot
Commercial69
Deployability—
Reproducibility40
Novelty100
View full paper →

No dossier data.