Build Loop | ScienceToStartup
Build Loop · Decide which papers become startups. · Today's queue.
1 buildable of 168 papers · 11 await proof · 116 with code
Clusters (6): AI Infrastructure 1 · AI Model Optimization 1 · AI Model Training Methods 1 · AI Research Critique 1 · AI Tools for Continuous Learning 1 · Uncategorized 163
168 papers
# · PAPER · CLUSTER · DATE · SCORE
1 · EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents · AI Tools for Continuous Learning · 2d · 64.7
2 · A History-Aware Visually Grounded Critic for Computer Use Agents · Uncategorized · 2d · 36.73
3 · Mitigating Bias in Low-SNR Financial Reinforcement Learning via Quantum Representations · Uncategorized · 2d · 36.02
4 · The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment · Uncategorized · 2d · 35.89
5 · A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design · AI Model Training Methods · 2d · 35.77
6 · Speech Meets ELF: Audio Conditional Continuous-Target Diffusion for Speech Recognition and Translation · Uncategorized · 2d · 35.27
7 · FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model · Uncategorized · 2d · 35.27
8 · From Context-Aware to Conflict-Aware: Generalizing Contrastive Decoding for Knowledge Conflict in LLMs · Uncategorized · 2d · 34.72
9 · One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA · Uncategorized · 2d · 33.84
10 · Beyond Static Evaluation: Co-Evolutionary Mechanisms for LLM-Driven Strategy Evolution in Adversarial Games · Uncategorized · 2d · 33.77
11 · Spatial-Omni: Spatial Audio Understanding Integration in Multimodal LLMs via FOA Encoding · Uncategorized · 2d · 33.33
12 · ++nnU-Net: Scaling nnU-Net with Prefix-Based Data Augmentation · Uncategorized · 2d · 33.22
13 · The Role of Feedback Alignment in Self-Distillation · AI Model Optimization · 2d · 32.57
14 · LIBERO-Occ: Evaluating and Improving Vision-Language-Action Models under Scene-Induced Occlusion via Viewpoint Imagination · Uncategorized · 2d · 32.42
15 · Piper: A Programmable Distributed Training System · AI Infrastructure · 2d · 31.77
16 · Flaws in the LLM Automation Narrative · AI Research Critique · 2d · 31.67
17 · Self-Distillation Policy Optimization via Visual Feedback: Bridging Code and Visual Artifacts · Uncategorized · 2d · 24.12
18 · Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields · Uncategorized · 2d · 23.81
19 · Mind the Gap: Can Frontier LLMs Pass a Standardized Office Proficiency Exam? · Uncategorized · 2d · 23.26
20 · Machine Learning Methods for Studying Latent Neural Activity Dynamics · Uncategorized · 2d · 23.15
21 · ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics · Uncategorized · 2d · 23.15
22 · Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution · Uncategorized · 2d · 23.15
23 · T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains · Uncategorized · 2d · 23.15
24 · Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning · Uncategorized · 2d · 23.15
25 · CIAware-Bench: Benchmarking Control Intervention Awareness Across Frontier LLMs · Uncategorized · 2d · 23.15
26 · Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning · Uncategorized · 2d · 23.12
27 · Test-time Adversarial Takeover: A Real-time Hijacking Interface against Robotic Diffusion Policies · Uncategorized · 2d · 23.11
28 · Do VLMs Reason Like Engineers? A Benchmark and a Stage-wise Evaluation · Uncategorized · 2d · 22.81
29 · Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction · Uncategorized · 2d · 22.71
30 · Reasoning or Memorization? Direction-Aware Diversity Exploration in LLM Reinforcement Learning · Uncategorized · 2d · 22.71
Show more (138 remaining)