Skip to main content
+S
ScienceToStartup
Product
Proof
Developers
Trends
Resources
Company
Build Loop
Build Loop · Decide which papers become startups. · Today's queue.
BUILD-LOOP
·
1
BROWSE
2
TRIAGE
·
SORT:
SIGNAL
DATE
CITATIONS
MARKET
FRESHNESS
·
CLUSTER:
ALL
Uncategorized
AI Adaptation Systems
AI Ethics and Evaluation
AI/ML Techniques
Distributed Systems
·
FILTERS
1 buildable
of 168 papers
·
11 await proof
·
116 with code
Search
/
Filters
Filters
Proof
Any proof
Verified
Partial
Pending
Failed
Min signal ·
0
Freshness
Any freshness
Fresh
Aging
Stale
Cluster
Any
Uncategorized
163
AI Adaptation Systems
1
AI Ethics and Evaluation
1
AI/ML Techniques
1
Distributed Systems
1
More clusters…
Choose cluster · 6
Esc
to close
Close
A
AI Adaptation Systems
1
AI Ethics and Evaluation
1
AI/ML Techniques
1
D
Distributed Systems
1
R
Research Methodologies
1
U
Uncategorized
163
Sort
Commercial
Novelty
Fresh
Momentum
168 papers
#
PAPER
CLUSTER
DATE
SCORE
1
EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents
AI Adaptation Systems
1d
59
2
Piper: A Programmable Distributed Training System
Distributed Systems
1d
40.77
3
A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design
Research Methodologies
1d
35.77
4
Mitigating Bias in Low-SNR Financial Reinforcement Learning via Quantum Representations
Uncategorized
1d
33.18
5
Speech Meets ELF: Audio Conditional Continuous-Target Diffusion for Speech Recognition and Translation
Uncategorized
1d
32.43
6
Beyond Static Evaluation: Co-Evolutionary Mechanisms for LLM-Driven Strategy Evolution in Adversarial Games
Uncategorized
1d
32.43
7
FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model
Uncategorized
1d
32.43
8
A History-Aware Visually Grounded Critic for Computer Use Agents
Uncategorized
1d
32.4
9
Spatial-Omni: Spatial Audio Understanding Integration in Multimodal LLMs via FOA Encoding
Uncategorized
1d
31.99
10
From Context-Aware to Conflict-Aware: Generalizing Contrastive Decoding for Knowledge Conflict in LLMs
Uncategorized
1d
31.88
11
++nnU-Net: Scaling nnU-Net with Prefix-Based Data Augmentation
Uncategorized
1d
31.88
12
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
Uncategorized
1d
31.55
13
LIBERO-Occ: Evaluating and Improving Vision-Language-Action Models under Scene-Induced Occlusion via Viewpoint Imagination
Uncategorized
1d
31.09
14
One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA
Uncategorized
1d
31
15
The Role of Feedback Alignment in Self-Distillation
AI/ML Techniques
1d
28.07
16
Flaws in the LLM Automation Narrative
AI Ethics and Evaluation
1d
27.17
17
Self-Distillation Policy Optimization via Visual Feedback: Bridging Code and Visual Artifacts
Uncategorized
1d
24.12
18
Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields
Uncategorized
1d
23.81
19
Mind the Gap: Can Frontier LLMs Pass a Standardized Office Proficiency Exam?
Uncategorized
1d
23.26
20
Machine Learning Methods for Studying Latent Neural Activity Dynamics
Uncategorized
1d
23.15
21
ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics
Uncategorized
1d
23.15
22
Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution
Uncategorized
1d
23.15
23
T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains
Uncategorized
1d
23.15
24
Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning
Uncategorized
1d
23.15
25
CIAware-Bench: Benchmarking Control Intervention Awareness Across Frontier LLMs
Uncategorized
1d
23.15
26
Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning
Uncategorized
1d
23.12
27
Test-time Adversarial Takeover: A Real-time Hijacking Interface against Robotic Diffusion Policies
Uncategorized
1d
23.11
28
Do VLMs Reason Like Engineers? A Benchmark and a Stage-wise Evaluation
Uncategorized
1d
22.81
29
Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction
Uncategorized
1d
22.71
30
Reasoning or Memorization? Direction-Aware Diversity Exploration in LLM Reinforcement Learning
Uncategorized
1d
22.71
Show more (138 remaining)
Select a paper from the list to view details.
Build Loop | ScienceToStartup