LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards | Signal Canvas | ScienceToStartup

Page Freshness

Signal Canvas proof surface

Canonical route: /signal-canvas/longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards

stale

Proof freshness: stale
Proof status: unverified
Display score: 8/10
Last proof check: 2026-06-01
Score updated: 2026-06-01
Score fresh until: 2026-07-01
References: 0
Source count: 4
Coverage: 67%

This page is showing the last landed evidence receipt and score bundle because the latest proof data is outside the freshness window.

Agent Handoff

Canonical ID longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards | Route /signal-canvas/longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards

REST example

curl https://sciencetostartup.com/api/v1/agent-handoff/signal-canvas/longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards

MCP example

{
  "tool": "search_signal_canvas",
  "arguments": {
    "mode": "paper",
    "paper_ref": "longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards",
    "query_text": "Summarize LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards"
  }
}

source_context

{
  "surface": "signal_canvas",
  "mode": "paper",
  "query": "LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards",
  "normalized_query": "2605.31584",
  "route": "/signal-canvas/longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards",
  "paper_ref": "longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards",
  "topic_slug": null,
  "benchmark_ref": null,
  "dataset_ref": null
}

Evidence Receipt

Route status: building

Claims: 1

References: Pending verification

Proof: Verification pending

Freshness state: computing

Source paper: LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

PDF: https://arxiv.org/pdf/2605.31584v1

Repository: https://github.com/THU-KEG/LongTraceRL

Source count: 4

Coverage: 67%

Last proof check: 2026-06-01T20:20:18.426Z

Signal Canvas receipt window

Ready for execution: LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

/buildability/longtracerl-learning-long-context-reasoning-from-search-agent-trajectories-with-rubric-rewards

Build Nowready

Subject: LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Verdict

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Use Signal Canvas as the narrative proof surface

Use this Signal Canvas via API or MCP

Signal Canvas proof surface