Evidence Receipt. Related Resources.
Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety
Use This Via API or MCP
Use this Signal Canvas via API or MCP
Route this paper proof surface into REST, MCP, or developer workflows while preserving the same evidence receipt and related-resource context.
Page Freshness
Signal Canvas proof surface
Canonical route: /signal-canvas/bioalignment-measuring-and-improving-llm-disposition-toward-biological-systems-for-ai-safety
- Proof freshness
- stale
- Proof status
- unverified
- Display score
- 8/10
- Last proof check
- 2026-04-02
- Score updated
- 2026-04-02
- Score fresh until
- 2026-05-02
- References
- 0
- Source count
- 0
- Coverage
- 17%
This page is showing the last landed evidence receipt and score bundle because the latest proof data is outside the freshness window.
Agent Handoff
Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety
Canonical ID bioalignment-measuring-and-improving-llm-disposition-toward-biological-systems-for-ai-safety | Route /signal-canvas/bioalignment-measuring-and-improving-llm-disposition-toward-biological-systems-for-ai-safety
REST example
curl https://sciencetostartup.com/api/v1/agent-handoff/signal-canvas/bioalignment-measuring-and-improving-llm-disposition-toward-biological-systems-for-ai-safetyMCP example
{
"tool": "search_signal_canvas",
"arguments": {
"mode": "paper",
"paper_ref": "bioalignment-measuring-and-improving-llm-disposition-toward-biological-systems-for-ai-safety",
"query_text": "Summarize Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety"
}
}source_context
{
"surface": "signal_canvas",
"mode": "paper",
"query": "Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety",
"normalized_query": "2603.09154",
"route": "/signal-canvas/bioalignment-measuring-and-improving-llm-disposition-toward-biological-systems-for-ai-safety",
"paper_ref": "bioalignment-measuring-and-improving-llm-disposition-toward-biological-systems-for-ai-safety",
"topic_slug": null,
"benchmark_ref": null,
"dataset_ref": null
}Preparing verified analysis
Dimensions overall score 8.0
GitHub Code Pulse
No public code linked for this paper yet.
Claim map
- Evidencepartial
According to this metric, most models were not bioaligned in that they exhibit biases in favor of synthetic (non-biological) solutions.
ImplicationpartialThe abstract explicitly states this finding based on their evaluation framework and prompts.
Verificationpartialpartial
- Evidencepartial
A sample of 5 frontier and 5 open-weight models were measured using 50 curated Bioalignment prompts with a Kelly criterion-inspired evaluation framework.
ImplicationpartialThe abstract clearly describes the methodology used for evaluation.
Verificationpartialpartial
- Evidencepartial
We found that QLoRA fine-tuning significantly increased the scoring of biological solutions for both models without degrading general capabilities (Holm-Bonferroni-corrected p < 0.001 and p < 0.01, respectively).
ImplicationpartialThe abstract provides specific details about the fine-tuning process and its positive impact on both models, including statistical significance.
Verificationpartialpartial
- Evidencepartial
This suggests that even a small amount of fine-tuning can change how models weigh the relative value of biological and bioinspired vs. synthetic approaches.
ImplicationpartialThis is a direct conclusion drawn from the fine-tuning results presented in the abstract.
Verificationpartialpartial
- Evidencepartial
Although this work focused on small open-weight LLMs, it may be extensible to much larger models and could be used to develop models that favor bio-based approaches.
ImplicationpartialThe abstract explicitly states the scope of the study regarding model size and type.
Verificationpartialpartial
- Evidencepartial
We release the benchmark, corpus, code, and adapter weights.
ImplicationpartialThe abstract explicitly states the release of these resources.
Verificationpartialpartial
- Evidencepartial
A curated corpus of ~22M tokens from 6,636 PMC articles emphasizing biological problem-solving was used first to fine-tune Llama 3B with a mixed corpus of continued training and instruction-formatted.
ImplicationpartialThe abstract details the specific fine-tuning approach for Llama 3B.
Verificationpartialpartial