MappingEvolve: LLM-Driven Code Evolution for Technology Mapping Build Now
Accelerate technology mapping in logic synthesis via AI-driven code evolution for substantial area and delay reductions.
GitHub 100 stars Velocity flat History 1 snapshot Code Generation Apr 29 Pending High viability
Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations Build Now
Automate online system operation tasks with Bian Que's flexible agentic framework.
GitHub stars n/a Velocity flat History 1 snapshot AI Operations Management Apr 29 Pending High viability
Progressive Semantic Communication for Efficient Edge-Cloud Vision-Language Models Build Now
A framework for efficient edge-cloud visual data processing using progressive semantic communication for VLMs.
GitHub 100 stars Velocity flat History 1 snapshot Edge AI Apr 29 Pending High viability
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models Build Now
A framework for efficiently distilling high-performance diffusion language models into smaller, faster models.
GitHub stars n/a Velocity flat History 1 snapshot AI Model Compression Apr 29 Pending High viability
OMEGA: Optimizing Machine Learning by Evaluating Generated Algorithms Build Now
OMEGA automates ML algorithm generation and optimization using LLMs to outperform common benchmarks.
GitHub stars n/a Velocity flat History 1 snapshot Machine Learning Optimization Apr 29 Code High viability
DepthPilot: From Controllability to Interpretability in Colonoscopy Video Generation Build Now
DepthPilot is an interpretable framework for generating realistic and clinically aligned colonoscopy videos, enabling better surgical navigation and diagnosis.
GitHub stars n/a Velocity flat History pending Medical Video Generation Apr 29 Code High viability
SecMate: Multi-Agent Adaptive Cybersecurity Troubleshooting with Tri-Context Personalization Build Now
SecMate is a multi-agent virtual customer assistant that improves cybersecurity troubleshooting by personalizing to device, user, and service context.
GitHub stars n/a Velocity flat History pending Cybersecurity Apr 29 Code High viability
LATTICE: Evaluating Decision Support Utility of Crypto Agents Build Now
LATTICE benchmarks the decision support utility of crypto agents in user-facing scenarios.
GitHub stars n/a Velocity flat History pending Crypto Decision Support Apr 29 Code High viability
DSIPA: Detecting LLM-Generated Texts via Sentiment-Invariant Patterns Divergence Analysis Build Now
DSIPA is a training-free framework for detecting LLM-generated content using sentiment distribution analysis.
GitHub stars n/a Velocity flat History pending Content Detection Apr 29 Code High viability
Translating Under Pressure: Domain-Aware LLMs for Crisis Communication Build Now
A domain-adaptive pipeline for improving multilingual crisis communication through fine-tuning language models.
GitHub stars n/a Velocity flat History pending Crisis Communication Apr 29 Code High viability
Domain-Adapted Small Language Models for Reliable Clinical Triage Build Now
Domain-adapted small language models improve clinical triage accuracy and efficiency.
GitHub stars n/a Velocity flat History pending Medical AI Apr 29 Code High viability
SynSur: An end-to-end generative pipeline for synthetic industrial surface defect generation and detection Build Now
An end-to-end pipeline for synthetic industrial defect generation and detection to overcome data scarcity.
GitHub stars n/a Velocity flat History pending Industrial AI Apr 29 Code High viability
Lyapunov-Guided Self-Alignment: Test-Time Adaptation for Offline Safe Reinforcement Learning Watch
A transformer-based framework for offline safe reinforcement learning that uses Lyapunov-guided imagination for test-time adaptation without retraining.
GitHub stars n/a Velocity flat History pending Safe Reinforcement Learning Apr 29 Pending
A self-evolving agent for explainable diagnosis of DFT-experiment band-gap mismatch Build Now
XDFT is a self-evolving agent that automates diagnosis of DFT-experiment band-gap mismatches.
GitHub stars n/a Velocity flat History pending Physics AI Apr 29 Code High viability
CheXthought: A global multimodal dataset of clinical chain-of-thought reasoning and visual attention for chest X-ray interpretation Build Now
CheXthought provides a detailed multimodal dataset for enhancing AI-driven chest X-ray interpretation and clinical decision-making.
GitHub stars n/a Velocity flat History 1 snapshot Healthcare Apr 29 Code High viability
Atomic-Probe Governance for Skill Updates in Compositional Robot Policies Build Now
An atomic-quality probe and hybrid selector system for managing skill updates in compositional robot policies, improving reliability and reducing costs.
GitHub stars n/a Velocity flat History pending Robotics Apr 29 Code High viability
Tatemae: Detecting Alignment Faking via Tool Selection in LLMs Build Now
Detecting LLM alignment faking by analyzing tool selection, with a new dataset and evaluation of frontier models.
GitHub stars n/a Velocity flat History pending LLM Security Apr 29 Code High viability
ClawGym: A Scalable Framework for Building Effective Claw Agents Watch
ClawGym is a scalable framework for developing environment-grounded Claw-style agents with a specific focus on task synthesis and evaluation.
GitHub stars n/a Velocity flat History 1 snapshot AI Frameworks Apr 29 Pending
Seeking Consensus: Geometric-Semantic On-the-Fly Recalibration for Open-Vocabulary Remote Sensing Semantic Segmentation Build Now
SeeCo is a plug-and-play framework that recalibrates open-vocabulary models on-the-fly for improved semantic segmentation in remote sensing images.
GitHub stars n/a Velocity flat History pending Remote Sensing AI Apr 29 Code High viability
TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models Build Now
A fine-tuning framework that precisely targets and corrects language confusion in LLMs at the token level, improving multilingual consistency without sacrificing general capabilities.
GitHub stars n/a Velocity flat History pending LLM Fine-tuning Apr 29 Code High viability
Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising Build Now
A unified 4D world model for robots that synthesizes high-fidelity video and 3D reconstructions while enabling real-time action execution.
GitHub stars n/a Velocity flat History pending Robotics Apr 29 Code High viability
HalluCiteChecker: A Lightweight Toolkit for Hallucinated Citation Detection and Verification in the Era of AI Scientists Build Now
HalluCiteChecker is a lightweight, offline toolkit for detecting and verifying hallucinated citations in scientific papers, reducing reviewer workload.
GitHub stars n/a Velocity flat History pending AI Safety Apr 29 Code High viability
When to Vote, When to Rewrite: Disagreement-Guided Strategy Routing for Test-Time Scaling Build Now
A training-free framework that dynamically routes large reasoning models to different scaling strategies based on output disagreement, improving accuracy and reducing cost.
GitHub stars n/a Velocity flat History pending LLM Reasoning Apr 29 Code High viability
FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards Build Now
FutureWorld is a live environment for training predictive agents using real-world outcome rewards.
GitHub stars n/a Velocity flat History pending Agents Apr 29 Code High viability
MedSynapse-V: Bridging Visual Perception and Clinical Intuition via Latent Memory Evolution Build Now
MedSynapse-V bridges visual perception and clinical intuition for enhanced medical diagnosis.
GitHub stars n/a Velocity flat History pending Medical AI Apr 29 Code High viability
StratMem-Bench: Evaluating Strategic Memory Use in Virtual Character Conversation Beyond Factual Recall Build Now
StratMem-Bench evaluates strategic memory use in virtual character conversations for enhanced realism.
GitHub stars n/a Velocity flat History pending Conversational AI Apr 29 Code High viability
Human-in-the-Loop Benchmarking of Heterogeneous LLMs for Automated Competency Assessment in Secondary Level Mathematics Build Now
A Human-in-the-Loop framework for automating secondary-level mathematics assessment using multiple LLMs.
GitHub stars n/a Velocity flat History pending Education AI Apr 29 Code High viability
Tree-of-Text: A Tree-based Prompting Framework for Table-to-Text Generation in the Sports Domain Build Now
A tree-structured prompting framework for LLMs to generate sports game reports from tables, improving comprehension and efficiency.
GitHub stars n/a Velocity flat History pending Table-to-Text Generation Apr 29 Code High viability
Naamah: A Large Scale Synthetic Sanskrit NER Corpus via DBpedia Seeding and LLM Generation Build Now
Naamah is a large-scale, high-quality synthetic Sanskrit NER dataset created using DBpedia seeding and LLM generation, addressing the scarcity of annotated resources for classical Sanskrit literature.
GitHub stars n/a Velocity flat History pending NLP Datasets Apr 29 Code High viability
ATLAS: An Annotation Tool for Long-horizon Robotic Action Segmentation Build Now
An annotation tool for long-horizon robotic action segmentation that synchronizes multi-modal data and streamlines the annotation process.
GitHub stars n/a Velocity flat History pending Robotics Annotation Apr 29 Code High viability
ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance Build Now
A novel optimization framework for diffusion models that enhances perceptual quality without sacrificing generative fidelity.
GitHub stars n/a Velocity flat History pending Diffusion Models Apr 29 Code High viability
STLGT: A Scalable Trace-Based Linear Graph Transformer for Tail Latency Prediction in Microservices Build Now
STLGT is a scalable, trace-based linear graph transformer for accurate and efficient tail-latency prediction in microservice systems, enabling proactive SLO management.
GitHub stars n/a Velocity flat History pending MLOps Apr 29 Code High viability
QYOLO: Lightweight Object Detection via Quantum Inspired Shared Channel Mixing Build Now
QYOLO is a lightweight object detection framework that uses quantum-inspired channel mixing to achieve significant parameter and GFLOPs reduction with minimal accuracy loss.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 29 Code High viability
When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models Build Now
A retrieval framework that adaptively injects evidence during multi-step reasoning for large language models, improving accuracy and efficiency.
GitHub stars n/a Velocity flat History pending Retrieval Augmented Generation Apr 29 Code High viability
Star-Fusion: A Multi-modal Transformer Architecture for Discrete Celestial Orientation via Spherical Topology Build Now
A multi-modal transformer architecture for discrete celestial orientation, achieving high accuracy and efficiency for autonomous spacecraft navigation.
GitHub stars n/a Velocity flat History 1 snapshot Robotics Apr 29 Code High viability
Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation Build Now
A framework for multi-agent policy simulation that uses architectural heterogeneity and coherence validation to prevent artificial consensus and improve diverse decision-making.
GitHub stars n/a Velocity flat History pending Multi-Agent Systems Apr 29 Code High viability
Uncertainty-Aware Reward Discounting for Mitigating Reward Hacking Build Now
An uncertainty-aware reward framework for reinforcement learning that mitigates reward hacking and improves alignment.
GitHub stars n/a Velocity flat History pending Reinforcement Learning Apr 29 Code High viability
Delineating Knowledge Boundaries for Honest Large Vision-Language Models Watch
A framework to enhance refusal capabilities of Vision-Language Models, improving their trustworthiness in specialized domains.
GitHub stars n/a Velocity flat History pending Vision-Language Models Apr 29 Code
DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent Watch
DreamProver is an agentic framework that evolves reusable lemma libraries for formal theorem proving.
GitHub stars n/a Velocity flat History pending Theorem Proving Apr 29 Code
TimeMM: Time-as-Operator Spectral Filtering for Dynamic Multimodal Recommendation Watch
A dynamic multimodal recommendation framework that adapts to evolving user interests over time.
GitHub stars n/a Velocity flat History pending Dynamic Recommendation Systems Apr 29 Code
SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data Watch
An agentic system for evaluating the AI-readiness of scientific data across governance, quality, compatibility, and adaptability.
GitHub stars n/a Velocity flat History pending AI for Science Apr 29 Code
Breaking the Autoregressive Chain: Hyper-Parallel Decoding for Efficient LLM-Based Attribute Value Extraction Watch
A novel decoding algorithm that accelerates LLM-based attribute value extraction by up to 13.8× through hyper-parallel processing, significantly reducing inference costs.
LLM Inference Optimization Apr 29 High viability
Graph Construction and Matching for Imperative Programs using Neural and Structural Methods Watch
A pipeline for converting imperative programs into typed, attributed graphs using neural and structural methods for verification artefact reuse.
GitHub stars n/a Velocity flat History pending Code Analysis Apr 29 Code
Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging Ignore
Shows that option-order randomisation exposes a stable, content-invariant distributional attractor in LLM response positions under prompted sandbagging, offering a black-box signature of this behavior.
GitHub stars n/a Velocity flat History pending LLM Behavior Analysis Apr 29 Pending
From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy Watch
A framework for engineering measurable trust in clinical AI through evidence, supervision, and staged autonomy, moving beyond black-box confidence.
GitHub stars n/a Velocity flat History pending Clinical AI Apr 29 Code
A Toolkit for Detecting Spurious Correlations in Speech Datasets Watch
A toolkit to detect spurious correlations in speech datasets, preventing overestimation of performance in critical applications.
GitHub stars n/a Velocity flat History pending Speech AI Apr 29 Code
Text-Utilization for Encoder-dominated Speech Recognition Models Watch
Efficient methods for using text-only data to improve encoder-dominated speech recognition models, showing that larger encoders paired with smaller decoders perform comparably.
GitHub stars n/a Velocity flat History pending Speech Recognition Apr 29 Code
Grounding vs. Compositionality: On the Non-Complementarity of Reasoning in Neuro-Symbolic Systems Watch
An iterative logic tensor network that empirically demonstrates that reasoning is a capability distinct from symbol grounding for compositional generalization in neural networks.
GitHub stars n/a Velocity flat History pending Neuro-Symbolic AI Apr 29 Code
Calibrated Surprise: An Information-Theoretic Account of Creative Quality Watch
A framework for enhancing creative writing quality through calibrated surprise using information theory.
GitHub stars n/a Velocity flat History pending Creative Writing AI Apr 29 Code
Apriori-based Analysis of Learned Helplessness in Mathematics Tutoring: Behavioral Patterns by Level, Intervention, and Outcome Watch
An Apriori-based analysis of learned helplessness in math tutoring that identifies behavioral patterns by level, intervention, and outcome.
GitHub stars n/a Velocity flat History pending Educational AI Apr 29 Code
MemOVCD: Training-Free Open-Vocabulary Change Detection via Cross-Temporal Memory Reasoning and Global-Local Adaptive Rectification Ignore
A training-free framework for open-vocabulary change detection that uses cross-temporal memory reasoning and global-local adaptive rectification to identify semantic changes in bi-temporal images.
GitHub stars n/a Velocity flat History pending Change Detection Apr 29 Code
Language Diffusion Models are Associative Memories Capable of Retrieving Unseen Data Ignore
This research frames language diffusion models as associative memories, identifying a sharp transition between memorization and generalization using conditional entropy.
GitHub stars n/a Velocity flat History pending LLM Training Apr 29 Code
Auto-Relational Reasoning Ignore
A theoretical framework for automated relational reasoning integrated with ANNs, achieving high IQ test scores.
GitHub stars n/a Velocity flat History pending Reasoning AI Apr 29 Code
ViCrop-Det: Spatial Attention Entropy Guided Cropping for Training-Free Small-Object Detection Ignore
A training-free framework for small object detection that uses spatial attention entropy to adaptively focus computation on high-saliency, high-uncertainty regions, improving performance with marginal latency overhead.
GitHub stars n/a Velocity flat History pending Small Object Detection Apr 29 Code
Random Cloud: Finding Minimal Neural Architectures Without Training Ignore
A training-free method for discovering minimal neural network architectures by progressively reducing random topologies, outperforming pruning baselines with significant parameter reduction and faster execution.
GitHub stars n/a Velocity flat History pending Neural Architecture Search Apr 29 Code
DUAL-BLADE: Dual-Path NVMe-Direct KV-Cache Offloading for Edge LLM Inference Ignore
A framework for efficient LLM inference on edge devices by intelligently offloading KV caches to NVMe storage, reducing latency and improving SSD utilization.
LLM Inference Optimization Apr 29
Benchmarking the Safety of Large Language Models for Robotic Health Attendant Control Ignore
A benchmark of LLM safety for robotic health attendant control, built on a dataset of harmful instructions, that reveals significant safety concerns.
GitHub stars n/a Velocity flat History pending AI Safety Apr 29 Code
Quantum Gatekeeper: Multi-Factor Context-Bound Image Steganography with VQC Based Key Derivation on Quantum Hardware Ignore
A quantum-based image steganography framework that ensures secure payload recovery through multi-factor context binding.
GitHub stars n/a Velocity flat History pending Quantum Steganography Apr 29 Code
Causal Learning with Neural Assemblies Ignore
DIRECT introduces a novel mechanism for causal learning in neural assemblies, enhancing the understanding of causal influence.
GitHub stars n/a Velocity flat History pending Causal Learning Apr 29 Code
Text Style Transfer with Machine Translation for Graphic Designs Ignore
New word-alignment methods for machine translation that improve text style transfer in graphic designs.
Machine Translation Apr 29
AGEL-Comp: A Neuro-Symbolic Framework for Compositional Generalization in Interactive Agents Ignore
A neuro-symbolic framework for interactive agents that improves compositional generalization by grounding actions with a causal program graph and inductive logic programming.
Agents Apr 29
SG-UniBuc-NLP at SemEval-2026 Task 6: Multi-Head RoBERTa with Chunking for Long-Context Evasion Detection Ignore
A system for political question evasion detection using a multi-head RoBERTa with chunking for long contexts.
NLP Classification Apr 29
MetaSR: Content-Adaptive Metadata Orchestration for Generative Super-Resolution Ignore
MetaSR enhances generative super-resolution by dynamically utilizing content-adaptive metadata.
Generative Super-Resolution Apr 29
TDD Governance for Multi-Agent Code Generation via Prompt Engineering Ignore
An AI-native TDD framework that operationalizes classical TDD principles for reliable LLM-assisted development.
Software Development Apr 29
Enforcing Benign Trajectories: A Behavioral Firewall for Structured-Workflow AI Agents Ignore
A behavioral firewall for structured-workflow AI agents to enhance security against tool-call anomalies.
Security for AI Agents Apr 29
Culturally Aware GenAI Risks for Youth: Perspectives from Youth, Parents, and Teachers in a Non-Western Context Ignore
This research explores culturally specific Generative AI risks for youth in non-Western contexts, providing design implications for inclusive parental controls.
AI Ethics & Safety Apr 29
Rule-based High-Level Coaching for Goal-Conditioned Reinforcement Learning in Search-and-Rescue UAV Missions Under Limited-Simulation Training Ignore
This paper proposes a hierarchical decision-making framework for UAV search-and-rescue missions combining rule-based coaching with online reinforcement learning.
Robotics Apr 29
Multi-Stage Bi-Atrial Segmentation Framework from 3D Late Gadolinium-Enhanced MRI using V-Net Family Models Ignore
A multi-stage framework for bi-atrial segmentation from 3D MRI using V-Net models.
Medical Imaging Apr 29
Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework Ignore
The ST-PT framework explores the mathematical equivalence of Transformers and probabilistic models for time series.
Time Series Modeling Apr 29
Resume-ing Control: (Mis)Perceptions of Agency Around GenAI Use in Recruiting Workflows Ignore
This paper explores how generative AI subtly influences control and agency in recruiting workflows, leading to deskilling despite perceived efficiency gains.
Human-AI Interaction Apr 29
Benchmarking Complex Multimodal Document Processing Pipelines: A Unified Evaluation Framework for Enterprise AI Ignore
A benchmarking framework for evaluating complex multimodal document processing pipelines in enterprise AI.
Document AI Apr 29
Persuadability and LLMs as Legal Decision Tools Ignore
This paper explores how Large Language Models respond to legal arguments, investigating their persuadability and implications for legal decision-making.
Legal AI Apr 29
Fundamental Physics, Existential Risks and Human Futures Ignore
Exploring foundational physics with implications for AI and information processing.
Theoretical Physics Apr 29
Qvine: Vine Structured Quantum Circuits for Loading High Dimensional Distributions Ignore
Qvine introduces a vine-structured ansatz for quantum circuits to efficiently load high-dimensional distributions, addressing challenges in quantum machine learning and finance.
Quantum Machine Learning Apr 29
Recent Advances in mm-Wave and Sub-THz/THz Oscillators for FutureG Technologies Ignore
This paper reviews advancements in mm-wave and sub-THz oscillators for future communication technologies.
Oscillator Technology Apr 29