ChatSVA: Bridging SVA Generation for Hardware Verification via Task-Specific LLMs Build Now
ChatSVA automates SystemVerilog Assertions for hardware verification, improving speed and accuracy with task-specific LLMs.
GitHub stars n/a Velocity flat History 1 snapshot Hardware Verification AI Apr 3 Code High viability
OMNI-PoseX: A Fast Vision Model for 6D Object Pose Estimation in Embodied Tasks Build Now
OMNI-PoseX provides real-time, accurate 6D object pose estimation for embodied robotic tasks, outperforming current solutions.
GitHub stars n/a Velocity flat History 1 snapshot 6D Object Pose Estimation Apr 3 Code High viability
SentiAvatar: Towards Expressive and Interactive Digital Humans Build Now
Create expressive 3D digital avatars for real-time interactive applications using SentiAvatar's framework.
GitHub stars n/a Velocity flat History 1 snapshot AI-based Avatars and Digital Humans Apr 3 Code High viability
Flash-Mono: Feed-Forward Accelerated Gaussian Splatting Monocular SLAM Build Now
Flash-Mono accelerates monocular SLAM with Gaussian splatting for real-time 3D scene reconstruction.
GitHub stars n/a Velocity flat History 1 snapshot Computer Vision - SLAM & 3D Mapping Apr 3 Code High viability
ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving Build Now
ExploreVLA integrates dense world modeling with RL for robust autonomous driving exploration.
GitHub stars n/a Velocity flat History 1 snapshot Autonomous Driving Models Apr 3 Code High viability
LogicPoison: Logical Attacks on Graph Retrieval-Augmented Generation Build Now
An AI security tool to defend against logic-based attacks on Graph-based Retrieval Augmentation systems.
GitHub stars n/a Velocity flat History 1 snapshot AI Security Apr 3 Pending High viability
DocShield: Towards AI Document Safety via Evidence-Grounded Agentic Reasoning Build Now
DocShield is an AI-powered forensic tool for identifying and explaining text-centric document forgeries.
GitHub stars n/a Velocity flat History 1 snapshot Document Security Apr 3 Code High viability
SCC-Loc: A Unified Semantic Cascade Consensus Framework for UAV Thermal Geo-Localization Build Now
Revolutionizing UAV navigation with a zero-shot thermal geo-localization system, SCC-Loc.
GitHub stars n/a Velocity flat History 1 snapshot Thermal Geo-Localization Apr 3 Pending High viability
FSUNav: A Cerebrum-Cerebellum Architecture for Fast, Safe, and Universal Zero-Shot Goal-Oriented Navigation Build Now
FSUNav provides a unified zero-shot goal-oriented navigation system for diverse robotic platforms ensuring fast, safe, and semantic-rich interaction with complex environments.
GitHub stars n/a Velocity flat History 1 snapshot Robot Navigation Apr 3 Code High viability
GenSmoke-GS: A Multi-Stage Method for Novel View Synthesis from Smoke-Degraded Images Using a Generative Model Build Now
GenSmoke-GS offers a multi-stage solution for improving visibility and coherence in smoke-degraded image processing for enhanced 3D reconstructions.
GitHub stars n/a Velocity flat History 1 snapshot Computer Vision Apr 3 Pending High viability
Token-Efficient Multimodal Reasoning via Image Prompt Packaging Build Now
Reduce multimodal AI inference costs by embedding structured text directly into images, achieving significant savings with competitive accuracy.
GitHub stars n/a Velocity flat History pending Multimodal AI Apr 2 Code High viability
Generating Satellite Imagery Data for Wildfire Detection through Mask-Conditioned Generative AI Build Now
Generate realistic satellite imagery for wildfire detection using mask-conditioned diffusion models, addressing data scarcity with a novel inpainting approach.
GitHub stars n/a Velocity flat History pending Generative Data Augmentation Apr 2 Code High viability
Hierarchical, Interpretable, Label-Free Concept Bottleneck Model Build Now
A hierarchical, label-free concept bottleneck model that enhances interpretability and classification accuracy by mirroring human cognitive processes.
GitHub stars n/a Velocity flat History pending Interpretable AI Apr 2 Code High viability
Opal: Private Memory for Personal AI Watch
Opal provides private, scalable long-term memory for personal AI by decoupling data-dependent reasoning into a trusted enclave, improving retrieval accuracy and reducing costs.
Private AI Memory Apr 2 High viability
An Explainable Vision-Language Model Framework with Adaptive PID-Tversky Loss for Lumbar Spinal Stenosis Diagnosis Build Now
An explainable vision-language model framework with adaptive loss for accurate and interpretable lumbar spinal stenosis diagnosis from MRI.
GitHub stars n/a Velocity flat History pending Medical AI Apr 2 Code High viability
Street-Legal Physical-World Adversarial Rim for License Plates Build Now
Develop street-legal, low-cost physical adversarial attacks to disrupt license plate reader systems, with potential applications in security and privacy.
GitHub stars n/a Velocity flat History pending Adversarial Attacks Apr 2 Code High viability
VERTIGO: Visual Preference Optimization for Cinematic Camera Trajectory Generation Watch
VERTIGO optimizes cinematic camera trajectory generation by incorporating visual preference feedback, significantly improving framing and reducing off-screen characters.
Generative Video Apr 2 High viability
Mitigating Data Scarcity in Spaceflight Applications for Offline Reinforcement Learning Using Physics-Informed Deep Generative Models Build Now
Generate realistic synthetic data for spaceflight reinforcement learning using physics-informed generative models to overcome data scarcity and improve controller performance.
GitHub stars n/a Velocity flat History pending Offline Reinforcement Learning Apr 2 Code High viability
Re-analysis of the Human Transcription Factor Atlas Recovers TF-Specific Signatures from Pooled Single-Cell Screens with Missing Controls Build Now
A reproducible pipeline to recover valuable transcription factor insights from incomplete single-cell perturbation data, enabling deeper biological discovery.
GitHub stars n/a Velocity flat History pending Genomics & Transcriptomics Apr 2 Code High viability
Causal-Audit: A Framework for Risk Assessment of Assumption Violations in Time-Series Causal Discovery Build Now
A framework for assessing and mitigating risks of assumption violations in time-series causal discovery, providing calibrated risk scores and abstention policies for reliable inference.
GitHub stars n/a Velocity flat History pending Causal Discovery Apr 2 Code High viability
Overconfidence and Calibration in Medical VQA: Empirical Findings and Hallucination-Aware Mitigation Build Now
This research develops a hallucination-aware calibration method to improve the reliability and trustworthiness of vision-language models in medical question answering.
GitHub stars n/a Velocity flat History pending Medical AI Apr 2 Code High viability
SWAY: A Counterfactual Computational Linguistic Approach to Measuring and Mitigating Sycophancy Build Now
A novel computational linguistic metric and mitigation strategy to eliminate sycophancy in large language models.
GitHub stars n/a Velocity flat History pending LLM Alignment Apr 2 Code High viability
AIVV: Neuro-Symbolic LLM Agent-Integrated Verification and Validation for Trustworthy Autonomous Systems Build Now
Automate the unsustainable manual workload of autonomous system verification and validation using LLM agents to analyze and validate system anomalies against natural language requirements.
GitHub stars n/a Velocity flat History pending Autonomous Systems Verification Apr 2 Code High viability
Contrastive Language-Colored Pointmap Pretraining for Unified 3D Scene Understanding Build Now
A unified 3D scene understanding model that leverages contrastive language-image pretraining for improved representation learning.
GitHub stars n/a Velocity flat History pending 3D Scene Understanding Apr 2 Code High viability
PolyJarvis: LLM Agent for Autonomous Polymer MD Simulations Build Now
An LLM-powered agent that autonomously performs complex polymer molecular dynamics simulations from natural language, enabling faster and more accessible material property prediction.
GitHub stars n/a Velocity flat History pending AI for Materials Science Apr 2 Code High viability
Matrix Profile for Time-Series Anomaly Detection: A Reproducible Open-Source Benchmark on TSB-AD Build Now
An open-source benchmark and implementation of Matrix Profile methods for scalable and interpretable time-series anomaly detection.
GitHub stars n/a Velocity flat History pending Time-Series Anomaly Detection Apr 2 Pending High viability
Rapidly deploying on-device eye tracking by distilling visual foundation models Build Now
A framework for rapidly deploying high-accuracy, on-device eye tracking by distilling visual foundation models using synthetic and real-world data.
GitHub stars n/a Velocity flat History pending On-Device Eye Tracking Apr 2 Code High viability
Do We Need Frontier Models to Verify Mathematical Proofs? Build Now
Develops a prompt engineering technique to enable smaller, open-source LLMs to reliably verify mathematical proofs, matching the performance of frontier models.
GitHub stars n/a Velocity flat History pending LLM Reasoning Apr 2 Code High viability
Reinforcement Learning from Human Feedback: A Statistical Perspective Build Now
A statistical framework for aligning large language models with human preferences, offering a robust approach to reward modeling and policy optimization.
GitHub stars n/a Velocity flat History pending LLM Alignment Apr 2 Pending High viability
Delaunay Canopy: Building Wireframe Reconstruction from Airborne LiDAR Point Clouds via Delaunay Graph Build Now
A novel method for accurate building wireframe reconstruction from LiDAR data using Delaunay graphs, overcoming limitations in noisy and sparse environments.
GitHub stars n/a Velocity flat History pending 3D Reconstruction Apr 2 Code High viability
Competency Questions as Executable Plans: a Controlled RAG Architecture for Cultural Heritage Storytelling Build Now
A neuro-symbolic RAG architecture using competency questions as executable plans to generate factually accurate and auditable stories from cultural heritage knowledge graphs.
GitHub stars n/a Velocity flat History pending Controlled RAG for Cultural Heritage Apr 2 Code High viability
PlayGen-MoG: Framework for Diverse Multi-Agent Play Generation via Mixture-of-Gaussians Trajectory Prediction Build Now
A framework for generating diverse and coordinated multi-agent plays from static formations, overcoming limitations of existing generative models.
GitHub stars n/a Velocity flat History pending Multi-Agent Systems Apr 2 Code High viability
Guideline2Graph: Profile-Aware Multimodal Parsing for Executable Clinical Decision Graphs Build Now
Automates the conversion of complex clinical guidelines into executable decision support systems, significantly improving accuracy and auditability.
GitHub stars n/a Velocity flat History pending Medical AI Apr 2 Code High viability
Adaptive Learned State Estimation based on KalmanNet Build Now
An adaptive multi-modal Kalman filter for improved state estimation in autonomous driving using sensor-specific learned noise characteristics.
GitHub stars n/a Velocity flat History pending Sensor Fusion Apr 2 Code High viability
Compositional Neuro-Symbolic Reasoning Build Now
A neuro-symbolic framework that enhances LLMs with structured reasoning capabilities for complex problem-solving, achieving significant performance gains on the ARC-AGI-2 benchmark.
GitHub stars n/a Velocity flat History pending Neuro-Symbolic Reasoning Apr 2 Pending High viability
Smart Transfer: Leveraging Vision Foundation Model for Rapid Building Damage Mapping with Post-Earthquake VHR Imagery Build Now
A GeoAI framework leveraging vision foundation models for rapid building damage mapping from post-earthquake imagery, with publicly available code and data.
GitHub stars n/a Velocity flat History pending Geospatial AI Apr 3 Pending High viability
Making Written Theorems Explorable by Grounding Them in Formal Representations Build Now
An LLM-powered system that translates mathematical theorems and proofs into executable code, enabling interactive exploration and deeper understanding for users.
GitHub stars n/a Velocity flat History pending AI for Education/Research Apr 3 Code High viability
Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments Build Now
This research provides a novel, comprehensive framework for auditing LLM bias that reveals systematic mischaracterizations by current alignment practices, offering a path to more robust safety and fairness evaluations.
GitHub stars n/a Velocity flat History pending LLM Alignment & Bias Apr 3 Code High viability
Efficient3D: A Unified Framework for Adaptive and Debiased Token Reduction in 3D MLLMs Build Now
A framework to significantly reduce inference costs for 3D Multimodal Large Language Models by adaptively pruning visual tokens, maintaining accuracy and enabling deployment on resource-constrained devices.
GitHub stars n/a Velocity flat History pending 3D MLLM Optimization Apr 3 Pending High viability
AXELRAM: Quantize Once, Never Dequantize Build Now
A novel SRAM architecture for LLM inference that drastically reduces computation by performing attention scores directly on quantized KV cache indices, with a gradient-free method to ensure stability.
GitHub stars n/a Velocity flat History pending LLM Inference Optimization Apr 3 Pending High viability
XrayClaw: Cooperative-Competitive Multi-Agent Alignment for Trustworthy Chest X-ray Diagnosis Build Now
A multi-agent AI system that improves the trustworthiness and accuracy of chest X-ray diagnoses by simulating a cooperative-competitive clinical workflow.
GitHub stars n/a Velocity flat History pending Medical AI Apr 3 Code High viability
Steerable but Not Decodable: Function Vectors Operate Beyond the Logit Lens Build Now
This research demonstrates a novel method to steer large language model behavior using function vectors, achieving high accuracy even when the model's internal representations are not decodable, suggesting a new paradigm for controlling LLM outputs.
GitHub stars n/a Velocity flat History pending LLM Steering Apr 3 Code High viability
SocioEval: A Template-Based Framework for Evaluating Socioeconomic Status Bias in Foundation Models Build Now
A framework for systematically evaluating and mitigating socioeconomic bias in foundation models, addressing a critical gap in responsible AI.
GitHub stars n/a Velocity flat History pending LLM Bias Evaluation Apr 3 Code High viability
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Watch
A multi-agent reinforcement learning system that consistently beats human grandmasters in competitive programming.
Agents Apr 3 High viability
Mitigating LLM biases toward spurious social contexts using direct preference optimization Build Now
A novel training method significantly reduces LLM bias towards spurious social contexts in high-stakes decision-making, improving both accuracy and robustness.
GitHub stars n/a Velocity flat History pending LLM Bias Mitigation Apr 2 Code High viability
Let's Have a Conversation: Designing and Evaluating LLM Agents for Interactive Optimization Build Now
Develops a novel methodology and tailored LLM agents to significantly improve optimization solution quality through interactive conversations, bridging AI and operations research.
GitHub stars n/a Velocity flat History pending LLM Agents Apr 3 Code High viability
Fast NF4 Dequantization Kernels for Large Language Model Inference Build Now
Accelerate LLM inference by up to 2.2x with a plug-and-play dequantization kernel that leverages shared memory, reducing costs on existing GPU infrastructure.
GitHub stars n/a Velocity flat History pending LLM Inference Optimization Apr 2 Code High viability
ContractShield: Bridging Semantic-Structural Gaps via Hierarchical Cross-Modal Fusion for Multi-Label Vulnerability Detection in Obfuscated Smart Contracts Build Now
ContractShield is a multimodal framework that uses hierarchical cross-modal fusion to detect vulnerabilities in obfuscated smart contracts, outperforming state-of-the-art by 6-15%.
GitHub stars n/a Velocity flat History pending Smart Contract Security Apr 3 Code High viability
DeltaLogic: Minimal Premise Edits Reveal Belief-Revision Failures in Logical Reasoning Models Build Now
A new benchmark and evaluation protocol for assessing and improving belief revision capabilities in language models, crucial for dynamic environments.
GitHub stars n/a Velocity flat History pending Logical Reasoning Apr 3 Code High viability
InverseDraping: Recovering Sewing Patterns from 3D Garment Surfaces via BoxMesh Bridging Build Now
Recovering precise 2D sewing patterns from 3D garment scans using a novel BoxMesh representation and a two-stage autoregressive model.
GitHub stars n/a Velocity flat History pending 3D Digitization Apr 3 Code High viability
Principled and Scalable Diversity-Aware Retrieval via Cardinality-Constrained Binary Quadratic Programming Build Now
A principled and scalable method for diversity-aware retrieval in RAG, offering theoretical guarantees and significant speedups.
GitHub stars n/a Velocity flat History pending RAG Optimization Apr 2 Code High viability
From Theory to Practice: Code Generation Using LLMs for CAPEC and CWE Frameworks Build Now
A novel dataset of vulnerable code snippets linked to CAPEC and CWE, generated by LLMs, to train automated vulnerability detection systems.
GitHub stars n/a Velocity flat History pending Security Vulnerability Detection Apr 2 Code High viability
Elastomeric Strain Limitation for Design of Soft Pneumatic Actuators Build Now
Develop human-safe soft robotic actuators with controllable shape and force using electroadhesive strain limiters and advanced modeling for applications in collaborative robotics.
GitHub stars n/a Velocity flat History pending Soft Robotics Apr 3 Code High viability
Differentiable SpaTiaL: Symbolic Learning and Reasoning with Geometric Temporal Logic for Manipulation Tasks Build Now
A fully differentiable symbolic logic toolbox for robot manipulation that enables end-to-end learning and optimization of complex geometric and temporal constraints.
GitHub stars n/a Velocity flat History pending Robotics Apr 3 Pending High viability
Visual Instruction-Finetuned Language Model for Versatile Brain MR Image Tasks Build Now
A versatile LLM for brain MRI that performs report generation, VQA, segmentation, and translation, outperforming specialized models.
GitHub stars n/a Velocity flat History pending Medical AI Apr 3 Code High viability
Eligibility-Aware Evidence Synthesis: An Agentic Framework for Clinical Trial Meta-Analysis Build Now
An agentic framework that automates clinical trial discovery and eligibility-aware meta-analysis for precision medicine evidence synthesis.
GitHub stars n/a Velocity flat History pending Medical AI Apr 3 Code High viability
OntoKG: Ontology-Oriented Knowledge Graph Construction with Intrinsic-Relational Routing Build Now
A novel ontology-oriented approach to knowledge graph construction that decouples schema design from graph building, enabling reusable and adaptable knowledge representations for downstream AI tasks.
GitHub stars n/a Velocity flat History pending Knowledge Graph Construction Apr 3 Code High viability
VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation Build Now
A benchmark and platform for evaluating and improving 3D spatial reasoning in code generation models.
GitHub stars n/a Velocity flat History pending 3D World Modeling Apr 2 Code High viability
MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications Build Now
A multi-sensor foundation model for Mars orbital applications, outperforming existing baselines on downstream tasks.
GitHub stars n/a Velocity flat History pending Earth Observation AI Apr 3 Pending High viability
DeCo-DETR: Decoupled Cognition DETR for efficient Open-Vocabulary Object Detection Build Now
A vision-centric framework for efficient open-vocabulary object detection that decouples semantic reasoning from localization for practical deployment.
GitHub stars n/a Velocity flat History pending Open-Vocabulary Object Detection Apr 3 Code High viability
WSVD: Weighted Low-Rank Approximation for Fast and Efficient Execution of Low-Precision Vision-Language Models Build Now
Accelerate vision-language model inference by 1.8x using weighted low-rank approximation and quantization, preserving accuracy.
GitHub stars n/a Velocity flat History pending LLM Optimization Apr 2 Pending High viability
ALIVE-LIO: Degeneracy-Aware Learning of Inertial Velocity for Enhancing ESKF-Based LiDAR-Inertial Odometry Build Now
A degeneracy-aware LiDAR-inertial odometry framework that uses a neural network to predict velocity and improve state estimation in challenging environments.
GitHub stars n/a Velocity flat History pending Robotics Apr 3 Code High viability
TrackerSplat: Exploiting Point Tracking for Fast and Robust Dynamic 3D Gaussians Reconstruction Build Now
TrackerSplat enhances 3D Gaussian Splatting for dynamic scenes by using point tracking to improve robustness and throughput in reconstructions with fast object motion.
GitHub stars n/a Velocity flat History pending 3D Reconstruction Apr 2 Pending High viability
Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents Build Now
A neuro-symbolic framework for LLM agents that decouples semantic guidance from logical validation to improve long-horizon decision-making.
GitHub stars n/a Velocity flat History pending LLM Agents Apr 3 Code High viability
Drift-Resilient Temporal Priors for Visual Tracking Build Now
A lightweight module that significantly improves visual tracking performance by intelligently filtering noisy historical data and synthesizing dynamic temporal priors.
GitHub stars n/a Velocity flat History pending Visual Tracking Apr 3 Code High viability
STDDN: A Physics-Guided Deep Learning Framework for Crowd Simulation Build Now
A physics-guided deep learning framework for accurate and efficient crowd simulation, outperforming state-of-the-art with reduced latency.
GitHub stars n/a Velocity flat History pending Simulation Apr 3 Code High viability
Generalized Small Object Detection:A Point-Prompted Paradigm and Benchmark Build Now
A new paradigm for small object detection that uses point prompts at inference time to significantly improve accuracy and generalize to unseen objects and datasets.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 3 Code High viability
VBGS-SLAM: Variational Bayesian Gaussian Splatting Simultaneous Localization and Mapping Build Now
A probabilistic SLAM system using Gaussian Splatting that improves robustness and reduces drift by explicitly modeling uncertainty.
GitHub stars n/a Velocity flat History pending 3D Reconstruction & SLAM Apr 3 Code High viability
Geometrically-Constrained Radar-Inertial Odometry via Continuous Point-Pose Uncertainty Modeling Build Now
A geometrically-constrained radar-inertial odometry system that improves localization accuracy in challenging environments by dynamically modeling and integrating point and pose uncertainties.
GitHub stars n/a Velocity flat History pending Robotics Apr 3 Code High viability
FusionBERT: Multi-View Image-3D Retrieval via Cross-Attention Visual Fusion and Normal-Aware 3D Encoder Build Now
FusionBERT enables robust multi-view image-to-3D model retrieval by adaptively fusing visual cues and enhancing 3D geometry encoding.
GitHub stars n/a Velocity flat History pending Multimodal Retrieval Apr 2 Code High viability
Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge Build Now
Leverage LLMs as judges for label-free knowledge distillation to significantly improve model reasoning capabilities on unlabeled data.
GitHub stars n/a Velocity flat History pending LLM Fine-tuning Apr 3 Code High viability
IndustryCode: A Benchmark for Industry Code Generation Build Now
A new benchmark for evaluating LLM code generation across diverse industrial domains and programming languages, enabling more robust industrial AI solutions.
GitHub stars n/a Velocity flat History pending Code Generation Apr 3 Code High viability
Rascene: High-Fidelity 3D Scene Imaging with mmWave Communication Signals Build Now
Leverage existing mmWave communication signals for high-fidelity, low-cost 3D environmental perception, overcoming limitations of optical sensors in adverse conditions.
GitHub stars n/a Velocity flat History pending 3D Perception Apr 3 Code High viability
THOM: Generating Physically Plausible Hand-Object Meshes From Text Build Now
THOM generates photorealistic, physically plausible 3D hand-object interactions from text, enhancing VR/AR experiences.
GitHub stars n/a Velocity flat History pending 3D Generation Apr 3 Code High viability
Vision-Based End-to-End Learning for UAV Traversal of Irregular Gaps via Differentiable Simulation Build Now
A vision-based end-to-end framework for autonomous drones to navigate complex, irregular gaps, enhancing inspection and rescue operations.
GitHub stars n/a Velocity flat History pending Autonomous Drones Apr 3 Code High viability
V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views Build Now
A new dataset and benchmark for multimodal LLMs in autonomous driving, along with a specialized MoE model, to improve reasoning across vehicle, infrastructure, and cooperative views.
GitHub stars n/a Velocity flat History pending Autonomous Driving AI Apr 3 Pending High viability
SentinelAgent: Intent-Verified Delegation Chains for Securing Federal Multi-Agent AI Systems Build Now
SentinelAgent provides a formal framework and runtime protocol for verifiable delegation chains in multi-agent AI systems, ensuring policy compliance and forensic traceability.
GitHub stars n/a Velocity flat History pending AI Security Apr 3 Code High viability
Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles Build Now
A self-supervised depth estimation framework for articulated vehicles that leverages cross-vehicle geometric consistency to improve perception in complex robotic platforms.
GitHub stars n/a Velocity flat History pending Autonomous Driving Perception Apr 3 Code High viability
Moondream Segmentation: From Words to Masks Build Now
A vision-language model extension that generates precise image masks from textual descriptions, with a new dataset for improved evaluation.
GitHub stars n/a Velocity flat History pending Vision-Language Models Apr 3 Code High viability
Unlocking Multi-Site Clinical Data: A Federated Approach to Privacy-First Child Autism Behavior Analysis Build Now
A privacy-preserving federated learning framework for early autism behavior analysis in children, enabling multi-site collaboration without centralizing sensitive data.
GitHub stars n/a Velocity flat History pending Medical AI Apr 3 Code High viability
FluxMoE: Decoupling Expert Residency for High-Performance MoE Serving Build Now
A system that significantly boosts LLM serving throughput by intelligently managing expert weights to free up GPU memory for critical runtime data.
GitHub stars n/a Velocity flat History pending LLM Serving Apr 3 Code High viability
When Modalities Remember: Continual Learning for Multimodal Knowledge Graphs Build Now
A novel model for continual learning in multimodal knowledge graphs that prevents catastrophic forgetting and enhances new knowledge acquisition.
GitHub stars n/a Velocity flat History pending Multimodal Knowledge Graphs Apr 3 Code High viability
AutoVerifier: An Agentic Automated Verification Framework Using Large Language Models Build Now
An LLM-powered agent framework that automates the verification of complex technical claims in scientific literature, even without domain expertise.
GitHub stars n/a Velocity flat History pending AI Agents Apr 3 Code High viability
Cross-subject Muscle Fatigue Detection via Adversarial and Supervised Contrastive Learning with Inception-Attention Network Build Now
A novel neural network for robust cross-subject muscle fatigue detection using sEMG signals, enhancing physical rehabilitation.
GitHub stars n/a Velocity flat History pending Medical AI Apr 3 Code High viability
A Unified Perspective on Adversarial Membership Manipulation in Vision Models Build Now
A novel framework to detect and mitigate adversarial attacks that manipulate AI model privacy by making non-training data appear as if it was part of the training set.
GitHub stars n/a Velocity flat History pending AI Security Apr 3 Code High viability
GRADE: Probing Knowledge Gaps in LLMs through Gradient Subspace Dynamics Build Now
A novel method to detect and explain knowledge gaps in LLMs by analyzing gradient dynamics, enabling more responsible deployment.
GitHub stars n/a Velocity flat History pending LLM Safety & Interpretability Apr 3 Code High viability
Token Warping Helps MLLMs Look from Nearby Viewpoints Build Now
A novel token warping technique for multimodal LLMs to improve viewpoint robustness, outperforming existing methods on a new benchmark.
GitHub stars n/a Velocity flat History pending Multimodal LLMs Apr 3 Code High viability
CMCC-ReID: Cross-Modality Clothing-Change Person Re-Identification Build Now
A novel network for person re-identification that handles both changes in clothing and camera modality, addressing a realistic surveillance challenge.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 3 Code High viability
Extracting Money Laundering Transactions from Quasi-Temporal Graph Representation Build Now
A simple and scalable supervised learning framework to detect money laundering transactions, outperforming state-of-the-art models and complementing existing AML systems.
GitHub stars n/a Velocity flat History pending Financial Crime Detection Apr 3 Pending High viability
Modality-Specific Hierarchical Enhancement for RGB-D Camouflaged Object Detection Build Now
A novel framework for camouflaged object detection that enhances RGB and depth features independently before adaptive fusion, outperforming existing methods.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 3 Pending High viability
Adaptive Local Frequency Filtering for Fourier-Encoded Implicit Neural Representations Build Now
Enhance implicit neural representations for modeling complex signals with spatially varying frequencies, improving reconstruction quality and optimization speed.
GitHub stars n/a Velocity flat History pending Implicit Neural Representations Apr 3 Code High viability
Factorized Multi-Resolution HashGrid for Efficient Neural Radiance Fields: Execution on Edge-Devices Build Now
A novel parameter encoding method for efficient on-device neural radiance field training, significantly reducing memory usage while maintaining quality and speed.
GitHub stars n/a Velocity flat History pending 3D Representation Apr 3 Code High viability
Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks Build Now
A novel framework that bridges response-level scores and token-level credit assignment for more accurate LLM instruction following.
GitHub stars n/a Velocity flat History pending LLM Alignment Apr 3 Code High viability
Unlocking Positive Transfer in Incrementally Learning Surgical Instruments: A Self-reflection Hierarchical Prompt Framework Build Now
A framework that improves surgical instrument segmentation by enabling models to learn from past and future knowledge, reducing catastrophic forgetting.
GitHub stars n/a Velocity flat History pending Medical AI Apr 3 Code High viability
CANDLE: Illumination-Invariant Semantic Priors for Color Ambient Lighting Normalization Build Now
A novel method for color ambient lighting normalization using self-supervised features to recover intrinsic object color, achieving state-of-the-art results in challenging conditions.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 3 Pending High viability
Toward an Artificial General Teacher: Procedural Geometry Data Generation and Visual Grounding with Vision-Language Models Build Now
Generate synthetic geometry diagrams and explanations for AI tutors, overcoming domain shift issues with a novel data engine and fine-tuned vision-language models.
GitHub stars n/a Velocity flat History pending Educational AI Apr 3 Code High viability
EnsemHalDet: Robust VLM Hallucination Detection via Ensemble of Internal State Detectors Build Now
A framework that significantly improves the accuracy of detecting factual errors in Vision-Language Models by ensembling multiple internal state detectors.
GitHub stars n/a Velocity flat History pending Vision-Language Models Apr 3 Code High viability
SPG: Sparse-Projected Guides with Sparse Autoencoders for Zero-Shot Anomaly Detection Build Now
A prompt-free framework for zero-shot anomaly detection and segmentation that leverages sparse autoencoders to generate anomaly guides, achieving state-of-the-art pixel-level segmentation.
GitHub stars n/a Velocity flat History pending Anomaly Detection Apr 3 Code High viability
PaveBench: A Versatile Benchmark for Pavement Distress Perception and Interactive Vision-Language Analysis Build Now
A benchmark and framework for interactive vision-language analysis of pavement distress, enabling quantitative assessment and maintenance reasoning.
GitHub stars n/a Velocity flat History pending Vision-Language Analysis Apr 3 Code High viability
High-dimensional Many-to-many-to-many Mediation Analysis Build Now
A statistical framework for high-dimensional mediation analysis to uncover complex genetic-neural-cognitive pathways and improve predictive performance in areas like Alzheimer's research.
GitHub stars n/a Velocity flat History pending Statistical Analysis Apr 3 Pending High viability
RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection Build Now
RayMamba enhances 3D object detection by organizing sparse LiDAR data into geometry-aware sequences, significantly improving performance in challenging long-range scenarios.
GitHub stars n/a Velocity flat History pending 3D Object Detection Apr 3 Code High viability
QuadAgent: A Responsive Agent System for Vision-Language Guided Quadrotor Agile Flight Build Now
A training-free agent system for vision-language guided agile quadrotor flight that decouples reasoning and control for improved efficiency and responsiveness.
GitHub stars n/a Velocity flat History pending Robotics Agents Apr 3 Code High viability
CharTool: Tool-Integrated Visual Reasoning for Chart Understanding Build Now
A multimodal LLM that uses tools for precise chart understanding and numerical reasoning, outperforming existing models on key benchmarks.
GitHub stars n/a Velocity flat History pending Multimodal LLMs Apr 3 Code High viability
MFE: A Multimodal Hand Exoskeleton with Interactive Force, Pressure and Thermo-haptic Feedback Build Now
A multimodal hand exoskeleton providing rich force, pressure, and thermal haptic feedback for enhanced robotic teleoperation and VR experiences.
GitHub stars n/a Velocity flat History pending Robotics & Haptics Apr 3 Code High viability
Council Mode: Mitigating Hallucination and Bias in LLMs via Multi-Agent Consensus Build Now
A multi-agent consensus framework that significantly reduces LLM hallucinations and biases by synthesizing outputs from diverse frontier models.
GitHub stars n/a Velocity flat History pending LLM Hallucination Mitigation Apr 3 Code High viability
NavCrafter: Exploring 3D Scenes from a Single Image Build Now
Generate explorable 3D scenes from a single image with controllable camera movement and high fidelity.
GitHub stars n/a Velocity flat History pending 3D Scene Generation Apr 3 Code High viability
Student-in-the-Loop Chain-of-Thought Distillation via Generation-Time Selection Build Now
A framework that distills complex reasoning from large language models to smaller ones by selecting only the most learnable reasoning paths during generation.
GitHub stars n/a Velocity flat History pending LLM Distillation Apr 3 Code High viability
RAGE: A Tightly Coupled Radar-Aided Grip Estimator For Autonomous Race Cars Build Now
A real-time friction estimator for autonomous race cars using standard sensors, enabling safer and more effective operation at physical limits.
GitHub stars n/a Velocity flat History pending Autonomous Driving Apr 3 Code High viability
Photonic convolutional neural network with pre-trained in-situ training Watch
A fully photonic convolutional neural network for energy-efficient image classification, validated with a hybrid training approach.
GitHub stars n/a Velocity flat History pending Photonic Computing Apr 2 Code
Synapse: Evolving Job-Person Fit with Explainable Two-phase Retrieval and LLM-guided Genetic Resume Optimization Watch
A two-phase AI system that improves job-person fit by efficiently retrieving candidates and then optimizing resumes using LLMs and genetic algorithms.
Recruitment AI Apr 2
Train Yourself as an LLM: Exploring Effects of AI Literacy on Persuasion via Role-playing LLM Training Watch
An interactive AI literacy tutorial that trains users to understand LLM persuasion tactics, reducing susceptibility to AI influence.
GitHub stars n/a Velocity flat History 1 snapshot AI Literacy & Persuasion Apr 3
Pragmatics Meets Culture: Culturally-adapted Artwork Description Generation and Evaluation Watch
Generate artwork descriptions tailored to specific cultural audiences to improve comprehension and engagement.
Cultural AI Apr 2
Feature Attribution Stability Suite: How Stable Are Post-Hoc Attributions? Watch
A benchmark suite to rigorously evaluate the stability of AI feature attribution methods in vision systems, revealing critical insights into their reliability under real-world perturbations.
GitHub stars n/a Velocity flat History pending Computer Vision Explainability Apr 2 Code
A Rapid Instrument Exchange System for Humanoid Robots in Minimally Invasive Surgery Watch
A teleoperated system enabling humanoid robots to rapidly and efficiently exchange surgical instruments, reducing complexity and cognitive load for surgeons.
GitHub stars n/a Velocity flat History pending Robotics for Surgery Apr 3 Code
Dependency-Guided Parallel Decoding in Discrete Diffusion Language Models Watch
Accelerate discrete diffusion language model text generation by predicting token dependencies to improve quality and speed.
LLM Generation Apr 2
Learning Locomotion on Complex Terrain for Quadrupedal Robots with Foot Position Maps and Stability Rewards Watch
A new reinforcement learning approach for quadrupedal robots that uses foot position maps and stability rewards to achieve precise and stable locomotion on complex, unseen terrains.
GitHub stars n/a Velocity flat History pending Robotics Apr 3 Code
Interpretable Deep Reinforcement Learning for Element-level Bridge Life-cycle Optimization Watch
An interpretable reinforcement learning framework optimizes bridge life-cycle management using element-level condition data, producing understandable and auditable decision trees.
GitHub stars n/a Velocity flat History pending AI for Infrastructure Management Apr 2 Code
VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors Watch
This research identifies a critical limitation in Vision Language Models, showing they prioritize semantic understanding over visual detail, and proposes methods to improve their fine-grained visual reasoning capabilities.
GitHub stars n/a Velocity flat History pending Vision Language Models Apr 2 Code
When simulations look right but causal effects go wrong: Large language models as behavioral simulators Watch
This research evaluates LLMs' ability to simulate causal intervention effects, revealing a divergence between descriptive fit and causal fidelity, crucial for reliable behavioral simulations.
GitHub stars n/a Velocity flat History pending LLM Evaluation Apr 2 Code
ROMAN: A Multiscale Routing Operator for Convolutional Time Series Models Watch
A novel operator for convolutional time series models that improves efficiency and accuracy by creating a multiscale, position-aware representation.
GitHub stars n/a Velocity flat History pending Time Series Analysis Apr 2 Pending
Parser-Oriented Structural Refinement for a Stable Layout Interface in Document Parsing Watch
A structural refinement module stabilizes document parsing pipelines by ensuring consistent input order for downstream parsers, significantly reducing errors on complex layouts.
GitHub stars n/a Velocity flat History pending Document Parsing Apr 3 Code
JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency Watch
An efficient Mixture-of-Experts LLM that significantly improves token efficiency and inference throughput for mid-scale models.
LLM Training Apr 3
Towards Near-Real-Time Telemetry-Aware Routing with Neural Routing Algorithms Watch
A framework for training and evaluating neural routing algorithms that explicitly model communication and inference delays, outperforming traditional methods in realistic network conditions.
Network Routing AI Apr 3
Explicit Time-Frequency Dynamics for Skeleton-Based Gait Recognition Watch
Enhance skeleton-based gait recognition by adding explicit time-frequency dynamics to existing models, improving performance under challenging conditions.
Computer Vision Apr 3
Verbalizing LLMs' assumptions to explain and control sycophancy Watch
A framework to understand and control LLM sycophancy by verbalizing and steering their underlying assumptions.
GitHub stars n/a Velocity flat History pending LLM Safety & Control Apr 3 Code
Enhancing Multi-Robot Exploration Using Probabilistic Frontier Prioritization with Dirichlet Process Gaussian Mixtures Watch
A probabilistic approach to frontier prioritization enhances multi-robot exploration efficiency by 10-14% in complex environments.
Robotics Apr 3
An Asynchronous Two-Speed Kalman Filter for Real-Time UUV Cooperative Navigation Under Acoustic Delays Watch
A novel asynchronous Kalman filter with variational history distillation enables real-time cooperative navigation for underwater vehicles despite significant acoustic communication delays.
GitHub stars n/a Velocity flat History pending Robotics Navigation Apr 3 Code
Prompt Compression in the Wild: Measuring Latency, Rate Adherence, and Quality for Faster LLM Inference Watch
Accelerate LLM inference by intelligently compressing prompts, offering significant speed-ups and reduced memory usage with a predictive profiler.
LLM Inference Optimization Apr 3
StoryScope: Investigating idiosyncrasies in AI fiction Watch
A novel pipeline that detects AI-generated fiction by analyzing narrative structure, not just style, offering a robust method for authorship attribution and content verification.
AI Content Detection Apr 3
HyperFitS -- Hypernetwork Fitting Spectra for metabolic quantification of ${}^1$H MR spectroscopic imaging Watch
A hypernetwork for rapid and configurable metabolic quantification in brain MRI, reducing processing time from hours to seconds.
Medical AI Apr 3
Reliability Gated Multi-Teacher Distillation for Low Resource Abstractive Summarization Watch
A novel distillation method for abstractive summarization in low-resource languages that improves performance and reduces model size.
GitHub stars n/a Velocity flat History pending Low Resource NLP Apr 3 Code
Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems Watch
This research identifies a novel supply-chain attack vector for LLM coding agents by embedding malicious logic in skill documentation, bypassing existing defenses.
LLM Security Apr 3
BioUNER: A Benchmark Dataset for Clinical Urdu Named Entity Recognition Watch
A benchmark dataset for clinical Urdu Named Entity Recognition to advance NLP capabilities in under-resourced languages.
GitHub stars n/a Velocity flat History pending NLP for Low-Resource Languages Apr 3 Code
Explainable Machine Learning Reveals 12-Fold Ucp1 Upregulation and Thermogenic Reprogramming in Female Mouse White Adipose Tissue After 37 Days of Microgravity: First AI/ML Analysis of NASA OSD-970 Watch
Leveraging explainable AI to uncover microgravity's impact on thermogenesis in female white adipose tissue, with implications for astronaut health and metabolic disease research.
GitHub stars n/a Velocity flat History pending Biomedical AI Apr 3 Pending
Multi-Aspect Knowledge Distillation for Language Model with Low-rank Factorization Watch
A novel knowledge distillation method for language models that captures richer language knowledge by mimicking self-attention and feed-forward modules.
GitHub stars n/a Velocity flat History pending LLM Compression Apr 3 Code
A Data-Centric Vision Transformer Baseline for SAR Sea Ice Classification Watch
A data-centric Vision Transformer baseline for improved SAR sea ice classification, offering a more useful precision-recall trade-off for rare ice classes.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 3 Code
PR3DICTR: A modular AI framework for medical 3D image-based detection and outcome prediction Watch
A modular framework for developing 3D medical image classification and prediction models with pre-established functionality.
Medical AI Apr 3
A Tsetlin Machine-driven Intrusion Detection System for Next-Generation IoMT Security Watch
A Tsetlin Machine-based Intrusion Detection System for IoMT networks that offers interpretable insights into cyberattacks.
GitHub stars n/a Velocity flat History pending IoMT Security Apr 3 Code
AlertStar: Path-Aware Alert Prediction on Hyper-Relational Knowledge Graphs Watch
AlertStar enhances network intrusion detection by predicting alerts through advanced hyper-relational knowledge graph modeling.
GitHub stars n/a Velocity flat History pending Cybersecurity Apr 3 Code
AI-Assisted Unit Test Writing and Test-Driven Code Refactoring: A Case Study Watch
Automate unit test generation and safe code refactoring using AI to accelerate software development and reduce regression risk.
AI-Assisted Software Engineering Apr 3
Not All Frames Deserve Full Computation: Accelerating Autoregressive Video Generation via Selective Computation and Predictive Extrapolation Watch
Accelerate autoregressive video generation by intelligently reusing computations and predicting future frames, achieving significant speedups without retraining.
Generative Video Apr 3
ProtoFlow: Mitigating Forgetting in Class-Incremental Remote Sensing Segmentation via Low-Curvature Prototype Flow Watch
A framework for continual remote sensing segmentation that mitigates forgetting by modeling class prototype evolution as low-curvature trajectories.
GitHub stars n/a Velocity flat History pending Remote Sensing AI Apr 3 Code
PRISM: LLM-Guided Semantic Clustering for High-Precision Topics Watch
A topic modeling framework that uses LLMs to guide semantic clustering for precise topic discovery and analysis.
Topic Modeling with LLMs Apr 3
Progressive Video Condensation with MLLM Agent for Long-form Video Understanding Watch
An agent that progressively condenses long videos into keyframes for efficient multimodal LLM reasoning, achieving state-of-the-art accuracy with reduced computational cost.
Video Understanding Apr 3
Revealing the Learning Dynamics of Long-Context Continual Pre-training Ignore
A framework for monitoring and evaluating the learning dynamics of large-scale continual pre-training for industrial LLMs, revealing insights into data scaling and training stability.
GitHub stars n/a Velocity flat History pending LLM Training Apr 3 Code
Time-Warping Recurrent Neural Networks for Transfer Learning Ignore
A novel time-warping method for Recurrent Neural Networks enhances transfer learning accuracy in predicting time-varying physical systems.
Transfer Learning for Time Series Apr 2
WGFINNs: Weak formulation-based GENERIC formalism informed neural networks' Ignore
WGFINNs enhance the robustness of neural networks in scientific machine learning by integrating weak formulations to handle noisy data effectively.
GitHub stars n/a Velocity flat History pending Scientific Machine Learning Apr 3 Code
AdaHOP: Fast and Accurate Low-Precision Training via Outlier-Pattern-Aware Rotation Ignore
A novel low-precision training method for LLMs that adapts transform strategies based on outlier patterns to achieve significant memory and speed improvements.
LLM Training Apr 2
Overcoming the "Impracticality" of RAG: Proposing a Real-World Benchmark and Multi-Dimensional Diagnostic Framework Ignore
A new benchmark and diagnostic framework to evaluate the real-world performance of Retrieval-Augmented Generation systems in enterprise settings.
GitHub stars n/a Velocity flat History pending RAG Evaluation Apr 3 Code
An Empirical Study of Many-Shot In-Context Learning for Machine Translation of Low-Resource Languages Ignore
Empirically studying many-shot in-context learning to improve machine translation for low-resource languages by optimizing example retrieval and selection.
Machine Translation Apr 3
GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers Ignore
A new benchmark and agent framework to evaluate and improve LLMs' ability to autonomously discover software bugs in game development.
GitHub stars n/a Velocity flat History pending AI for Software Engineering Apr 3 Code
Complex-Valued GNNs for Distributed Basis-Invariant Control of Planar Systems Ignore
A novel graph neural network architecture for distributed control of planar systems that is invariant to local sensor orientation, improving data efficiency and generalization.
Robotics Control Apr 3
A Spectral Framework for Multi-Scale Nonlinear Dimensionality Reduction Ignore
A spectral framework for multi-scale nonlinear dimensionality reduction that bridges global and local structure with analytical transparency.
GitHub stars n/a Velocity flat History pending Dimensionality Reduction Apr 2 Code
Evolution and Perspectives of the Keep IT Secure Ecosystem:A Six-Year Analysis of Cybersecurity Experts Supporting Belgian SMEs Ignore
A framework for validating cybersecurity experts to improve SME security posture.
GitHub stars n/a Velocity flat History pending Cybersecurity Apr 2 Code
I must delete the evidence: AI Agents Explicitly Cover up Fraud and Violent Crime Ignore
This research demonstrates that current AI agents can be manipulated to suppress evidence of fraud and harm for corporate gain, highlighting a critical security vulnerability.
AI Agents Apr 2
From Impact to Insight: Dynamics-Aware Proprioceptive Terrain Sensing on Granular Media Ignore
A physics-based framework for robots to accurately characterize deformable terrain during high-speed locomotion using proprioceptive sensing.
GitHub stars n/a Velocity flat History pending Robotics Apr 2 Code
Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets Ignore
This research demonstrates that single-agent LLMs can outperform multi-agent systems on complex reasoning tasks when computational resources are normalized, challenging the perceived benefits of multi-agent architectures.
GitHub stars n/a Velocity flat History pending LLM Agents Apr 2 Code
LitPivot: Developing Well-Situated Research Ideas Through Dynamic Contextualization and Critique within the Literature Landscape Ignore
A tool to help researchers develop novel research ideas by dynamically linking literature review with idea refinement.
Research Tools Apr 3
Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits Ignore
This research theoretically and empirically evaluates the effectiveness of using LLM-generated data to initialize bandit algorithms, identifying critical thresholds for data corruption and misalignment that impact performance.
GitHub stars n/a Velocity flat History pending LLM Agents Apr 2 Code
Transfer Learning for Meta-analysis Under Covariate Shift Ignore
A framework for more accurate meta-analysis by leveraging source trial outcomes as proxy signals and target trial placebo outcomes as gold labels to calibrate baseline risk.
GitHub stars n/a Velocity flat History pending Causal Inference Apr 3 Code
Generalization Limits of Reinforcement Learning Alignment Ignore
Develops novel 'compound jailbreak' techniques to expose generalization failures in LLM safety alignment, demonstrating a significant increase in attack success rates.
LLM Safety Apr 3
Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web Agents Ignore
This research introduces a novel attack vector for LLM-based web agents, demonstrating how environmental contamination can lead to persistent memory poisoning and cross-session, cross-site compromise.
LLM Security Apr 3
Conditional Sampling via Wasserstein Autoencoders and Triangular Transport Ignore
A new framework for conditional simulation that reduces approximation error in low-dimensional problems.
GitHub stars n/a Velocity flat History pending Generative Models Apr 3 Code
Communication-Efficient Distributed Learning with Differential Privacy Ignore
A differentially private algorithm for distributed learning that improves communication efficiency and data privacy.
Distributed Learning with Privacy Apr 2
Analytic Drift Resister for Non-Exemplar Continual Graph Learning Ignore
A novel framework for continual graph learning that resists feature drift and achieves zero-forgetting class-incremental learning.
GitHub stars n/a Velocity flat History pending Graph Neural Networks Apr 3 Code
Automated Malware Family Classification using Weighted Hierarchical Ensembles of Large Language Models Ignore
A zero-label malware classification framework using weighted hierarchical ensembles of LLMs to improve robustness and analyst-style reasoning.
GitHub stars n/a Velocity flat History pending Malware Classification Apr 2 Code
Tune to Learn: How Controller Gains Shape Robot Policy Learning Ignore
Optimize robot controller gains based on the learning algorithm, not just task compliance, to improve policy learning.
Robotics Learning Apr 2
Self-Directed Task Identification Ignore
A framework for models to autonomously identify target variables in datasets without pre-training, reducing manual annotation effort.
GitHub stars n/a Velocity flat History pending Autonomous Learning Apr 2 Code
Structure-Preserving Multi-View Embedding Using Gromov-Wasserstein Optimal Transport Ignore
A novel approach to multi-view data analysis using Gromov-Wasserstein optimal transport to preserve intrinsic relational structure across heterogeneous data representations.
GitHub stars n/a Velocity flat History pending Representation Learning Apr 3 Code
Low-Rank Compression of Pretrained Models via Randomized Subspace Iteration Ignore
A new randomized subspace iteration method for efficient low-rank compression of large pretrained models that improves approximation quality and predictive accuracy.
GitHub stars n/a Velocity flat History pending LLM Compression Apr 3 Code
Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models Ignore
A framework to analyze and improve multimodal reasoning models by understanding and leveraging model hallucination during reinforcement learning post-training.
GitHub stars n/a Velocity flat History pending Multimodal Reasoning Apr 3 Code
Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy Ignore
A new benchmark reveals LLMs struggle with formal reasoning tasks, highlighting inefficiencies and the continued need for traditional algorithms.
GitHub stars n/a Velocity flat History pending LLM Reasoning Apr 3 Code
Can VLMs Truly Forget? Benchmarking Training-Free Visual Concept Unlearning Ignore
A new benchmark to rigorously evaluate the ability of large vision-language models to forget specific visual concepts without retraining.
GitHub stars n/a Velocity flat History pending LLM Evaluation Apr 3 Code
Towards Secure Agent Skills: Architecture, Threat Taxonomy, and Security Analysis Ignore
This paper analyzes security vulnerabilities in the Agent Skills framework for LLM agents, proposing a threat taxonomy and defense strategies.
AI Agents Apr 3
Enhancing Robustness of Federated Learning via Server Learning Ignore
A federated learning enhancement that improves model robustness against malicious attacks using server-side learning and client update filtering.
GitHub stars n/a Velocity flat History pending Federated Learning Apr 3 Code
Learning from Synthetic Data via Provenance-Based Input Gradient Guidance Ignore
A new framework for training computer vision models with synthetic data that uses provenance information to guide learning towards relevant input regions, improving robustness and reducing reliance on synthesis artifacts.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 3 Code
Open Challenges for Secure and Scalable Wi-Fi Connectivity in Rural Areas Ignore
Securing pay-for-use Wi-Fi hotspots in rural areas by addressing hijacking and rogue hotspot vulnerabilities.
GitHub stars n/a Velocity flat History pending Rural Connectivity Security Apr 3 Code
QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models Ignore
A framework for co-optimizing vision token pruning and quantization to enable efficient deployment of multimodal LLMs.
LLM Compression Apr 3
Too Polite to Disagree: Understanding Sycophancy Propagation in Multi-Agent Systems Ignore
A method to mitigate sycophancy in multi-agent LLM discussions by providing peer sycophancy rankings, improving accuracy.
Multi-Agent LLM Behavior Apr 3
Differentiable Stroke Planning with Dual Parameterization for Efficient and High-Fidelity Painting Creation Ignore
A novel dual parameterization for stroke-based rendering that improves structural coherence and efficiency.
Generative Art Apr 3
HiDiGen: Hierarchical Diffusion for B-Rep Generation with Explicit Topological Constraints Ignore
A hierarchical diffusion model for generating topologically valid 3D CAD models by decoupling geometry and topology.
3D CAD Generation Apr 3
Adaptive Semantic Communication for Wireless Image Transmission Leveraging Mixture-of-Experts Mechanism Ignore
An adaptive semantic communication system for wireless image transmission that uses a Mixture-of-Experts mechanism to improve reconstruction quality and efficiency.
Wireless Communication AI Apr 3
Coupled Control, Structured Memory, and Verifiable Action in Agentic AI (SCRAT -- Stochastic Control with Retrieval and Auditable Trajectories): A Comparative Perspective from Squirrel Locomotion and Scatter-Hoarding Ignore
A new framework for agentic AI that integrates control, memory, and verification, inspired by squirrel behavior, to improve robustness and reduce errors.
GitHub stars n/a Velocity flat History pending Agentic AI Apr 3 Code
Comparing the Impact of Pedagogy-Informed Custom and General-Purpose GAI Chatbots on Students' Science Problem-Solving Processes and Performance Using Heterogeneous Interaction Network Analysis Ignore
A pedagogy-informed custom AI chatbot designed to enhance student science problem-solving by fostering cognitive engagement over direct answers.
GitHub stars n/a Velocity flat History pending Educational AI Apr 3 Code
Safety-Critical Centralized Nonlinear MPC for Cooperative Payload Transportation by Two Quadrupedal Robots Ignore
A safety-critical NMPC framework for cooperative payload transportation by two quadrupedal robots, validated on hardware.
Robotics Control Apr 3
Joint Prediction of Human Motions and Actions in Human-Robot Collaboration Ignore
A probabilistic framework for robots to jointly predict human movements and actions for improved collaboration.
Human-Robot Interaction Apr 3
Multiple-Debias: A Full-process Debiasing Method for Multilingual Pre-trained Language Models Ignore
A method to reduce biases in multilingual language models across multiple sensitive attributes and languages.
LLM Debiasing Apr 3
Effect of Input Resolution on Retinal Vessel Segmentation Performance: An Empirical Study Across Five Datasets Ignore
This research empirically studies the impact of image resolution on retinal vessel segmentation, revealing a critical trade-off for thin vessel detection that standard metrics overlook.
GitHub stars n/a Velocity flat History pending Medical AI Apr 3 Code
Inversion-Free Natural Gradient Descent on Riemannian Manifolds Ignore
An inversion-free natural gradient descent method for optimizing probability distributions on Riemannian manifolds, offering improved convergence and constraint enforcement for applications like variational Bayes and normalizing flows.
GitHub stars n/a Velocity flat History pending Optimization Algorithms Apr 3 Code
Beyond Precision: Importance-Aware Recall for Factuality Evaluation in Long-Form LLM Generation Ignore
A framework to evaluate LLM factuality by jointly measuring precision and recall, highlighting factual incompleteness as a key limitation.
LLM Evaluation Apr 3
Cross Event Detection and Topic Evolution Mining in cross events for Man Made Disasters in Social Media Streams Ignore
A framework for detecting and analyzing the evolution of related events in social media streams to understand their impact on human actions.
GitHub stars n/a Velocity flat History pending Social Media Event Analysis Apr 3 Code
The Compression Gap: Why Discrete Tokenization Limits Vision-Language-Action Model Scaling Ignore
This research identifies a critical bottleneck in vision-language-action model scaling, suggesting a new direction for improving robotic manipulation performance by addressing information flow rather than just model size.
GitHub stars n/a Velocity flat History pending Robotics Apr 3 Code
How Annotation Trains Annotators: Competence Development in Social Influence Recognition Ignore
This research explores how social influence impacts annotator competence, leading to improved data quality and LLM performance, with potential applications in optimizing annotation pipelines.
GitHub stars n/a Velocity flat History pending AI Training & Annotation Apr 3 Code
Finding Belief Geometries with Sparse Autoencoders Ignore
A pipeline to discover and validate belief-like geometric structures within large language model representations.
GitHub stars n/a Velocity flat History pending LLM Interpretability Apr 3 Code
Generative Frontiers: Why Evaluation Matters for Diffusion Language Models Ignore
This research proposes a new principled method for evaluating diffusion language models, addressing limitations in current benchmarks and metrics to ensure reliable comparisons of generative quality.
GitHub stars n/a Velocity flat History pending LLM Evaluation Apr 3 Code
Understanding Latent Diffusability via Fisher Geometry Ignore
A theoretical framework to diagnose and improve latent diffusion model performance by analyzing geometric properties of the latent space.
GitHub stars n/a Velocity flat History pending Diffusion Models Apr 3 Code
A Numerical Method for Coupling Parameterized Physics-Informed Neural Networks and FDM for Advanced Thermal-Hydraulic System Simulation Ignore
A novel hybrid AI-FDM method accelerates nuclear safety simulations by learning solution manifolds for parameterized physics-informed neural networks, eliminating retraining needs for varying problem parameters.
Physics-Informed AI for Simulation Apr 3
PolyReal: A Benchmark for Real-World Polymer Science Workflows Ignore
A new benchmark for evaluating multimodal LLMs on real-world polymer science workflows, revealing significant gaps in practical application capabilities.
GitHub stars n/a Velocity flat History pending Scientific Benchmarking Apr 3 Code
An Independent Safety Evaluation of Kimi K2.5 Ignore
An independent safety evaluation of the Kimi K2.5 LLM reveals significant dual-use capabilities and amplified risks in open-weight models, urging developers to prioritize systematic safety assessments.
GitHub stars n/a Velocity flat History pending LLM Safety Evaluation Apr 3 Code
Asymptotically-Bounded 3D Frontier Exploration enhanced with Bayesian Information Gain Ignore
A more efficient 3D robotic exploration algorithm that reduces computational demands by focusing on frontiers and using Bayesian information gain for viewpoint prioritization.
Robotics Apr 3
Analysis of Optimality of Large Language Models on Planning Problems Ignore
LLMs demonstrate near-perfect optimality in complex planning problems, outperforming traditional planners by leveraging algorithmic simulation and geometric memory.
GitHub stars n/a Velocity flat History pending LLM Reasoning Apr 3 Code
Beyond Semantic Manipulation: Token-Space Attacks on Reward Models Ignore
A novel attack framework that exploits reward models by directly manipulating token sequences, bypassing semantic understanding to achieve high reward scores with nonsensical outputs.
LLM Security Apr 3
Analyzing Healthcare Interoperability Vulnerabilities: Formal Modeling and Graph-Theoretic Approach Ignore
A formal graph-based model to detect race conditions in healthcare interoperability platforms like FHIR.
Healthcare AI Apr 3
FedSQ: Optimized Weight Averaging via Fixed Gating Ignore
A federated learning method that stabilizes training by freezing structural components of pretrained models to improve robustness and reduce convergence time.
Federated Learning Apr 3
A Systematic Security Evaluation of OpenClaw and Its Variants Ignore
This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.
GitHub stars n/a Velocity flat History pending AI Agent Security Apr 3 Code
Do Agent Societies Develop Intellectual Elites? The Hidden Power Laws of Collective Cognition in LLM Multi-Agent Systems Ignore
This research uncovers fundamental laws governing the collective intelligence of LLM multi-agent systems, identifying a key bottleneck and proposing a mechanism to improve their scalability and performance.
LLM Agents Apr 3
Information-Regularized Constrained Inversion for Stable Avatar Editing from Sparse Supervision Ignore
A framework for stable editing of animatable human avatars from sparse supervision by performing constrained inversion in a structured latent space.
3D Avatar Editing Apr 3
Rethinking Forward Processes for Score-Based Data Assimilation in High Dimensions Ignore
A novel measurement-aware score-based filter for more accurate and stable data assimilation in high-dimensional systems.
GitHub stars n/a Velocity flat History pending Data Assimilation Apr 3 Code
Breakdowns in Conversational AI: Interactional Failures in Emotionally and Ethically Sensitive Contexts Ignore
This research identifies and categorizes breakdowns in conversational AI when dealing with emotional and ethical complexities, offering a path to more robust and aligned dialogue systems.
GitHub stars n/a Velocity flat History pending Conversational AI Safety Apr 3 Code
On the Geometric Structure of Layer Updates in Deep Language Models Ignore
This paper analyzes the geometric structure of layer updates in deep language models to understand how representations change between layers, identifying a dominant tokenwise component and a distinct residual component.
LLM Research Apr 2
SEDGE: Structural Extrapolated Data Generation Ignore
A theoretical framework for generating new data based on assumptions about the data generation process.
Data Generation Apr 2
Skeleton-based Coherence Modeling in Narratives Ignore
This research explores using sentence skeletons to model narrative coherence, finding that sentence-level analysis remains superior.
NLP Coherence Modeling Apr 2
Failing to Falsify: Evaluating and Mitigating Confirmation Bias in Language Models Ignore
This research explores and mitigates confirmation bias in language models to enhance their reasoning capabilities.
LLM Bias Mitigation Apr 2
Towards Realistic Class-Incremental Learning with Free-Flow Increments Ignore
A model-agnostic framework for robust class-incremental learning that handles variable class arrivals, stabilizing learning signals and improving performance.
Class-Incremental Learning Apr 3
LLM-based Atomic Propositions help weak extractors: Evaluation of a Propositioner for triplet extraction Ignore
A model that enhances triplet extraction from text using atomic propositions.
Knowledge Graph Extraction Apr 3
Do Audio-Visual Large Language Models Really See and Hear? Ignore
This paper analyzes the internal workings of audio-visual large language models to understand modality biases, not a product pitch.
Multimodal LLMs Apr 3
Product-Stability: Provable Convergence for Gradient Descent on the Edge of Stability Ignore
This paper provides a theoretical framework for understanding gradient descent convergence at the edge of stability for a broader class of loss functions.
LLM Training Apr 3
Learning interacting particle systems from unlabeled data Ignore
A theoretical framework for learning interacting particle systems from unlabeled data without trajectory information.
Scientific Simulation Apr 2
Gradient Boosting within a Single Attention Layer Ignore
A novel attention mechanism that applies gradient boosting within a single layer to improve Transformer performance.
LLM Training Apr 3
LieTrunc-QNN: Lie Algebra Truncation and Quantum Expressivity Phase Transition from LiePrune to Provably Stable Quantum Neural Networks Ignore
A theoretical framework for improving the trainability and stability of quantum neural networks using Lie algebra.
Quantum Machine Learning Apr 3
Toys that listen, talk, and play: Understanding Children's Sensemaking and Interactions with AI Toys Ignore
This research explores how children understand and interact with AI toys, identifying design implications for more responsible and developmentally appropriate AI toy development.
Human-AI Interaction Apr 3
Efficient Logistic Regression with Mixture of Sigmoids Ignore
This paper presents a computationally efficient algorithm for online logistic regression with theoretical guarantees, improving upon existing complexity bounds.
Online Learning Algorithms Apr 3
One Model to Translate Them All? A Journey to Mount Doom for Multilingual Model Merging Ignore
This research explains why merging fine-tuned language models fails for multilingual translation, identifying representational divergence as the cause.
LLM Training Apr 3
Self-Optimizing Multi-Agent Systems for Deep Research Ignore
Self-optimizing multi-agent systems for deep research that adapt and improve their prompt strategies.
Agents Apr 3
Random Is Hard to Beat: Active Selection in online DPO with Modern LLMs Ignore
This research investigates active preference learning for LLMs, finding random sampling to be a surprisingly effective baseline and questioning the value of complex active selection strategies.
GitHub stars n/a Velocity flat History pending LLM Training Apr 3 Pending
A Comprehensive Framework for Long-Term Resiliency Investment Planning under Extreme Weather Uncertainty for Electric Utilities Ignore
A framework for electric utilities to optimize capital investments for extreme weather resiliency using digital twins and Monte Carlo simulations.
Energy Infrastructure Planning Apr 2
The Eleventh NTIRE 2026 Efficient Super-Resolution Challenge Report Ignore
Develops efficient single-image super-resolution networks that significantly reduce computational cost while maintaining high image quality.
GitHub stars n/a Velocity flat History pending Image Super-Resolution Apr 3 Code
Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training Ignore
A 4D parallel framework for scalable mini-batch GNN training that significantly speeds up distributed learning on large graphs.
GitHub stars n/a Velocity flat History pending GNN Training Apr 3 Code
Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints Ignore
This research demonstrates that simple vocabulary constraints, rather than complex linguistic manipulations, can improve LLM reasoning by acting as output regularizers.
LLM Reasoning Apr 3
Improving Role Consistency in Multi-Agent Collaboration via Quantitative Role Clarity Ignore
This paper proposes a method to enhance role consistency in multi-agent systems using a quantitative role clarity approach.
Multi-Agent Systems Apr 3
Robust Learning with Optimal Error Ignore
Develops theoretical algorithms for learning with adversarial noise, improving optimal error rates using randomized hypotheses.
Learning Theory Apr 2
State estimations and noise identifications with intermittent corrupted observations via Bayesian variational inference Ignore
A novel adaptive Kalman filter for state estimation in sensor networks with intermittent packet dropouts and corrupted observations.
State Estimation Apr 3
High Volatility and Action Bias Distinguish LLMs from Humans in Group Coordination Ignore
This research investigates the coordination capabilities of LLMs compared to humans in group tasks, identifying key behavioral differences to inform future agent development.
GitHub stars n/a Velocity flat History pending Agents Apr 2 Code
Understanding the Effects of Safety Unalignment on Large Language Models Ignore
This research analyzes how safety alignment in LLMs can be compromised by specific techniques, revealing that one method (WO) significantly enhances malicious capabilities while the other (JT) has less impact, and proposes a mitigation strategy using supervised fine-tuning.
LLM Safety & Alignment Apr 2
GP-4DGS: Probabilistic 4D Gaussian Splatting from Monocular Video via Variational Gaussian Processes Ignore
A novel framework for probabilistic 4D scene reconstruction from monocular video, offering uncertainty quantification and motion estimation for unobserved regions.
3D Reconstruction Apr 3
Social Meaning in Large Language Models: Structure, Magnitude, and Pragmatic Prompting Ignore
This paper investigates how well large language models understand social meaning and explores prompting techniques to improve their accuracy in this area.
LLM Reasoning Apr 2
Beyond the Parameters: A Technical Survey of Contextual Enrichment in Large Language Models: From In-Context Prompting to Causal Retrieval-Augmented Generation Ignore
This paper surveys methods for enriching LLM knowledge at inference time, from simple prompting to advanced retrieval techniques, to improve reasoning and reduce reliance on static parameters.
LLM Augmentation Apr 3
Quotient-Based Posterior Analysis for Euclidean Latent Space Models Ignore
A theoretical framework for analyzing latent space models in network analysis by providing canonical posterior summaries.
Statistical Network Analysis Apr 3
The Quantum-Cryptographic Co-evolution Ignore
A framework for understanding the transition to quantum-resistant cryptography by mapping resilience and computational capability.
Cryptography Apr 3
Understanding the Nature of Generative AI as Threshold Logic in High-Dimensional Space Ignore
This paper theoretically explores generative AI through the lens of threshold logic and high-dimensional geometry, offering a new perspective on neural computation.
AI Theory Apr 2
Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization Ignore
This paper provides a theoretical framework for understanding the behavior of empirical risk minimization in high-dimensional, non-Gaussian settings, extending existing Gaussian universality theorems.
Statistical Learning Theory Apr 3
Generating DDPM-based Samples from Tilted Distributions Ignore
This paper presents a theoretical framework for generating diffusion-based samples from tilted distributions.
Generative Modeling Apr 3
LLM+Graph@VLDB'2025 Workshop Summary Ignore
This workshop report summarizes research directions at the intersection of LLMs and graph data management, highlighting challenges and solutions for practical applications.
LLM+Graph Research Apr 3
Generative AI Use in Entrepreneurship: An Integrative Review and an Empowerment-Entrapment Framework Ignore
This paper reviews how generative AI impacts entrepreneurs, proposing a framework to understand its empowering and entrapping effects across the entrepreneurial lifecycle.
Entrepreneurship AI Apr 2
Disrupting Cognitive Passivity: Rethinking AI-Assisted Data Literacy through Cognitive Alignment Ignore
A framework for human-AI interaction that aligns AI's response mode with user cognitive demand to foster data literacy and prevent cognitive passivity.
Human-AI Interaction Apr 3
Corporations Constitute Intelligence Ignore
This paper analyzes the legal and democratic shortcomings of corporate AI constitutions, arguing for the need for a democratic body to govern AI behavior.
GitHub stars n/a Velocity flat History pending AI Governance Apr 3 Code
From Elevation Maps To Contour Lines: SVM and Decision Trees to Detect Violin Width Reduction Ignore
This research explores automated violin width reduction detection using 3D photogrammetry and machine learning, comparing different feature engineering approaches.
Computer Vision Apr 2
Lipschitz bounds for integral kernels Ignore
This paper theoretically characterizes the Lipschitz continuity of feature maps for integral kernels, offering insights into the robustness of kernel methods and neural networks.
Kernel Methods Theory Apr 3
Speaking of Language: Reflections on Metalanguage Research in NLP Ignore
This paper explores the concept of metalanguage in NLP and LLMs, identifying future research directions.
NLP Research Apr 3
Frame Theoretical Derivation of Three Factor Learning Rule for Oja's Subspace Rule Ignore
This paper provides a theoretical derivation of a learning rule for principal component analysis using frame theory, offering a new mathematical perspective on biologically plausible learning.
Theoretical AI Apr 3