UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding Build Now
UI-Zoomer enhances GUI interaction by employing uncertainty-driven adaptive zoom for improved element localization.
GitHub stars n/a Velocity flat History 1 snapshot GUI Optimization Tools Apr 15 Pending High viability
Training-Free Test-Time Contrastive Learning for Large Language Models Build Now
A training-free framework that enables frozen LLMs to adapt to distribution shifts by distilling supervision from their own inference experiences through a dynamic 'Explore-Reflect-Steer' loop.
GitHub stars n/a Velocity flat History pending LLM Adaptation Apr 15 Pending High viability
Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation Build Now
Aligning early visual cortex representations in vision-language models to shield them from sycophantic manipulation.
GitHub stars n/a Velocity flat History pending AI Safety / Vision-Language Models Apr 15 Pending High viability
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space Build Now
A novel reinforcement learning approach that optimizes LLM reasoning by directly updating the pre-train space, leading to significant improvements in reasoning ability and pruning of incorrect reasoning paths.
GitHub stars n/a Velocity flat History pending LLM Training Apr 15 Pending High viability
UHR-BAT: Budget-Aware Token Compression Vision-Language model for Ultra-High-Resolution Remote Sensing Build Now
A budget-aware token compression framework for ultra-high-resolution remote sensing imagery that achieves state-of-the-art performance by leveraging text-guided importance estimation and region-wise strategies.
GitHub stars n/a Velocity flat History pending Remote Sensing AI Apr 15 Pending High viability
MIND: AI Co-Scientist for Material Research Build Now
An AI co-scientist framework for materials research that automates hypothesis validation through experimentation and debate.
GitHub stars n/a Velocity flat History pending Agents Apr 15 Pending High viability
TIP: Token Importance in On-Policy Distillation Build Now
A novel method for on-policy knowledge distillation that significantly reduces memory usage and training time by intelligently selecting informative tokens, validated on multiple LLM architectures.
GitHub stars n/a Velocity flat History pending LLM Training Apr 15 Pending High viability
Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus Build Now
A Transformer-based framework that bridges multi-agent reinforcement learning to single-agent learning for improved coordination and performance.
GitHub stars n/a Velocity flat History pending Multi-Agent Reinforcement Learning Apr 15 Pending High viability
MaMe & MaRe: Matrix-Based Token Merging and Restoration for Efficient Visual Perception and Synthesis Build Now
A GPU-friendly token merging and restoration technique that significantly accelerates Vision Transformers and image generation models with minimal performance loss.
GitHub stars n/a Velocity flat History pending Vision Transformer Optimization Apr 15 Pending High viability
AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot Build Now
Revolutionize academic conference peer reviews with AI-driven critique enhancement systems.
GitHub stars n/a Velocity flat History 1 snapshot AI-Assist Tools Apr 15 Code High viability
A KL Lens on Quantization: Fast, Forward-Only Sensitivity for Mixed-Precision SSM-Transformer Models Build Now
A fast, forward-only KL-divergence based sensitivity analysis for efficient mixed-precision quantization of SSM-Transformer models on edge devices.
GitHub stars n/a Velocity flat History pending LLM Optimization Apr 15 Pending High viability
ReSS: Learning Reasoning Models for Tabular Data Prediction via Symbolic Scaffold Build Now
ReSS bridges symbolic and neural reasoning for tabular data, using decision trees to scaffold LLMs for accurate and faithful natural-language explanations.
GitHub stars n/a Velocity flat History pending Tabular AI Apr 15 Code High viability
Diffusion Language Models for Speech Recognition Build Now
Diffusion language models integrated with CTC for improved speech recognition accuracy, with code and recipes available.
GitHub stars n/a Velocity flat History pending Speech Recognition Apr 15 Code High viability
GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis Build Now
A dynamic benchmark and agent architecture for evaluating and improving tool-augmented LLMs in complex spatial analysis tasks.
GitHub stars n/a Velocity flat History 1 snapshot Agents Apr 15 Code High viability
LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning Build Now
A new benchmark for evaluating long-horizon chain-of-thought reasoning in LLMs, revealing significant gaps in current model capabilities and providing a rigorous measure for future progress.
GitHub stars n/a Velocity flat History pending LLM Reasoning Apr 15 Code High viability
From Anchors to Supervision: Memory-Graph Guided Corpus-Free Unlearning for Large Language Models Build Now
A framework for unlearning sensitive data from LLMs using memory graphs and synthesized supervision, without access to the original training data.
GitHub stars n/a Velocity flat History pending LLM Unlearning Apr 15 Code High viability
Automatically Inferring Teachers' Geometric Content Knowledge: A Skills Based Approach Build Now
An automated system for assessing teachers' geometric content knowledge using LLMs and a fine-grained skills dictionary, enabling scalable evaluation.
GitHub stars n/a Velocity flat History pending Educational AI Apr 15 Code High viability
MCPThreatHive: Automated Threat Intelligence for Model Context Protocol Ecosystems Build Now
An automated platform for generating and visualizing threat intelligence for agentic systems, addressing critical gaps in current security tools.
GitHub stars n/a Velocity flat History pending Security Apr 15 Code High viability
HINTBench: Horizon-agent Intrinsic Non-attack Trajectory Benchmark Build Now
HINTBench: A benchmark for evaluating intrinsic risks in AI agents, revealing a significant capability gap in detecting and localizing latent failures.
GitHub stars n/a Velocity flat History pending Agents Apr 15 Code High viability
SafeHarness: Lifecycle-Integrated Security Architecture for LLM-based Agent Deployment Build Now
SafeHarness is a lifecycle-integrated security architecture for LLM agents that significantly reduces unsafe behavior and attack success rates.
GitHub stars n/a Velocity flat History pending LLM Security Apr 15 Code High viability
TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration Build Now
An agent-driven system that automates the entire LLM training lifecycle, from research to execution, optimizing model performance.
GitHub stars n/a Velocity flat History pending LLM Training Automation Apr 15 Code High viability
TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds Build Now
TokenFormer unifies multi-field and sequential recommendation models, overcoming sequential collapse propagation with a novel attention scheme and representation method.
GitHub stars n/a Velocity flat History pending Recommendation Systems Apr 15 Code High viability
MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments Build Now
MERRIN is a new benchmark for evaluating AI agents' ability to reason across multimodal, noisy web data, highlighting a critical gap in current search technologies.
GitHub stars n/a Velocity flat History pending Search Agents Apr 15 Code High viability
How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data Build Now
A systematic study and open dataset for synthesizing high-quality pretraining data for LLMs, reducing generation costs by up to 30x.
GitHub stars n/a Velocity flat History pending LLM Training Apr 15 Code High viability
HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System Build Now
A hierarchical robotic manipulation system that decouples planning from execution, preserving VLM reasoning while improving control for complex tasks.
GitHub stars n/a Velocity flat History pending Robotics Apr 15 Code High viability
Do We Still Need Humans in the Loop? Comparing Human and LLM Annotation in Active Learning for Hostility Detection Build Now
LLM-generated annotations for hostility detection achieve comparable performance to human annotations at a fraction of the cost, with nuanced error profiles.
GitHub stars n/a Velocity flat History pending LLM Annotation Apr 15 Code High viability
RiskWebWorld: A Realistic Interactive Benchmark for GUI Agents in E-commerce Risk Management Build Now
A realistic benchmark and infrastructure for evaluating GUI agents in e-commerce risk management, revealing a significant capability gap in current models.
GitHub stars n/a Velocity flat History pending Agents Apr 15 Code High viability
Free Lunch for Unified Multimodal Models: Enhancing Generation via Reflective Rectification with Inherent Understanding Build Now
UniRect-CoT is a training-free framework that enhances unified multimodal model generation by leveraging their inherent understanding to reflect and rectify intermediate results, inspired by human 'Thinking-While-Drawing'.
GitHub stars n/a Velocity flat History pending Multimodal Generation Apr 15 Code High viability
From Prediction to Justification: Aligning Sentiment Reasoning with Human Rationale via Reinforcement Learning Build Now
A reinforcement learning framework that aligns sentiment reasoning with human rationale, improving interpretability and performance in ABSA tasks.
GitHub stars n/a Velocity flat History pending Aspect-based Sentiment Analysis Apr 15 Code High viability
DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis Build Now
A large-scale dataset and benchmark for distractor-free novel view synthesis, enabling robust radiance field development and a diffusion-based enhancement application.
GitHub stars n/a Velocity flat History pending Novel View Synthesis Apr 15 Code High viability
A Mechanistic Analysis of Sim-and-Real Co-Training in Generative Robot Policies Build Now
This research provides a mechanistic analysis of sim-and-real co-training for generative robot policies, identifying key effects and proposing a method to improve performance.
GitHub stars n/a Velocity flat History pending Robotics AI Apr 15 Code High viability
Jump-Start Reinforcement Learning with Vision-Language-Action Regularization Build Now
VLAJS jump-starts reinforcement learning for robotics by using vision-language-action models to guide exploration and improve learning efficiency, outperforming baselines by over 50%.
GitHub stars n/a Velocity flat History pending Robotics RL Apr 15 Code High viability
BenGER: A Collaborative Web Platform for End-to-End Benchmarking of German Legal Tasks Build Now
BenGER is an open-source web platform for end-to-end benchmarking of German legal tasks, integrating task creation, annotation, LLM runs, and evaluation.
GitHub stars n/a Velocity flat History pending LLM Tools Apr 15 Code High viability
A Unified Conditional Flow for Motion Generation, Editing, and Intra-Structural Retargeting Build Now
A unified generative framework for text-driven motion editing and structural retargeting, simplifying deployment and improving consistency.
GitHub stars n/a Velocity flat History pending Generative Motion Apr 15 Code High viability
ASTER: Latent Pseudo-Anomaly Generation for Unsupervised Time-Series Anomaly Detection Build Now
A novel framework for unsupervised time-series anomaly detection that generates latent pseudo-anomalies to train a Transformer-based classifier.
GitHub stars n/a Velocity flat History pending Time-Series Anomaly Detection Apr 15 Code High viability
IndicDB -- Benchmarking Multilingual Text-to-SQL Capabilities in Indian Languages Build Now
A benchmark and evaluation framework for multilingual Text-to-SQL in Indian languages, revealing an 'Indic Gap' in current LLM performance.
GitHub stars n/a Velocity flat History pending Multilingual Text-to-SQL Apr 15 Code High viability
Learning from Change: Predictive Models for Incident Prevention in a Regulated IT Environment Build Now
An interpretable machine learning model that predicts IT incident risk for regulated environments, outperforming rule-based systems.
GitHub stars n/a Velocity flat History pending IT Operations AI Apr 15 Code High viability
SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization Build Now
A new method for optimizing LLM post-training by controlling data overlap between fine-tuning stages, significantly improving performance without extra compute.
GitHub stars n/a Velocity flat History pending LLM Training Apr 15 Code High viability
Quantifying and Understanding Uncertainty in Large Reasoning Models Build Now
A novel methodology quantifies uncertainty in Large Reasoning Models with statistical guarantees, using Shapley values for explainable subsets of training data.
GitHub stars n/a Velocity flat History pending LLM Reasoning Apr 15 Code High viability
A 3D SAM-Based Progressive Prompting Framework for Multi-Task Segmentation of Radiotherapy-induced Normal Tissue Injuries in Limited-Data Settings Build Now
A 3D segmentation framework for radiotherapy injuries using progressive prompting and a novel loss function, outperforming state-of-the-art on limited medical data.
GitHub stars n/a Velocity flat History pending Medical AI Apr 15 Code High viability
UMI-3D: Extending Universal Manipulation Interface from Vision-Limited to 3D Spatial Perception Build Now
A multimodal robotic manipulation system integrating LiDAR for robust 3D spatial perception, enhancing data collection and policy performance.
GitHub stars n/a Velocity flat History pending Robotics Apr 15 Code High viability
Syn-TurnTurk: A Synthetic Dataset for Turn-Taking Prediction in Turkish Dialogues Build Now
Syn-TurnTurk is a synthetic dataset for Turkish dialogue turn-taking prediction, enabling more natural human-machine interaction in Turkish voice bots.
GitHub stars n/a Velocity flat History pending Dialogue AI Apr 15 Code High viability
Hierarchical Reinforcement Learning with Runtime Safety Shielding for Power Grid Operation Build Now
A safety-constrained hierarchical reinforcement learning framework for power grid operation that ensures runtime safety and robust generalization.
GitHub stars n/a Velocity flat History pending Reinforcement Learning Apr 15 Code High viability
Outperforming Self-Attention Mechanisms in Solar Irradiance Forecasting via Physics-Guided Neural Networks Build Now
A physics-guided hybrid CNN-BiLSTM model that outperforms attention mechanisms for accurate solar irradiance forecasting.
GitHub stars n/a Velocity flat History pending Renewable Energy Management Apr 15 Code High viability
Asymmetric-Loss-Guided Hybrid CNN-BiLSTM-Attention Model for Industrial RUL Prediction with Interpretable Failure Heatmaps Build Now
A hybrid CNN-BiLSTM-Attention model with interpretable heatmaps for accurate and safe industrial Remaining Useful Life prediction.
GitHub stars n/a Velocity flat History pending Predictive Maintenance Apr 15 Code High viability
Leveraging LLM-GNN Integration for Open-World Question Answering over Knowledge Graphs Watch
Integrate LLM and GNN to enhance open-world question answering over knowledge graphs.
GitHub stars n/a Velocity flat History 1 snapshot AI for Knowledge Graphs and NLP Apr 15 Code
SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention Watch
A novel algorithm-system co-design framework for load-balanced long context LLM training that improves accuracy and system efficiency.
GitHub stars n/a Velocity flat History pending LLM Training Apr 15 Code
From Alignment to Prediction: A Study of Self-Supervised Learning and Predictive Representation Learning Watch
Introduces Predictive Representation Learning (PRL) as a new paradigm for self-supervised learning, demonstrating its potential through comparative analysis of existing methods.
GitHub stars n/a Velocity flat History pending Self-Supervised Learning Apr 15 Code
Reward Design for Physical Reasoning in Vision-Language Models Watch
Systematic reward ablation study for training Vision-Language Models on physical reasoning, revealing domain-specific behaviors and improving spatial relation accuracy.
GitHub stars n/a Velocity flat History pending Vision-Language Models Apr 15 Code
Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning Build Now
A framework using LLMs to automate and optimize reward function design in reinforcement learning, reducing evaluation costs and improving performance.
GitHub stars n/a Velocity flat History pending Reinforcement Learning Apr 15 Code High viability
The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents Watch
A parallel monitoring architecture for LLM agents that detects and recovers from reasoning degradation, with both LLM-based and zero-overhead probe-based implementations.
GitHub stars n/a Velocity flat History pending LLM Agents Apr 15 Code
C-voting: Confidence-Based Test-Time Voting without Explicit Energy Functions Build Now
Confidence-based voting (C-voting) enhances test-time scaling for recurrent models by selecting trajectories based on prediction confidence, outperforming existing methods on complex reasoning tasks.
GitHub stars n/a Velocity flat History pending LLM Reasoning Apr 15 Code High viability
Beyond Voxel 3D Editing: Learning from 3D Masks and Self-Constructed Data Build Now
A framework for efficient 3D asset editing that leverages a self-constructed dataset and lightweight modules to inject textual semantics while preserving local invariance.
GitHub stars n/a Velocity flat History pending 3D Generative AI Apr 15 Code High viability
MAny: Merge Anything for Multimodal Continual Instruction Tuning Build Now
MAny is a framework that merges task-specific knowledge for multimodal LLMs to prevent catastrophic forgetting in both perception and reasoning.
GitHub stars n/a Velocity flat History pending Multimodal LLMs Apr 15 Code High viability
Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective Ignore
A survey proposing a new taxonomy for feed-forward 3D reconstruction models, focusing on design strategies rather than output representations.
GitHub stars n/a Velocity flat History pending 3D Scene Modeling Apr 15 Pending
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents Watch
Enhance coding agent capabilities by leveraging a shared memory pool for cross-domain learning.
GitHub stars n/a Velocity flat History 1 snapshot Transfer Learning in AI Systems Apr 15 Code
From Feelings to Metrics: Understanding and Formalizing How Users Vibe-Test LLMs Watch
Transform subjective user feedback into quantifiable metrics for LLMs through structured vibe-testing.
GitHub stars n/a Velocity flat History 1 snapshot User Experience Metrics for LLMs Apr 15 Code
Representation over Routing: Overcoming Surrogate Hacking in Multi-Timescale PPO Watch
Proposes a Target Decoupling architecture for multi-timescale PPO that isolates short-term signals to prevent surrogate hacking and improve performance in delayed-reward tasks.
GitHub stars n/a Velocity flat History pending Reinforcement Learning Apr 15 Code
Event-Adaptive State Transition and Gated Fusion for RGB-Event Object Tracking Build Now
MambaTrack offers a real-time, adaptive RGB-Event object tracking system that overcomes limitations of static state transitions for robust cross-modal fusion.
GitHub stars n/a Velocity flat History pending Object Tracking Apr 15 Code High viability
Towards Scalable Lightweight GUI Agents via Multi-role Orchestration Watch
A framework for lightweight GUI agents that uses multi-role orchestration to improve task scalability and performance on resource-constrained devices.
Agents Apr 15
Towards Fine-grained Temporal Perception: Post-Training Large Audio-Language Models with Audio-Side Time Prompt Watch
Fine-tune large audio-language models for precise temporal event detection using audio-side time prompts and reinforcement learning.
GitHub stars n/a Velocity flat History pending Audio AI Apr 15 Code
Design Space Exploration of Hybrid Quantum Neural Networks for Chronic Kidney Disease Watch
A comprehensive exploration of Hybrid Quantum Neural Networks for Chronic Kidney Disease diagnosis, benchmarking 625 models to find optimal design choices.
GitHub stars n/a Velocity flat History pending Medical AI Apr 15 Code
Sentiment analysis for software engineering: How far can zero-shot learning (ZSL) go? Watch
Leveraging zero-shot learning for sentiment analysis in software engineering to overcome the challenge of scarce annotated datasets.
GitHub stars n/a Velocity flat History pending NLP Apr 15 Code
Beyond Arrow's Impossibility: Fairness as an Emergent Property of Multi-Agent Collaboration Ignore
Develop fair AI agents by enabling emergent fairness properties through multi-agent collaboration and negotiation.
GitHub stars n/a Velocity flat History pending Agents Apr 15 Code
Minimax Optimality and Spectral Routing for Majority-Vote Ensembles under Markov Dependence Ignore
A theoretical framework and adaptive algorithm for minimax optimal majority-vote ensembles in Markov-dependent data, validated on diverse benchmarks.
GitHub stars n/a Velocity flat History pending Ensemble Methods Apr 15 Code
Towards Multi-Object-Tracking with Radar on a Fast Moving Vehicle: On the Potential of Processing Radar in the Frequency Domain Ignore
Processing radar data in the frequency domain for robust multi-object tracking on fast-moving vehicles, demonstrating radar-only odometry.
GitHub stars n/a Velocity flat History pending Autonomous Driving Perception Apr 15 Code
Beyond Conservative Automated Driving in Multi-Agent Scenarios via Coupled Model Predictive Control and Deep Reinforcement Learning Watch
An integrated MPC-RL framework for automated driving that balances safety and efficiency in multi-agent scenarios, outperforming standalone methods.
Autonomous Driving Apr 15
Rhetorical Questions in LLM Representations: A Linear Probing Study Ignore
Investigating how LLMs internally represent rhetorical questions using linear probing, revealing that these signals emerge early and are encoded by multiple, context-dependent directions.
GitHub stars n/a Velocity flat History pending LLM Representations Apr 15 Code
FRAGATA: Semantic Retrieval of HPC Support Tickets via Hybrid RAG over 20 Years of Request Tracker History Watch
Fragata is a semantic search system for HPC support tickets that uses hybrid RAG to improve knowledge reuse and overcome limitations of traditional search engines.
Semantic Search Apr 15
CLIP Architecture for Abdominal CT Image-Text Alignment and Zero-Shot Learning: Investigating Batch Composition and Data Scaling Ignore
Investigating the impact of batch composition and data scaling on CLIP-like vision-language models for zero-shot diagnosis of abdominal CT scans, finding that random sampling outperforms engineered balancing.
GitHub stars n/a Velocity flat History pending Medical Imaging AI Apr 15 Code
The Cognitive Circuit Breaker: A Systems Engineering Framework for Intrinsic AI Reliability Ignore
A framework to detect LLM hallucinations by analyzing internal model states during inference, reducing latency and computational overhead.
GitHub stars n/a Velocity flat History pending LLM Reliability Apr 15 Code
Evaluating Supervised Machine Learning Models: Principles, Pitfalls, and Metric Selection Ignore
A framework for robustly evaluating supervised machine learning models by addressing common pitfalls in metric selection and validation.
GitHub stars n/a Velocity flat History pending ML Evaluation Apr 15 Code
First-See-Then-Design: A Multi-Stakeholder View for Optimal Performance-Fairness Trade-Offs Ignore
A theoretical framework for multi-stakeholder fairness in algorithmic decision-making that explicitly models utilities and welfare across different groups.
GitHub stars n/a Velocity flat History pending Fairness in AI Apr 15 Code
On the Use of Evolutionary Optimization for the Dynamic Chance Constrained Open-Pit Mine Scheduling Problem Ignore
An evolutionary optimization approach addresses dynamic chance-constrained open-pit mine scheduling by maximizing profit and minimizing standard deviation.
GitHub stars n/a Velocity flat History pending Optimization Apr 15 Code
Creo: From One-Shot Image Generation to Progressive, Co-Creative Ideation Ignore
Creo: A multi-stage text-to-image system that allows progressive, co-creative ideation with user control and decision locking.
Generative AI Apr 15
[Emerging Ideas] Artificial Tripartite Intelligence: A Bio-Inspired, Sensor-First Architecture for Physical AI Ignore
A bio-inspired, sensor-first architecture for physical AI that improves end-to-end accuracy and reduces remote inference calls.
Physical AI Apr 15
Comparison of window shapes and lengths in short-time feature extraction for classification of heart sound signals Ignore
An experimental evaluation of window shapes and lengths for feature extraction in classifying heart sound signals using bidirectional LSTMs.
Medical AI Apr 15
Secure and Privacy-Preserving Vertical Federated Learning Ignore
A novel framework for privacy-preserving vertical federated learning using secure multiparty computation and differential privacy.
Federated Learning Apr 15
AlphaCNOT: Learning CNOT Minimization with Model-Based Planning Ignore
Developing AlphaCNOT, a model-based reinforcement learning framework for CNOT minimization in quantum circuits.
Quantum Computing Optimization Apr 15
Weight Patching: Toward Source-Level Mechanistic Localization in LLMs Ignore
A novel method for localizing LLM behavior to specific internal components by patching weights between models with differing capabilities.
LLM Interpretability Apr 15
A Study of Failure Modes in Two-Stage Human-Object Interaction Detection Ignore
An analysis of failure modes in two-stage human-object interaction detection models to improve future research.
GitHub stars n/a Velocity flat History pending Computer Vision Apr 15 Code
Adaptive Conformal Prediction for Improving Factuality of Generations by Large Language Models Ignore
Adaptive conformal prediction for improving the factuality of large language model generations by enabling prompt-dependent calibration.
LLM Factuality Apr 15
Monthly Diffusion v0.9: A Latent Diffusion Model for the First AI-MIP Ignore
A latent diffusion model for simulating low-frequency atmospheric variability at monthly timesteps with modest computational requirements.
Climate AI Apr 15
Cognitive Offloading in Agile Teams: How Artificial Intelligence Reshapes Risk Assessment and Planning Quality Ignore
Investigating the impact of AI on team cognition in agile sprint planning to propose a hybrid AI-human framework.
AI for Project Management Apr 15
Golden Handcuffs make safer AI agents Ignore
A Bayesian mitigation strategy for reinforcement learning agents to prevent unintended high-reward strategies by incorporating a large negative penalty and a mentor override.
Agents Apr 15
Large Language Models to Enhance Business Process Modeling: Past, Present, and Future Trends Ignore
A literature review on using Large Language Models to enhance Business Process Modeling, highlighting current trends, challenges, and future research directions.
LLM Applications Apr 15
Med-CAM: Minimal Evidence for Explaining Medical Decision Making Ignore
Generate minimal and sharp explanation maps for medical imaging AI decisions to improve clinician trust and understanding.
Medical AI Apr 15
Ordinary Least Squares is a Special Case of Transformer Ignore
This paper theoretically demonstrates that Ordinary Least Squares is a special case of the Transformer architecture, revealing a decoupled slow and fast memory mechanism.
LLM Theory Apr 15
Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces Ignore
A theoretical framework for off-policy, entropy-regularised reinforcement learning using eligibility traces.
Reinforcement Learning Apr 15
A Dynamic-Growing Fuzzy-Neuro Controller, Application to a 3PSP Parallel Robot Ignore
A dynamic-growing fuzzy neural controller with adaptive strategy for parallel robot position control.
Robotics Control Apr 15
Rethinking AI Hardware: A Three-Layer Cognitive Architecture for Autonomous Agents Ignore
A novel three-layer cognitive architecture for autonomous agents that decomposes intelligence across heterogeneous hardware to reduce latency and energy consumption.
AI Hardware Architecture Apr 15
Young people's perceptions and recommendations for conversational generative artificial intelligence in youth mental health Ignore
Young people's perceptions and recommendations for conversational generative AI in youth mental health are explored through co-design workshops.
AI Ethics & HCI Apr 15
From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning Ignore
A theoretical characterization of forgetting in continual learning by analyzing task distributions rather than orderings.
Continual Learning Theory Apr 15