ARXIV:2603.26586 · MULTIMODAL AI · SUBMITTED 30 MAR · 21:51 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

MA-Bench: Towards Fine-grained Micro-Action Understanding

Kun Li · Jihao Gu · Fei Wang · Zhiliang Wu · Hehe Fan · Dan Guo · arXiv

A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.

Evidence 96 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis. To tackle this issue, we present MA-Bench, a benchmark comprising 1,000 videos and a three-tier evaluation…

METHOD

Full abstract

With the rapid development of Multimodal Large Language Models (MLLMs), their potential in Micro-Action understanding, a vital role in human emotion analysis, remains unexplored due to the absence of specialized benchmarks. To tackle this issue, we present MA-Bench, a benchmark comprising 1,000 videos and a three-tier evaluation architecture that progressively examines micro-action perception, relational comprehension, and interpretive reasoning. MA-Bench contains 12,000 structured question-answer pairs, enabling systematic assessment of both recognition accuracy and action interpretation. The results of 23 representative MLLMs reveal that there are significant challenges in capturing motion granularity and fine-grained body-part dynamics. To address these challenges, we further construct MA-Bench-Train, a large-scale training corpus with 20.5K videos annotated with structured micro-action captions for fine-tuning MLLMs. The results of Qwen3-VL-8B fine-tuned on MA-Bench-Train show clear performance improvements across micro-action reasoning and explanation tasks. Our work aims to establish a foundation benchmark for advancing MLLMs in understanding subtle micro-action and human-related behaviors. Project Page: https://MA-Bench.github.io

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. The results of 23 representative MLLMs reveal that there are significant challenges in capturing motion granularity and fine-grained body-part dynamics. Code availability is flagged…

WHY NOW

Multimodal AI moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.

Evidence96 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.

Segment

Multimodal AI

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "6c0ddf6c-1287-47ca-afcf-68e7366da164", "arxiv_id": "2603.26586", "canonical_route": "/paper/ma-bench-towards-fine-grained-micro-action-understanding", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "ma-bench-towards-fine-grained-micro-action-understanding", "endpoints": { "paper_pack": "/api/v1/paper/ma-bench-towards-fine-grained-micro-action-understanding/paper-pack", "build_passport": "/api/v1/paper/ma-bench-towards-fine-grained-micro-action-understanding/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "MA-Bench: Towards Fine-grained Micro-Action Understanding", "normalized_query": "2603.26586", "route": "/paper/ma-bench-towards-fine-grained-micro-action-understanding", "paper_ref": "ma-bench-towards-fine-grained-micro-action-understanding", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/ma-bench-towards-fine-grained-micro-action-understanding#webpage", "url": "https://sciencetostartup.com/paper/ma-bench-towards-fine-grained-micro-action-understanding", "name": "MA-Bench: Towards Fine-grained Micro-Action Understanding", "description": "A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/ma-bench-towards-fine-grained-micro-action-understanding#scholarlyArticle", "headline": "MA-Bench: Towards Fine-grained Micro-Action Understanding", "description": "A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.", "url": "https://sciencetostartup.com/paper/ma-bench-towards-fine-grained-micro-action-understanding", "sameAs": "https://arxiv.org/abs/2603.26586", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.26586" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-27T16:49:19.000Z", "author": [ { "@type": "Person", "name": "Kun Li" }, { "@type": "Person", "name": "Jihao Gu" }, { "@type": "Person", "name": "Fei Wang" }, { "@type": "Person", "name": "Zhiliang Wu" }, { "@type": "Person", "name": "Hehe Fan" }, { "@type": "Person", "name": "Dan Guo" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Multimodal AI" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Multimodal AI", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "MA-Bench: Towards Fine-grained Micro-Action Understanding", "item": "https://sciencetostartup.com/paper/ma-bench-towards-fine-grained-micro-action-understanding" } ] } ] }

Competitive landscape

A new benchmark and training dataset for fine-grained micro-action understanding in multimodal LLMs, enabling better human behavior analysis.

Segment

Multimodal AI

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

MA-Bench: Towards Fine-grained Micro-Action Understanding

MA-Bench: Towards Fine-grained Micro-Action Understanding

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline