ARXIV:2604.13448 · COMPUTER VISION AI · SUBMITTED 16 APR · 18:21 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

A Study of Failure Modes in Two-Stage Human-Object Interaction Detection

Lemeng Wang · Qinqian Lei · Vidhi Bakshi · Daniel Yi · Yifan Liu · Jiacheng Hou · +5 at arXiv

A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.

Ship in 2-4 weeks›Score4.0Evidence unverified

Opportunity summary

Pain A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.

Evidence 0 refs | 4 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research. While recent advances have improved performance on existing benchmarks, their evaluations mainly focus on overall prediction accuracy…

METHOD

Full abstract

Human-object interaction (HOI) detection aims to detect interactions between humans and objects in images. While recent advances have improved performance on existing benchmarks, their evaluations mainly focus on overall prediction accuracy and provide limited insight into the underlying causes of model failures. In particular, modern models often struggle in complex scenes involving multiple people and rare interaction combinations. In this work, we present a study to better understand the failure modes of two-stage HOI models, which form the basis of many current HOI detection approaches. Rather than constructing a large-scale benchmark, we instead decompose HOI detection into multiple interpretable perspectives and analyze model behavior across these dimensions to study different types of failure patterns. We curate a subset of images from an existing HOI dataset organized by human-object-interaction configurations (e.g., multi-person interactions and object sharing), and analyze model behavior under these configurations to examine different failure modes. This design allows us to analyze how these HOI models behave under different scene compositions and why their predictions fail. Importantly, high overall benchmark performance does not necessarily reflect robust visual reasoning about human-object relationships. We hope that this study can provide useful insights into the limitations of HOI models and offer observations for future research in this area.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. We hope that this study can provide useful insights into the limitations of HOI models and offer observations for future research in this area.…

WHY NOW

Computer Vision AI moved forward this cycle; last verified April 2026. Public score 4.0/10. Implementation evidence is present through a linked repository.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainA study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.

Evidence0 refs | 4 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.

Segment

Computer Vision AI

Adoption evidence

Public code linked for build inspection

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "83b64ee7-1fc3-4dfb-a842-02d6b00c80b8", "arxiv_id": "2604.13448", "canonical_route": "/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "a-study-of-failure-modes-in-two-stage-human-object-interaction-detection", "endpoints": { "paper_pack": "/api/v1/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection/paper-pack", "build_passport": "/api/v1/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "A Study of Failure Modes in Two-Stage Human-Object Interaction Detection", "normalized_query": "2604.13448", "route": "/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection", "paper_ref": "a-study-of-failure-modes-in-two-stage-human-object-interaction-detection", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection#webpage", "url": "https://sciencetostartup.com/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection", "name": "A Study of Failure Modes in Two-Stage Human-Object Interaction Detection", "description": "A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection#scholarlyArticle", "headline": "A Study of Failure Modes in Two-Stage Human-Object Interaction Detection", "description": "A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.", "url": "https://sciencetostartup.com/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection", "sameAs": "https://arxiv.org/abs/2604.13448", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.13448" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-15T04:01:23.000Z", "author": [ { "@type": "Person", "name": "Lemeng Wang" }, { "@type": "Person", "name": "Qinqian Lei" }, { "@type": "Person", "name": "Vidhi Bakshi" }, { "@type": "Person", "name": "Daniel Yi" }, { "@type": "Person", "name": "Yifan Liu" }, { "@type": "Person", "name": "Jiacheng Hou" }, { "@type": "Person", "name": "Asher Seng Hao" }, { "@type": "Person", "name": "Zheda Mai" }, { "@type": "Person", "name": "Wei-Lun Chao" }, { "@type": "Person", "name": "Robby T. Tan" }, { "@type": "Person", "name": "Bo Wang" } ], "codeRepository": "https://github.com/cvpr-org/author-kit", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Computer Vision AI" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code, repo url" } ] }, { "@type": "SoftwareSourceCode", "@id": "https://sciencetostartup.com/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection#software", "name": "A Study of Failure Modes in Two-Stage Human-Object Interaction Detection - Source Code", "description": "A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.", "codeRepository": "https://github.com/cvpr-org/author-kit", "url": "https://github.com/cvpr-org/author-kit" }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Computer Vision AI", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "A Study of Failure Modes in Two-Stage Human-Object Interacti", "item": "https://sciencetostartup.com/paper/a-study-of-failure-modes-in-two-stage-human-object-interaction-detection" } ] } ] }

Competitive landscape

A study analyzing failure modes in two-stage human-object interaction detection models to provide insights for future research.

Segment

Computer Vision AI

Adoption evidence

Public code linked for build inspection

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

A Study of Failure Modes in Two-Stage Human-Object Interaction Detection

A Study of Failure Modes in Two-Stage Human-Object Interaction Detection

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline