ARXIV:2602.20918 · MULTIMODAL AI · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Predicting Sentence Acceptability Judgments in Multimodal Contexts

arXiv

Explore how visual contexts affect sentence acceptability judgments by humans and LLMs.

Blocked on Code›Score3.0Evidence unverified

Opportunity summary

Pain Explore how visual contexts affect sentence acceptability judgments by humans and LLMs.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Explore how visual contexts affect sentence acceptability judgments by humans and LLMs. We consider the effect of prior exposure to visual images (i.e., visual context) on these judgments for humans and large language models…

METHOD

Full abstract

Previous work has examined the capacity of deep neural networks (DNNs), particularly transformers, to predict human sentence acceptability judgments, both independently of context, and in document contexts. We consider the effect of prior exposure to visual images (i.e., visual context) on these judgments for humans and large language models (LLMs). Our results suggest that, in contrast to textual context, visual images appear to have little if any impact on human acceptability ratings. However, LLMs display the compression effect seen in previous work on human judgments in document contexts. Different sorts of LLMs are able to predict human acceptability judgments to a high degree of accuracy, but in general, their performance is slightly better when visual contexts are removed. Moreover, the distribution of LLM judgments varies among models, with Qwen resembling human patterns, and others diverging from them. LLM-generated predictions on sentence acceptability are highly correlated with their normalised log probabilities in general. However, the correlations decrease when visual contexts are present, suggesting that a higher gap exists between the internal representations of LLMs and their generated predictions in the presence of visual contexts. Our experimental work suggests interesting points of similarity and of difference between human and LLM processing of sentences in multimodal contexts.

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. Our results suggest that, in contrast to textual context, visual images appear to have little if any impact on human acceptability ratings.

WHY NOW

Multimodal AI moved forward this cycle; last verified April 2026. Public score 3.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score3.0

PainExplore how visual contexts affect sentence acceptability judgments by humans and LLMs.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Explore how visual contexts affect sentence acceptability judgments by humans and LLMs.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

References(11)

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

2024Zhe Chen, Weiyun Wang et al.

How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments

2024Yusuke Ide, Yuto Nishida et al.

Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models

2024Carina Kauf, Emmanuele Chersoni et al.

Language models align with human judgments on key grammatical constructions

2024Jennifer Hu, Kyle Mahowald et al.

Evaluating Grammatical Well-Formedness in Large Language Models: A Comparative Study with Human Judgments

2024Zhuang Qiu, Xufeng Duan et al.

Improved Baselines with Visual Instruction Tuning

2023Haotian Liu, Chunyuan Li et al.

Deep Learning and Linguistic Representation

2021Shalom Lappin

How Furiously Can Colorless Green Ideas Sleep? Sentence Acceptability in Context

2020Jey Han Lau, C. S. Armendariz et al.

The Influence of Context on Sentence Acceptability Judgements

2018Jean-Philippe Bernardy, Shalom Lappin et al.

Grammaticality, Acceptability, and Probability: A Probabilistic View of Linguistic Knowledge

2017Jey Han Lau, Alexander Clark et al.

Moses: Open Source Toolkit for Statistical Machine Translation

2007Philipp Koehn, Hieu T. Hoang et al.

{ "contract_version": "paper-r2", "paper_id": "6cef8fb1-676c-4e9a-909b-5fd8a16260fc", "arxiv_id": "2602.20918", "canonical_route": "/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "predicting-sentence-acceptability-judgments-in-multimodal-contexts", "endpoints": { "paper_pack": "/api/v1/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts/paper-pack", "build_passport": "/api/v1/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Predicting Sentence Acceptability Judgments in Multimodal Contexts", "normalized_query": "2602.20918", "route": "/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts", "paper_ref": "predicting-sentence-acceptability-judgments-in-multimodal-contexts", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts#webpage", "url": "https://sciencetostartup.com/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts", "name": "Predicting Sentence Acceptability Judgments in Multimodal Contexts", "description": "Explore how visual contexts affect sentence acceptability judgments by humans and LLMs.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts#scholarlyArticle", "headline": "Predicting Sentence Acceptability Judgments in Multimodal Contexts", "description": "Explore how visual contexts affect sentence acceptability judgments by humans and LLMs.", "url": "https://sciencetostartup.com/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts", "sameAs": "https://arxiv.org/abs/2602.20918", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2602.20918" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-02-24T13:54:38.000Z", "citation": [ { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "5f49ec9560ca9e03eff32a607f6caabc08f98926" }, "url": "https://www.semanticscholar.org/paper/5f49ec9560ca9e03eff32a607f6caabc08f98926" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "c42c10e0205225f9afcb16488fa303150d426911" }, "url": "https://www.semanticscholar.org/paper/c42c10e0205225f9afcb16488fa303150d426911" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e9463915f4481c91b559c44283e5cf9a93e195a3" }, "url": "https://www.semanticscholar.org/paper/e9463915f4481c91b559c44283e5cf9a93e195a3" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "934a0d374e3babeff24edd1ac04f8196ecfcd680" }, "url": "https://www.semanticscholar.org/paper/934a0d374e3babeff24edd1ac04f8196ecfcd680" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "124d4d374fbef2016fa9880489871a58a7450644" }, "url": "https://www.semanticscholar.org/paper/124d4d374fbef2016fa9880489871a58a7450644" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "7b65f3e60065cff562f45c6c20f4e8a27949f18d" }, "url": "https://www.semanticscholar.org/paper/7b65f3e60065cff562f45c6c20f4e8a27949f18d" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "09307f588e2800b37c943194c84836b4811eb74e" }, "url": "https://www.semanticscholar.org/paper/09307f588e2800b37c943194c84836b4811eb74e" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "6d84701d47d0064be247dbbc77f42f700913caa8" }, "url": "https://www.semanticscholar.org/paper/6d84701d47d0064be247dbbc77f42f700913caa8" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "7b36c7c5d6490d02ad28298a42980b43d4b1ded8" }, "url": "https://www.semanticscholar.org/paper/7b36c7c5d6490d02ad28298a42980b43d4b1ded8" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "4ee2eab4c298c1824a9fb8799ad8eed21be38d21" }, "url": "https://www.semanticscholar.org/paper/4ee2eab4c298c1824a9fb8799ad8eed21be38d21" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "155e3694e7413177d75a25774454c1ff571c0943" }, "url": "https://www.semanticscholar.org/paper/155e3694e7413177d75a25774454c1ff571c0943" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 3 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Multimodal AI" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Multimodal AI", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Predicting Sentence Acceptability Judgments in Multimodal Co", "item": "https://sciencetostartup.com/paper/predicting-sentence-acceptability-judgments-in-multimodal-contexts" } ] } ] }

References(11)

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

2024Zhe Chen, Weiyun Wang et al.

How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments

2024Yusuke Ide, Yuto Nishida et al.

Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models

2024Carina Kauf, Emmanuele Chersoni et al.

Language models align with human judgments on key grammatical constructions

2024Jennifer Hu, Kyle Mahowald et al.

Evaluating Grammatical Well-Formedness in Large Language Models: A Comparative Study with Human Judgments

2024Zhuang Qiu, Xufeng Duan et al.

Improved Baselines with Visual Instruction Tuning

2023Haotian Liu, Chunyuan Li et al.

Deep Learning and Linguistic Representation

2021Shalom Lappin

How Furiously Can Colorless Green Ideas Sleep? Sentence Acceptability in Context

2020Jey Han Lau, C. S. Armendariz et al.

The Influence of Context on Sentence Acceptability Judgements

2018Jean-Philippe Bernardy, Shalom Lappin et al.

Grammaticality, Acceptability, and Probability: A Probabilistic View of Linguistic Knowledge

2017Jey Han Lau, Alexander Clark et al.

Moses: Open Source Toolkit for Statistical Machine Translation

2007Philipp Koehn, Hieu T. Hoang et al.

Predicting Sentence Acceptability Judgments in Multimodal Contexts

Predicting Sentence Acceptability Judgments in Multimodal Contexts

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(11)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(11)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline