LLM Behavior Analysis

Proof pending

13 papers

3.4 viability

-71% (30d)

Proof pending. Core topic summary fields are still materializing.

State of the Field

Recent studies of large language model (LLM) behavior examine how models act under instruction and across multi-turn interaction. In prompted sandbagging, models told to underperform show positional bias in their answer choices rather than avoiding the correct answer. In multi-turn dialogue, models respond inconsistently to user-initiated repair. Work on moral reasoning finds inconsistent judgments, raising concerns about reliability in sensitive contexts. These findings matter to developers and researchers because they inform the design of more robust and trustworthy AI systems that better align with human expectations and ethical standards.

Last updated Jun 6, 2026

Papers

Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging

Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

Below-Chance Blindness: Prompted Underperformance in Small LLMs Produces Positional Bias Rather than Answer Avoidance

Lighting Up or Dimming Down? Exploring Dark Patterns of LLMs in Co-Creativity

Large Language Models Exhibit Normative Conformity

Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Persona Vectors in Games: Measuring and Steering Strategies via Activation Vectors

The Fragility Of Moral Judgment In Large Language Models

Human-Alignment, Calibration, and Activation Patterns in Large Language Model Uncertainty
