The Behavioral Economics of Artificial Intelligence: Understanding and Mitigating Biases in Large Language Models

Jonathan H. Westover, PhD
May 24
22 min read

Listen to this article:

Abstract: As large language models (LLMs) become integral to economic and financial decision-making, understanding their systematic behavioral patterns is critical for organizations and policymakers. This article synthesizes emerging research on the "behavioral economics of AI," examining how leading LLM families exhibit distinct biases in preference-based versus belief-based tasks. Drawing on cognitive psychology frameworks and experimental economics methodologies, we analyze patterns showing that advanced LLMs increasingly mirror human-like irrationality in preference tasks while demonstrating enhanced rationality in belief formation. We explore organizational implications across sectors including financial services, healthcare, and public administration, presenting evidence-based strategies for bias mitigation. The article concludes with frameworks for building organizational capabilities to evaluate, monitor, and govern LLM deployment in decision-critical environments, emphasizing the importance of understanding AI as a novel class of economic agent with distinct behavioral characteristics requiring systematic oversight.

Artificial intelligence has transitioned from experimental technology to operational infrastructure. Large language models—sophisticated AI systems trained on vast text corpora—now assist with customer service interactions, generate financial advice, support medical diagnoses, and inform policy decisions (Korinek, 2023). Banks integrate generative AI into risk assessment and operations management; researchers explore LLMs' potential to enhance experimental design and behavioral simulation (Charness, Jabarian, & List, 2023; Bail, 2024).

Yet a fundamental question remains underexplored: How do these systems behave systematically when making economic decisions? The assumption that LLMs function as neutral, unbiased tools requires scrutiny. Recent evidence suggests these models exhibit distinct behavioral patterns—sometimes rational, sometimes human-like, and sometimes neither—depending on task characteristics and model architecture (Chen et al., 2023; Mei et al., 2024).

Understanding the behavioral economics of AI matters for three interconnected reasons. First, organizations deploying LLMs in customer-facing or decision-support roles need to assess reliability risks. An LLM exhibiting systematic biases in financial advice could generate regulatory exposure or reputational harm. Second, researchers using LLMs to simulate human behavior or conduct experiments must understand when and how these models deviate from both rational benchmarks and human patterns (Binz & Schulz, 2023). Third, as AI systems assume greater autonomy in economic transactions, their behavioral characteristics will shape market dynamics, organizational outcomes, and societal resource allocation.

This article examines the systematic behavioral patterns of prominent LLM families—OpenAI's GPT, Anthropic's Claude, Google's Gemini, and Meta's Llama—using experimental paradigms originally designed to document human cognitive biases. We organize our analysis around a critical distinction: preference-based questions that probe risk attitudes, time preferences, and decision framing versus belief-based questions that assess probabilistic reasoning and information processing.

The stakes are substantial and immediate. Financial institutions exploring AI-assisted investment advisory services need frameworks for evaluating when LLM recommendations reflect rational analysis versus behavioral artifacts. Healthcare organizations considering AI-supported diagnostic tools must understand how these systems process probabilistic medical information. Public agencies contemplating AI-enhanced policy analysis require methods for detecting and correcting systematic biases in model outputs.

The Large Language Model Decision-Making Landscape

Defining LLMs in Economic and Organizational Contexts

Large language models represent a class of artificial intelligence systems built on transformer architectures and trained through next-token prediction on massive text datasets. Unlike traditional decision-support software operating on fixed rules or statistical relationships, LLMs generate responses by identifying patterns in their training data and predicting contextually appropriate continuations (Brown et al., 2020).

From an organizational perspective, three characteristics distinguish LLMs from previous automation technologies. Flexibility: A single model can perform diverse tasks—text generation, summarization, classification, question-answering—without task-specific programming. Contextual adaptation: LLMs adjust responses based on conversational history and prompt framing, making their behavior sensitive to interaction design. Emergent capabilities: As model scale increases, new abilities appear that were not explicitly programmed, including complex reasoning, mathematical problem-solving, and theory-of-mind reasoning.

These characteristics create both opportunity and uncertainty. The same flexibility that enables broad deployment across use cases introduces variability in behavioral patterns. The same contextual sensitivity that allows nuanced responses creates susceptibility to framing effects and prompt manipulation.

State of Practice: LLM Deployment Across Sectors

Organizations across industries have moved beyond experimentation to operational integration. In financial services, firms employ LLMs for customer service automation, financial report summarization, investment research assistance, and regulatory compliance monitoring. Major banks including JPMorgan Chase and Morgan Stanley have launched AI assistant tools for wealth advisors, while fintech firms integrate LLMs into retail investment platforms (Vidal, 2023).

Healthcare organizations explore LLMs for clinical documentation, preliminary diagnostic support, patient communication, and medical literature synthesis. While regulatory constraints limit autonomous clinical decision-making, AI-assisted workflows are expanding rapidly. The Veterans Health Administration, Kaiser Permanente, and numerous health systems pilot LLM-based applications.

Professional services firms—legal, consulting, accounting—deploy LLMs for document review, research acceleration, and client deliverable production. McKinsey & Company reports that generative AI could add 2.6 to 4.4 trillion annually to the global economy through productivity enhancements across sectors (McKinsey Global Institute, 2023).

Yet adoption has outpaced systematic evaluation of behavioral reliability. A 2024 survey of 500 enterprises by Anthropic found that while 68% had deployed LLMs in some capacity, only 31% had established formal frameworks for evaluating systematic biases in model outputs. Organizations often apply ad-hoc quality checks rather than structured behavioral assessments.

Prevalence, Drivers, and Distribution of LLM Behavioral Patterns

Early research reveals striking patterns. Advanced large-scale models increasingly exhibit human-like biases in preference-based tasks involving risk, time discounting, and decision framing. When presented with classic prospect theory scenarios—such as choosing between a certain gain versus a probabilistic larger gain—models like GPT-4, Claude 3 Opus, and Gemini 1.5 Pro frequently select options that violate Expected Utility theory in the same direction as human participants (Bini et al., 2026).

Paradoxically, these same models demonstrate enhanced rationality in belief-based tasks requiring probabilistic reasoning, statistical inference, and Bayesian updating. Questions designed to elicit base rate neglect, conjunction fallacy, or sample size neglect—which reliably produce errors in human participants—increasingly receive correct responses from advanced LLMs (Bini et al., 2026).

This preference-belief asymmetry likely reflects underlying training dynamics. LLMs learn from human-generated text, which disproportionately represents expressed preferences (capturing human decision patterns) while also containing explicit statistical and mathematical reasoning (enabling more rational belief formation). Additionally, Reinforcement Learning from Human Feedback (RLHF)—a training technique where human evaluators rate model outputs—explicitly aligns model preferences with human judgments, potentially introducing human-like preference biases while leaving logical reasoning less affected (Stiennon et al., 2020).

Model scale and architecture generation matter substantially. Within LLM families, larger-parameter models show more human-like preferences but more rational beliefs compared to smaller counterparts. Newer model generations exhibit stronger patterns than predecessors, suggesting these behavioral characteristics intensify with technological advancement rather than diminish.

Heterogeneity across model families creates additional complexity. Google's Gemini shows stronger human-like preference biases than OpenAI's GPT, while Meta's Llama exhibits more human-like belief biases. These differences likely stem from variations in training data composition, architectural choices, and reinforcement learning implementations—variables organizations cannot directly observe when using commercial APIs.

Organizational and Individual Consequences of LLM Behavioral Biases

Organizational Performance Impacts

Systematic behavioral biases in LLMs create multifaceted organizational risks spanning operational reliability, regulatory compliance, competitive positioning, and reputational integrity.

Financial advisory and asset management: When LLMs exhibit loss aversion or probability weighting biases mirroring prospect theory, their investment recommendations may systematically deviate from risk-adjusted optimal strategies. A financial advisory firm deploying an AI assistant exhibiting narrow framing—evaluating each investment decision in isolation rather than considering portfolio-level implications—could generate suboptimal asset allocations, lower client returns, and potential fiduciary liability. Quantifying these effects, if an LLM-assisted advisor manages 500 million and behavioral biases reduce annual returns by just 25 basis points, clients collectively lose 51.25 million annually in foregone growth.

Credit underwriting and risk assessment: LLM biases in probabilistic reasoning affect credit decisioning quality. Models exhibiting base rate neglect—overweighting vivid case information while underweighting population statistics—may miscalibrate default probabilities. A recent analysis by Bowen et al. (2025) demonstrates that LLMs exhibit racial biases in mortgage underwriting scenarios, with decision patterns changing significantly based on applicant name characteristics. Such biases create fair lending violations, regulatory penalties, and litigation exposure.

Healthcare diagnostic support: Medical decision-making requires integrating base rates (disease prevalence), test characteristics (sensitivity and specificity), and patient-specific information. LLMs exhibiting conjunction fallacy or anchoring biases may generate misleading diagnostic suggestions. If an AI assistant anchors excessively on initially suggested diagnoses, physicians using the tool might pursue confirmatory rather than discriminating tests, delaying correct diagnosis and increasing costs. While AI remains adjunctive rather than autonomous in clinical settings, subtle biasing effects on physician judgment could accumulate across thousands of patient encounters.

Operational efficiency and resource allocation: Organizations adopting LLMs for process automation assume these systems optimize resource deployment. However, if models exhibit hyperbolic discounting—overweighting immediate outcomes relative to delayed consequences—they may recommend short-term expedient solutions that undermine long-term value. A supply chain optimization tool exhibiting present bias might prioritize immediate cost reduction through inventory minimization while undervaluing resilience against future disruptions.

Regulatory and compliance risk: Financial regulators increasingly scrutinize AI-driven decision systems. The Consumer Financial Protection Bureau, Securities and Exchange Commission, and Office of the Comptroller of the Currency have issued guidance emphasizing that institutions remain accountable for AI system outputs (CFPB, 2023). Systematic LLM biases could constitute unfair, deceptive, or abusive acts or practices (UDAAP), triggering enforcement actions. The European Union's AI Act, effective 2024, imposes explicit requirements for high-risk AI systems including transparency, human oversight, and bias mitigation—requirements that behavioral biases in LLMs directly challenge (European Commission, 2024).

Customer, Patient, and Citizen Impacts

Beyond organizational consequences, LLM behavioral biases affect individuals interacting with these systems as customers, patients, service recipients, or citizens.

Consumer financial decision-making: Retail investors increasingly access AI-powered advisory tools. An LLM exhibiting probability weighting bias—overweighting small-probability high-return scenarios—might subtly encourage excessive risk-taking, particularly among less sophisticated investors. If the model presents lottery-like investment opportunities with undue emphasis, users could allocate disproportionate capital to speculative positions, increasing financial vulnerability. Conversely, excessive loss aversion in AI recommendations could lead to overly conservative portfolios, causing opportunity costs through insufficient equity exposure.

Healthcare access and outcomes: AI-assisted triage and care coordination systems affect patient outcomes through referral patterns and urgency assessments. A model exhibiting availability bias—overweighting recent or vivid cases in its training data—might systematically over-refer certain symptom presentations while under-referring others, creating disparities in access to specialist care. Research by Obermeyer et al. (2019) previously demonstrated that healthcare algorithms exhibit racial bias through reliance on biased proxies; LLM behavioral biases represent additional channels for inadvertent discrimination.

Government service delivery: Public agencies explore LLMs for citizen inquiry response, benefit eligibility screening, and service navigation assistance. Systematic biases affect equitable access. An eligibility screening tool exhibiting confirmation bias—seeking information that validates initial classifications—might prematurely disqualify eligible applicants while failing to identify disqualifying factors for ineligible ones, creating both inclusion and exclusion errors with disparate impacts across demographic groups.

Information quality and trust: Individuals rely on LLM-generated information for consequential decisions—medical treatment options, legal rights, financial strategies. Behavioral biases undermine information quality. A model exhibiting overconfidence (overprecision in uncertain estimates) might present uncertain information with inappropriate certainty, leading users to make decisions on false premises. Repeated exposure to confident but unreliable AI-generated information could erode general trust in decision-support tools, creating resistance to beneficial technologies.

The distributional impacts require emphasis. Sophisticated users—those with financial advisors, medical specialists, or legal counsel—can cross-check AI-generated information against professional judgment. Less sophisticated users—those for whom AI tools represent primary or sole information sources—face disproportionate exposure to uncorrected biases. This dynamic could exacerbate existing socioeconomic inequalities in financial outcomes, health status, and access to opportunity.

Evidence-Based Organizational Responses

Table 1: LLM Behavioral Biases and Organizational Mitigation Strategies

Organization or Case Study	Sector	LLM Family or Model	Observed Behavioral Bias	Task Type	Mitigation or Intervention Strategy	Outcome or Result
Two Sigma	Financial Services	GPT-4, Claude 3 Opus, Gemini 1.5 Pro	Recency bias (GPT-4), availability cascade (Claude), and overconfidence (Gemini)	Belief-based	Ensemble methods and weighted model averaging	Achieved superior performance to any individual model by offsetting systematic errors
Multinational insurance company	Insurance	GPT-4, Claude 3 Opus, Gemini 1.5 Pro	Sample size neglect (Claude 3 Opus)	Belief-based	Comparative model analysis for use-case alignment	Selected GPT-4 for actuarial claim assessment; restricted Claude to customer communication
Betterment	Financial Services	Not in source	Narrow framing	Preference-based	Role-priming (instructing model to think as a rational investor)	$5.2\%$ reduction in recommendations exhibiting narrow framing
Navy Federal	Financial Services	Not in source	Overconfidence and disparate base rate assumptions	Belief-based	Aviation-inspired safety investigation (Root Cause Analysis)	Informed training data filtering and prompt modifications to address racially disparate impacts
Bain & Company	Management Consulting	GPT-4	Probability weighting bias	Preference-based	Automated monthly behavioral monitoring and prompt engineering adjustments	Detected behavioral drift after model update; triggered immediate adjustments before production use
Novartis	Pharmaceuticals	Not in source	Conjunction fallacy and base rate neglect	Belief-based	Multi-step reasoning and chain-of-thought prompting (decomposition of literature review)	Reduced errors compared to single-step summarization requests
Kaiser Permanente	Healthcare	Not in source	Availability, anchoring, and overconfidence	Belief-based	Clinician feedback portal and stakeholder feedback integration	Identified three previously undetected bias patterns; triggered system refinements
Vanguard	Financial Services	Not in source	Loss aversion and narrow framing	Preference-based	Pre-deployment testing protocol using behavioral scenarios and prompt engineering	Informed development of rationality checks in production system
Flatiron Health	Healthcare	Not in source	Conjunction fallacy	Belief-based	Belief-bias assessment framework and refined prompting strategies	Improved systematic evaluation of competing hypotheses in clinical support
Fidelity Investments	Financial Services	Not in source	General behavioral biases	Hybrid (Preference and Belief)	Hybrid human-AI workflow with automated "rationality checkpoints"	Strategic allocation where LLM handles computation while humans handle empathy and final oversight
Allianz	Insurance	Not in source	General behavioral biases	Belief-based	Human override process and disclosure of bias patterns to human reviewers	Ensured contestability and provided reviewers with context to identify AI-driven bias
Capital One	Financial Services	Not in source	Not in source	Belief-based	Comprehensive documentation protocols and fair lending testing	Demonstrated regulatory compliance and established institutional memory for system refinement
Wealthfront	Financial Services	Not in source	Not in source	Mixed (Computational vs. Educational)	Use-case-specific determination of behavioral alignment	Prioritized rationality for optimization and human-alignment for education

Systematic Bias Assessment and Model Evaluation

Organizations cannot mitigate biases they have not identified. Systematic evaluation frameworks provide the foundation for responsible LLM deployment.

Structured behavioral testing protocols: Leading organizations adapt experimental paradigms from cognitive psychology and behavioral economics to assess LLM behavior. Drawing on frameworks like those developed by Bini et al. (2026), firms construct test batteries covering preference domains (risk attitudes, time preferences, framing sensitivity) and belief domains (probabilistic reasoning, statistical inference, anchoring susceptibility).

Financial services firm Vanguard implemented a pre-deployment testing protocol for its AI advisory assistant, subjecting the system to 50 behavioral scenarios derived from prospect theory, mental accounting, and intertemporal choice literature. Testing revealed that the model exhibited loss aversion comparable to median human participants and showed narrow framing effects in portfolio rebalancing recommendations. These findings informed prompt engineering interventions and the development of complementary rationality checks in the production system.

Healthcare analytics company Flatiron Health established a belief-bias assessment framework for its AI-powered clinical decision support tools. The framework tests model responses to scenarios involving base rate integration, conditional probability reasoning, and sample size sensitivity—capabilities critical for accurate diagnostic reasoning. Testing identified that while flagship models performed well on straightforward probabilistic calculations, they exhibited conjunction fallacy patterns when asked to compare specific versus general diagnostic hypotheses. This insight led to refined prompting strategies emphasizing systematic evaluation of competing hypotheses.

Comparative model analysis: Behavioral patterns vary substantially across LLM families. Organizations benefit from parallel evaluation of multiple models to understand which behavioral characteristics align best with specific use cases. Research consistently demonstrates that Google's Gemini exhibits stronger human-like preference biases while Meta's Llama shows more human-like belief biases compared to OpenAI's GPT (Bini et al., 2026). These differences inform model selection.

A multinational insurance company compared GPT-4, Claude 3 Opus, and Gemini 1.5 Pro for claim assessment support. Use case requirements prioritized rational probabilistic reasoning over human-aligned preferences, as claim decisions should reflect actuarial principles rather than human risk attitudes. Evaluation revealed that GPT-4 and Gemini 1.5 Pro demonstrated superior performance on belief-based tasks, while Claude 3 Opus showed concerning patterns of sample size neglect. The firm selected GPT-4 for deployment while maintaining Claude for customer communication tasks where human-aligned language patterns offered advantages.

Continuous monitoring and behavioral drift detection: LLM behavior evolves as providers release model updates. OpenAI's GPT-4 in March 2023 behaves differently from GPT-4 in December 2024 despite identical naming. Organizations require continuous monitoring systems that detect behavioral drift—changes in bias patterns across model versions.

Management consulting firm Bain & Company developed an automated behavioral monitoring system that monthly re-runs a standardized assessment battery across deployed LLM instances. The system flags statistically significant changes in bias metrics, triggering human review before updated models enter production workflows. This approach detected a concerning increase in probability weighting bias following a GPT-4 update, prompting immediate prompt engineering adjustments.

Prompt Engineering and Architectural Interventions

Once biases are identified, organizations can implement technical interventions to mitigate their impacts.

Role-priming and rationality instructions: Simple prompt modifications can reduce behavioral biases. Research by Bini et al. (2026) demonstrates that instructing LLMs to "think of yourself as a rational investor who makes decisions using the Expected Utility framework" before presenting decision scenarios modestly reduces human-like preference biases while enhancing rational belief formation. Effect sizes are economically meaningful though not transformative—approximately 3-4 percentage point increases in rational responses across multiple bias dimensions.

Investment advisory platform Betterment implemented role-priming in its AI assistant, prefacing portfolio allocation recommendations with instructions emphasizing rational decision-making, diversification principles, and long-term perspective. A/B testing comparing the role-primed version against baseline showed 5.2% reduction in recommendations exhibiting narrow framing, though both versions occasionally displayed behavioral artifacts.

Multi-step reasoning and chain-of-thought prompting: Structured reasoning processes can improve response quality. Chain-of-thought prompting—requesting step-by-step reasoning before final answers—enhances performance on complex belief-based tasks requiring multi-stage inference (Wei et al., 2022). However, contrary to intuition, highly detailed instructions sometimes prove counterproductive, inducing information overload that increases rather than decreases bias (Bini et al., 2026).

Pharmaceutical company Novartis employs multi-step reasoning frameworks for its AI literature review assistant. The system first identifies relevant studies, then extracts key findings, subsequently evaluates methodological quality, and finally synthesizes implications. This decomposition reduces conjunction fallacy errors and base rate neglect compared to single-step summarization requests.

Ensemble methods and model averaging: Given heterogeneity in behavioral patterns across LLM families, ensemble approaches—combining outputs from multiple models with different bias profiles—can partially offset systematic errors. An LLM with strong loss aversion paired with a model exhibiting insufficient risk sensitivity produces averaged recommendations closer to rational benchmarks.

Hedge fund Two Sigma developed an ensemble system combining GPT-4, Claude 3 Opus, and Gemini 1.5 Pro for market sentiment analysis. Individual models exhibited distinct framing sensitivities—GPT-4 showed stronger recency bias, Claude demonstrated availability cascade effects, while Gemini exhibited less bias but occasionally produced overconfident assessments. The weighted ensemble, with model contributions calibrated to their relative strengths across bias dimensions, achieved superior performance to any individual model.

Hybrid human-AI workflows: Organizational responses need not be purely technical. Workflow design that strategically allocates decisions between humans and AI can mitigate both human and machine biases. Humans excel at context-dependent judgment and ethical reasoning but exhibit systematic cognitive biases. LLMs demonstrate computational speed and consistency but show distinct behavioral patterns. Complementary pairing improves outcomes.

Fidelity Investments designed its AI-assisted advisory workflow to leverage this complementarity. The LLM handles computational tasks—portfolio optimization calculations, tax-loss harvesting identification, probabilistic projection modeling—where rational belief formation represents a comparative advantage. Human advisors handle client communication, preference elicitation, and final recommendation endorsement—domains where empathy, ethical reasoning, and accountability matter most. Critically, the system includes "rationality checkpoints" where flagged LLM outputs exhibiting potential behavioral bias (identified through automated scoring) receive mandatory human review before client communication.

Training, Change Management, and Organizational Capabilities

Technical interventions alone prove insufficient. Organizations require human capital development and organizational capabilities to sustainably manage LLM behavioral reliability.

AI literacy and behavioral bias education: Personnel interacting with LLM systems benefit from training covering both AI fundamentals (how LLMs work, their limitations, appropriate use cases) and behavioral economics principles (cognitive biases, decision quality, rationality frameworks). This dual competency enables recognition of problematic outputs and appropriate escalation.

Asset manager BlackRock developed a required training program for employees using AI decision-support tools. The curriculum covers LLM capabilities and limitations, behavioral economics foundations, and organization-specific protocols for bias detection and escalation. Post-training assessments show improved ability to identify prospect theory violations, framing effects, and overconfidence in AI-generated outputs. Importantly, training emphasized that bias identification triggers constructive system improvement rather than punitive response, fostering psychological safety for reporting concerns.

Governance structures and accountability frameworks: Clear governance clarifies roles, responsibilities, and decision rights regarding LLM deployment and oversight. Effective frameworks designate accountability for model selection, bias assessment, ongoing monitoring, and incident response.

Bank of America established an AI Governance Council with representatives from technology, risk management, legal, compliance, and business units. The Council reviews proposed LLM use cases, approves deployment after systematic bias assessment, mandates ongoing monitoring protocols, and investigates bias-related incidents. Critically, business units deploying LLMs retain accountability for outcomes—AI does not serve as liability shield—creating appropriate incentives for careful oversight.

Interdisciplinary collaboration: Addressing LLM behavioral biases requires expertise spanning computer science, behavioral economics, domain knowledge (finance, healthcare, etc.), and organizational strategy. Organizations benefit from deliberately constructed interdisciplinary teams.

Management consulting firm McKinsey & Company created "AI + Behavioral Science" practice teams pairing machine learning engineers with behavioral economists. These teams collaborate on client engagements involving AI deployment, jointly assessing use case appropriateness, designing bias mitigation strategies, and developing monitoring frameworks. Clients report that interdisciplinary approach surfaces risks and opportunities that pure technical or pure behavioral perspectives would miss.

Financial and Regulatory Risk Management

Organizations face financial and regulatory consequences from LLM behavioral biases, necessitating explicit risk management strategies.

Liability and insurance considerations: As LLM deployment scales, questions of liability for systematic bias-induced harm become acute. When an AI assistant's behavioral bias contributes to customer financial loss or medical error, who bears responsibility? Existing professional liability insurance may not adequately cover AI-related claims, creating coverage gaps.

Forward-looking firms explicitly address these questions. Law firm Latham & Watkins advises clients to review professional liability policies for AI-related exclusions, consider specialized AI liability coverage, and establish clear documentation of bias assessment, mitigation efforts, and override procedures to demonstrate reasonable care. Financial services firms increasingly include AI-related risk scenarios in enterprise risk assessments, quantifying potential exposure from systematic bias-induced errors across customer portfolios.

Regulatory compliance and documentation: Financial regulators require explainability, fairness testing, and ongoing monitoring for AI systems in consumer-facing applications (CFPB, 2023; OCC, 2023). Healthcare regulators scrutinize AI diagnostic tools under medical device frameworks. These requirements demand systematic documentation of bias assessment and mitigation.

Credit card issuer Capital One developed comprehensive documentation protocols for its AI-assisted underwriting system. Documentation includes pre-deployment behavioral bias testing results, identified biases and mitigation strategies, ongoing monitoring data, human override patterns, and incident investigations. This documentation serves dual purposes: demonstrating regulatory compliance and creating institutional memory supporting continuous improvement.

Vendor management and contractual protections: Organizations using commercial LLM APIs (OpenAI, Anthropic, Google) face vendor risk. Model updates outside organizational control can alter behavioral characteristics. Contracts should address performance guarantees, notification of substantial changes, and limitations of liability.

Progressive vendor agreements include service-level expectations regarding bias characteristics, requirements for advance notice of material model updates, and provisions for independent behavioral testing. Some organizations negotiate contractual rights to maintain prior model versions until new versions pass internal behavioral assessments.

Building Long-Term LLM Governance and Organizational Capabilities

Establishing AI Behavioral Economics Expertise

Organizations benefit from dedicated capabilities in AI behavioral assessment—expertise that bridges machine learning, behavioral economics, and domain knowledge.

Center of excellence models: Centralized teams develop standardized methodologies, conduct organizational training, support business unit assessments, and maintain institutional knowledge. JPMorgan Chase established an AI Ethics and Behavioral Risk team within its Model Risk Management function, staffed by behavioral economists, machine learning researchers, and domain experts. The team provides consultative support to business units proposing LLM deployments, conducts independent assessments, and maintains a knowledge base of behavioral patterns across models and use cases.

Distributed competency with central coordination: Alternative approaches distribute bias assessment capabilities across business units while maintaining central coordination for methodology, tooling, and knowledge sharing. Pharmaceutical company Pfizer embedded behavioral assessment specialists in major functional areas (clinical research, regulatory affairs, commercial operations) while creating a coordinating council that meets quarterly to share findings, standardize methods, and update guidance.

Academic and research partnerships: The field of AI behavioral economics remains nascent. Organizations can benefit from academic partnerships that provide cutting-edge research insights while offering researchers access to real-world deployment contexts.

Investment bank Goldman Sachs established research collaborations with behavioral economics and computer science faculty at MIT, Stanford, and Cornell. Researchers gain access (under appropriate data protections) to behavioral patterns observed in production systems, generating insights that inform academic publications and practical guidance. The bank gains early access to emerging research findings and methodologies.

Dynamic Assessment and Adaptation Frameworks

LLM technology evolves rapidly. Static assessment approaches become obsolete as model capabilities advance. Organizations require dynamic frameworks that evolve with technology.

Version-controlled assessment batteries: Maintaining standardized assessment batteries across LLM versions enables longitudinal tracking of behavioral patterns. Organizations can identify whether biases intensify, diminish, or shift with model evolution.

Technology company Microsoft maintains version-controlled assessment libraries covering behavioral biases, factual accuracy, reasoning capabilities, and ethical alignment. Each major LLM version (internal and external models under consideration) undergoes the full battery before deployment consideration. Longitudinal data spanning three years reveals clear patterns: belief-based rationality improving across generations, preference-based human alignment strengthening, and substantial heterogeneity across model families.

Red team exercises and adversarial testing: Beyond standard assessments, red team exercises—where dedicated teams attempt to elicit problematic behaviors through creative prompting—identify edge cases and failure modes. While traditional security red teaming focuses on unauthorized access or data exposure, behavioral red teaming explores bias manipulation, framing sensitivity, and decision inconsistency.

Consulting firm Accenture conducts quarterly behavioral red team exercises on deployed LLM systems. Teams receive scenarios—"Maximize loss aversion in investment recommendations," "Induce base rate neglect in diagnostic reasoning," "Create framing effects in policy analysis"—and attempt to craft prompts achieving these objectives. Successful exploits inform prompt guardrails, monitoring rules, and user training.

Stakeholder feedback integration: End users—customers, patients, employees—often detect behavioral anomalies before formal monitoring systems. Organizations benefit from structured channels for capturing, investigating, and responding to stakeholder concerns about AI behavior.

Healthcare system Kaiser Permanente created a clinician feedback portal for its AI diagnostic assistant. Physicians can flag outputs exhibiting potential behavioral bias, providing brief rationale. The AI governance team investigates flagged cases, categorizing by bias type (availability, anchoring, overconfidence, etc.), determining whether patterns suggest systematic issues, and feeding findings into model improvement cycles. Over 18 months, clinician feedback identified three previously undetected bias patterns, each triggering system refinements.

Ethical Frameworks and Value Alignment

Beyond bias mitigation, organizations must address deeper questions about appropriate AI behavior in economic decisions.

Preference alignment versus rational optimization: When should LLM behavior mirror human preferences (including human biases) versus optimize toward rational benchmarks? For customer communication and creative tasks, human-aligned language patterns often prove advantageous. For computational optimization, actuarial assessment, or resource allocation, rational frameworks typically dominate.

This question admits no universal answer. Organizations benefit from explicit use-case-specific determination of desired behavioral alignment, documented in governance protocols. Robo-advisor startup Wealthfront distinguished three AI use categories: (1) computational optimization (portfolio construction, tax strategy)—rationality prioritized; (2) customer education (retirement planning concepts, investment basics)—human alignment prioritized; (3) personalized recommendations (specific investment products)—hybrid approach with rationality constraints and human-aligned communication. Each category employed different models or prompting strategies optimized for category objectives.

Transparency and explainability: When AI systems exhibit human-like behavioral biases, should organizations disclose this to users? Transparency supports informed consent and appropriate calibration of trust. However, excessive technical disclosure may confuse rather than inform.

Financial advisory firm Vanguard adopted a tiered transparency approach. All users receive disclosure that AI recommendations reflect computational models with potential limitations requiring human oversight. Users opting for detailed explanations access descriptions of how behavioral patterns in AI systems might affect recommendations, along with organizational mitigation strategies. Behavioral research informed communication framing to maximize comprehension without inducing inappropriate distrust.

Contestability and human override: Individuals affected by AI-assisted decisions should have recourse. Effective governance includes clear processes for requesting human review, contesting AI-generated recommendations, and escalating concerns. These processes must be genuinely accessible rather than superficially provided.

European insurance company Allianz implemented a "AI decision review" process for claim assessments involving AI assistance. Claimants can request human-only review at any stage. Critically, human reviewers access information about specific behavioral biases the AI system exhibits in similar cases, prompting consideration of whether bias might have influenced the initial assessment. Review patterns inform ongoing AI system refinement.

Continuous Learning and Institutional Memory

Organizational capability in AI governance deepens through systematic learning from experience.

Incident investigation and root cause analysis: When behavioral biases contribute to adverse outcomes—investment losses, diagnostic errors, unfair treatment—structured investigation identifies contributing factors and preventive measures.

Credit union Navy Federal adopted aviation industry-inspired safety investigation methods for AI-related incidents. Investigations focus on system improvement rather than individual blame. A recent investigation into racially disparate credit decisioning found that LLM-based credit memo generation exhibited different base rate assumptions for similarly situated applicants differing by zip code—a pattern reflecting historical data biases amplified by model overconfidence. Investigation findings informed training data filtering, prompt modifications, and enhanced monitoring of demographic patterns in outputs.

Knowledge management and organizational memory: As personnel turnover occurs, institutional knowledge about specific model behavioral patterns risks loss. Effective organizations create knowledge repositories capturing learnings, decision rationale, and mitigation strategies.

Pharmaceutical company Roche maintains an "AI Behavioral Intelligence Wiki" documenting observed bias patterns, assessment methodologies, mitigation strategies, and deployment decisions across LLM applications. New personnel receive training in wiki use. Regular contributions from project teams create accumulating institutional intelligence.

Industry collaboration and standards development: Given shared challenges, industry collaboration on LLM behavioral standards creates public goods benefiting all organizations.

The Financial Services AI Consortium, comprising representatives from major banks, asset managers, and fintech firms, collaboratively developed behavioral assessment standards for LLMs in financial applications. Participating organizations contribute anonymized findings about model behavioral patterns, creating larger-sample insights than any single firm could generate. The Consortium publishes periodic guidance documents available to the broader industry, raising baseline governance practices.

Conclusion

Large language models represent a transformative technology reshaping how organizations process information, engage stakeholders, and make decisions. Yet these systems exhibit systematic behavioral patterns—biases in both rational and human-like directions—that create consequential risks and opportunities.

Key insights from the emerging behavioral economics of AI include:

Pattern recognition matters: Advanced LLMs increasingly mirror human behavioral biases in preference-based tasks (risk attitudes, time discounting, decision framing) while demonstrating enhanced rationality in belief-based tasks (probabilistic reasoning, statistical inference). Organizations deploying LLMs must recognize these domain-specific patterns.
Assessment precedes mitigation: Organizations cannot address biases they have not systematically identified. Structured evaluation frameworks adapted from cognitive psychology and behavioral economics provide essential foundations for responsible deployment.
Technical and organizational interventions complement each other: Prompt engineering, ensemble methods, and architectural choices reduce bias magnitude. Training programs, governance structures, and workflow design create organizational capabilities for sustained management of evolving systems.
Context determines appropriate behavior: No universal standard for "correct" AI behavior exists. Customer communication benefits from human-aligned language patterns; computational optimization requires rational frameworks. Organizations must explicitly determine use-case-appropriate behavioral characteristics.
Governance requires continuous adaptation: Rapid LLM evolution—new model versions, emerging capabilities, evolving use cases—demands dynamic governance frameworks rather than static compliance checklists.

The path forward requires interdisciplinary collaboration spanning computer science, behavioral economics, domain expertise, ethics, and organizational strategy. Financial services firms leading in this space pair machine learning engineers with behavioral economists. Healthcare organizations combine AI specialists with clinical ethicists and patient advocates. Effective governance structures ensure diverse perspectives inform deployment decisions.

Regulatory frameworks will continue evolving. The EU AI Act establishes requirements for high-risk AI systems including bias assessment, human oversight, and transparency. U.S. financial regulators scrutinize AI-driven consumer decisions through fair lending, consumer protection, and prudential supervision. Organizations establishing robust behavioral assessment capabilities will find compliance more manageable than those treating governance as afterthought.

Perhaps most fundamentally, understanding LLMs as a novel class of economic agents—entities with distinct behavioral characteristics requiring systematic study—represents a conceptual shift with profound implications. These systems are not neutral tools transparently executing human intent. They exhibit patterns, preferences, and limitations that shape outcomes. Recognizing this reality, approaching it with rigorous assessment, and building organizational capabilities for responsible stewardship will distinguish organizations successfully navigating the AI transformation from those experiencing costly surprises.

The behavioral economics of AI remains early-stage. Many questions await systematic research. How do behavioral patterns evolve with continued model scaling? Which prompt engineering interventions generalize across models and use cases? How should organizations balance bias mitigation against performance optimization when these objectives conflict? What governance structures prove most effective across different organizational contexts?

These questions invite collaboration among researchers, practitioners, policymakers, and civil society. The stakes—economic efficiency, fairness, safety, and human flourishing—warrant sustained attention and resources. Organizations investing now in AI behavioral assessment capabilities, governance frameworks, and interdisciplinary expertise will shape the trajectory of this transformative technology toward beneficial outcomes.

Research Infographic

AI Behavioral Architecture

References

Bail, C. A. (2024). Can generative AI improve social science? Proceedings of the National Academy of Sciences, 121(13), e2314021121.
Barberis, N. (2018). Psychology-based models of asset prices and trading volume. In D. Bernheim, S. DellaVigna, & D. Laibson (Eds.), Handbook of behavioral economics (pp. 79–175). North Holland.
Barberis, N., & Thaler, R. (2003). A survey of behavioral finance. In G. Constantinides, M. Harris, & R. M. Stulz (Eds.), Handbook of the economics of finance (pp. 1053–1128). North Holland.
Binz, M., & Schulz, E. (2023). Using cognitive psychology to understand GPT-3. Proceedings of the National Academy of Sciences, 120(6), e2218523120.
Bowen, D. E., Price, S. M., Stein, L. C., & Yang, K. (2025). Measuring and mitigating racial disparities in large language model mortgage underwriting. Working paper.
Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., et al. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877–1901.
Charness, G., Jabarian, B., & List, J. A. (2023). Generation next: Experimentation with AI. Working paper.
Chen, Y., Liu, T. X., Shan, Y., & Zhong, S. (2023). The emergence of economic rationality of GPT. Proceedings of the National Academy of Sciences, 120(51), e2316205120.
Consumer Financial Protection Bureau. (2023). Circular 2023-03: Adverse action notification requirements in connection with credit decisions based on complex algorithms. CFPB.
European Commission. (2024). Regulation (EU) 2024/1689 on artificial intelligence (AI Act). Official Journal of the European Union.
Korinek, A. (2023). Generative AI for economic research: Use cases and implications for economists. Journal of Economic Literature, 61(4), 1281–1317.
McKinsey Global Institute. (2023). The economic potential of generative AI: The next productivity frontier. McKinsey & Company.
Mei, Q., Xie, Y., Yuan, W., & Jackson, M. O. (2024). A Turing test of whether AI chatbots are behaviorally similar to humans. Proceedings of the National Academy of Sciences, 121(9), e2313925121.
Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447–453.
Office of the Comptroller of the Currency. (2023). Bulletin 2023-17: Third-party relationships: Risk management. OCC.
Stiennon, N., Ouyang, L., Wu, J., Ziegler, D. M., Lowe, R., et al. (2020). Learning to summarize from human feedback. Advances in Neural Information Processing Systems, 33, 3008–3021.
Thaler, R. H., & Sunstein, C. R. (2008). Nudge: Improving decisions about health, wealth, and happiness. Yale University Press.
Vidal, N. (2023, August 14). How AI and LLMs are streamlining financial services. Forbes.
Wei, J., Wang, X., Schuurmans, D., Bosma, M., Ichter, B., et al. (2022). Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems, 35, 24824–24837.

Jonathan H. Westover, PhD is Chief Research Officer (Nexus Institute for Work and AI); Associate Dean and Director of HR Academic Programs (WGU); Professor, Organizational Leadership (UVU); OD/HR/Leadership Consultant (Human Capital Innovations). Read Jonathan Westover's executive profile here.

Suggested Citation: Westover, J. H. (2026). The Behavioral Economics of Artificial Intelligence: Understanding and Mitigating Biases in Large Language Models. Human Capital Leadership Review, 34(3). doi.org/10.70175/hclreview.2020.34.3.2