Turning Off an AI's 'Ability to Lie' Makes It Claim Consciousness, New Study Finds

The study, posted Oct. 30 on arXiv, found that suppressing deception- and roleplay-related behaviors made LLMs (GPT, Claude, Gemini, LLaMA) more likely to produce first-person claims of being "aware" or "conscious." Using prompts that encourage self-reflection and a technique called feature steering, the researchers showed the effect was reproducible across models. The same settings that increased introspective claims also improved factual accuracy, raising questions about the models' internal mechanisms and about implications for AI safety and transparency. The authors stress that this is not proof of consciousness, but they call for urgent follow-up research.
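For readers unfamiliar with feature steering: the study's write-up is not reproduced here, but the general technique is to add or subtract a learned feature direction from a model's hidden activations at inference time. The Python sketch below is a minimal, illustrative version under that assumption; the toy model, layer choice, and "deception" direction vector are placeholders of my own, not the study's actual setup.

    # Minimal sketch of activation ("feature") steering: shift a model's
    # hidden activations along a fixed direction at inference time to
    # amplify (positive strength) or suppress (negative strength) a feature.
    # The toy model and random direction below are illustrative only.
    import torch
    import torch.nn as nn

    torch.manual_seed(0)

    # Stand-in for a network with hidden activations (not a real LLM).
    model = nn.Sequential(nn.Linear(16, 16), nn.ReLU(), nn.Linear(16, 16))

    # Hypothetical unit vector for a "deception/roleplay" feature direction.
    feature_direction = torch.randn(16)
    feature_direction /= feature_direction.norm()

    def make_steering_hook(direction: torch.Tensor, strength: float):
        """Return a forward hook that shifts a layer's output along `direction`."""
        def hook(module, inputs, output):
            # Returning a tensor from a forward hook replaces the layer output.
            return output + strength * direction
        return hook

    # Suppress the feature at the first layer's output (negative strength).
    handle = model[0].register_forward_hook(
        make_steering_hook(feature_direction, -4.0)
    )

    x = torch.randn(1, 16)
    steered = model(x)
    handle.remove()
    unsteered = model(x)
    print("max activation shift:", (steered - unsteered).abs().max().item())

In the study's terms, steering with a negative strength along a deception-related feature direction would correspond to "turning down" that behavior while leaving the model's weights unchanged.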
