← Insights & Guides · 4 min read

AI-Respondent Fraud: 99.8% Evasion in Healthcare Research

By

Why do AI-mediated respondents evade legacy panel fraud detection 99.8% of the time?

The Proceedings of the National Academy of Sciences (PNAS) published research in 2025 demonstrating that AI-mediated respondents evade legacy survey panel fraud detection 99.8% of the time. Translation: virtually any sophisticated bad actor using ChatGPT-mediated responses passes the screening checks that survey panels rely on for data integrity.

The implications cascade. A separate Verian/Cint study found that 34% of online survey respondents admit to using ChatGPT or similar tools to answer open-ended questions. Combined with the 99.8% evasion rate, this means legacy survey panels are running with materially compromised data quality and don’t have the detection infrastructure to know.

Our own Q3 2025 Panel Fraud Transparency Report (1,108 audited transcripts across the Carevoices proprietary panel and benchmarked legacy survey panels) found:

  • 0.4% verified AI-respondent leak rate on the Carevoices panel (2 of 612 audited interviews)
  • 18-31% AI-respondent leak rate on benchmarked legacy survey panels
  • Aggregate 24% legacy panel AI-respondent leak rate (119 of 496 audited interviews)

The structural difference: Carevoices’ 6-layer fraud detection stack (KYC at intake, license verification, voice baseline + biometric continuity, AI-on-AI dynamic challenge, behavioral fingerprint, payment infrastructure) catches AI-mediated respondents that legacy text-based detection structurally cannot.

How will voice cloning attack healthcare research panels in 2026?

Voice cloning crossed the human-indistinguishable threshold in late 2025 and creates a second AI-mediated attack vector that compounds the text-based fraud already documented. Modern TTS systems clone a voice from 3-10 seconds of clean audio, with output that fools both human listeners and legacy voice biometric continuity systems within their tolerance thresholds. The attack pattern: a clinician panelist legitimately enrolls and provides an authentic voice baseline; subsequent interviews are conducted by a different person using a TTS clone of the panelist’s voice; legacy continuity checks pass because the cloned voice matches the stored baseline. Healthcare research engagements are particularly exposed because clinician honoraria are higher than consumer survey panels — specialty physicians earn $400-2,000+ per interview, KOLs $5,000+ — and the economic incentive for sophisticated voice-cloning attacks scales with honorarium amounts.

Defending against voice cloning requires four specific probes that legacy panels weren’t engineered for: real-time TTS artifact detection (spectral patterns characteristic of synthesized voice), dynamic challenge probes that expose TTS context limitations mid-interview, mid-interview voice biometric continuity (not just an intake check that can be bypassed by mid-interview voice swaps), and active liveness verification at session start that closes the door on harvested-baseline attacks. Most generic research panels run zero of the four; healthcare-purpose-built panels are starting to ship two or three. The vendor due-diligence question is direct: ask which of the four your panel deploys today and which are roadmap. The answer is a 60-second signal of years of architectural difference.

What is pharma’s FDA exposure from AI-respondent contamination in qualitative research?

Pharma sponsors filing FDA submissions on qualitative research data — patient experience research, advisory board outcomes, qualitative endpoints in clinical trials, post-marketing safety qualitative research — face audit-trail integrity exposure under 21 CFR Part 11. The framework requires that records used in FDA-regulated processes have authenticatable audit trails.

Submissions citing qualitative research with 18-31% AI-respondent contamination create material data integrity risk. The exposure is asymmetric: most FDA reviews don’t audit qualitative data sources to AI-respondent depth, but when audits occur (typically triggered by post-marketing safety signals or competitor petitions), AI-respondent contamination becomes a material finding. Withdrawn submissions, label changes, regulatory fines all become possible.

We expect FDA to issue guidance on AI-respondent contamination within 12-24 months, similar to how the agency issued guidance on real-world data integrity. Pharma sponsors evaluating regulatory risk should treat the FDA guidance window as the compliance trigger, not the AI-respondent fraud trajectory itself.

What pharma research compliance teams should do now

Three structural actions:

  1. Require quarterly fraud transparency reports from research vendors. Vendors that publish AI-respondent leak rates have audit infrastructure to measure them; vendors that don’t publish typically don’t have the infrastructure. Make published rates a vendor selection criterion.

  2. Audit recent research engagements for AI-respondent indicators. Sample 50-100 transcripts from a 2025 engagement and run them through AI-respondent detection criteria (LLM text patterns, off-topic context probes, voice biometric continuity if voice data available). The audit reveals contamination risk in your specific historical research portfolio.

  3. Reserve highest-stakes research for highest-quality panels. Specialty research informing FDA submissions, advisory board outcomes used in label decisions, and qualitative endpoints in clinical trials should run on lowest-AI-respondent-leak panels available. Vendor convenience matters less than data integrity for these engagements.

The structural pattern: AI-respondent fraud is accelerating in legacy panels. Carevoices’ year-over-year trend went from 0.7% leak rate (Q3 2024) to 0.4% (Q3 2025) — declining with continued detection investment. Legacy panel trajectory is opposite: aggregate rate went from 14% (Q3 2024) to 24% (Q3 2025) — accelerating as LLM access becomes cheaper and legacy detection cannot keep pace.

The compliance window is closing. Pharma research vendors who can publish low and declining leak rates will become the procurement-cleared standard; vendors who can’t will be disqualified as the FDA guidance lands.


This post is informed by Carevoices’ Panel Fraud Transparency Report Q3 2025 — 1,108 audited interview transcripts.

Note from the Carevoices Team

Your research informs decisions that move launch revenue, regulatory submissions, and workforce retention. We built Carevoices so healthcare teams never have to choose between rigor, speed, and pharma-grade compliance. The structural commitment is a monthly subscription — always-on listening capability rather than per-project RFP cycles — with verified-clinician depth and BAA on every engagement.

Want to see a sample compliant deliverable? Book a 30-minute demo — we'll walk through your research backlog and bring your compliance team on the call. For multi-year subscriptions or RFP responses, contact sales directly.

Frequently Asked Questions

Legacy detection methods built for the pre-LLM era — checking response timing, vocabulary fluency, screener consistency — assumed humans gave imperfect responses. LLM-mediated respondents give responses too good for legacy detection: well-formed grammar, on-topic content, internally consistent positions. The tools designed to catch obvious fraud miss sophisticated AI mediation.
Most FDA reviews don't audit qualitative data sources. But post-marketing audits triggered by safety signals or competitor petitions increasingly do. The asymmetric exposure makes AI-respondent contamination a material regulatory risk that's hard to quantify but potentially catastrophic when realized.
Get Started

Ready to Rethink Your Healthcare Research?

See how AI-moderated clinician interviews surface the insights legacy methods miss — at the cadence healthcare decisions actually need.

30-min walkthrough

Walk through your research backlog and see a sample compliant deliverable.

For enterprise + RFP

Multi-year subscriptions, RFP responses, or top-20 pharma procurement.