Free AI Text Detector
Scan any text for hidden AI fingerprints — zero-width characters, ChatGPT watermarks, and invisible markers. Proof, not probability.
100% client-side · No text is ever sent to a server · Works with ChatGPT, Claude, Gemini, Grok & more
How this works: This tool scans for technical fingerprints: hidden Unicode characters, HTML attributes, and formatting markers that AI chat interfaces inject into copied text. It does not analyze writing style or use machine learning. A low score means no technical markers were found, but the text could still be AI-generated and manually cleaned. A high score means strong technical evidence that the text was copied from an AI tool.
How Our AI Detector Works
Most AI detectors rely on machine learning models that analyze writing patterns like perplexity, burstiness, and sentence structure. The fundamental problem with this approach is false positives: human-written text gets flagged as AI-generated, and lightly edited AI text passes undetected. Statistical models guess — they do not prove.
GPT Cleanup's AI Detector takes a completely different approach. Instead of analyzing how text reads, we scan for what text contains — hidden Unicode characters, HTML data attributes, and invisible formatting markers that AI chat interfaces inject into copied text. These are technical fingerprints: deterministic evidence that software, not a human keyboard, produced the content.
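When we detect a ChatGPT data-start attribute or zero-width spaces scattered through a paragraph, that's forensic evidence, not a statistical guess. Each individual finding is binary and verifiable: either the marker exists in your text, or it doesn't. This avoids the false-positive problem that plagues NLP-based tools, because no finding depends on how the text reads. Weaker markers such as smart quotes also occur in ordinary formatted text, which is why they carry little weight on their own.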
The tool processes both the raw HTML and plain text versions of your input simultaneously. This dual-layer approach is critical because some markers (like ChatGPT's proprietary data-start, data-end, and data-message-* attributes) exist only in the HTML clipboard layer that rides along when you copy from ChatGPT's web interface. Plain-text-only analysis would miss these entirely.
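The dual-layer idea can be sketched in a few lines of Python. This is an illustration only, not GPT Cleanup's actual code: the function name `scan_layers`, the `AttrCollector` class, and the result keys are hypothetical, and only the attribute names and zero-width code points come from the description above.

```python
from html.parser import HTMLParser

# Attribute names are taken from the text above; the "data-message-" prefix
# stands in for the data-message-* family. Everything else is an assumption.
EXACT_ATTRS = {"data-start", "data-end"}
PREFIX_ATTRS = ("data-message-",)
ZERO_WIDTH = {"\u200b", "\u200c", "\u200d"}


class AttrCollector(HTMLParser):
    """Collects marker attributes seen on any start tag in the HTML layer."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        for name, _value in attrs:
            if name in EXACT_ATTRS or name.startswith(PREFIX_ATTRS):
                self.found.append(name)


def scan_layers(html: str, plain: str) -> dict:
    """Check the HTML layer for attributes and the plain layer for characters."""
    collector = AttrCollector()
    collector.feed(html)
    collector.close()
    zero_width = sum(1 for ch in plain if ch in ZERO_WIDTH)
    return {"html_attributes": collector.found, "zero_width_count": zero_width}


result = scan_layers(
    '<p data-start="0" data-end="12">Hello world</p>',
    "Hello\u200b world",
)
```

Passing an empty string as the HTML layer shows why plain-text-only analysis misses the attribute markers: the attribute list comes back empty even though the text was copied from a chat interface.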
How to Use the AI Detector in 3 Steps
Paste Your Text
Copy text directly from ChatGPT, Claude, Gemini, Grok, or any AI tool and paste it into the analysis panel. For best results, copy from the AI chat interface rather than an intermediate editor — this preserves the hidden HTML layer that contains the strongest markers.
Click "Scan"
One click runs our detection engine across 10 marker categories in both HTML and plain text layers. The analysis happens in milliseconds, entirely in your browser. No text is ever uploaded to a server.
Review Your Results
Get a 0-100 confidence score with a clear verdict (Human / Mixed / AI) plus a detailed breakdown of every finding organized by severity — from critical evidence like ChatGPT data attributes to weaker signals like smart quotes.
Complete List of AI Markers We Detect
Our scanner checks for 10 categories of technical AI markers, each assigned a severity weight based on how reliably it indicates AI origin. Critical markers are near-definitive proof — they virtually never appear in human-typed text. Weak markers are common in formatted text and only become meaningful when combined with stronger signals. The weighted scoring system produces a 0–100 confidence score.
| Marker | Severity | Reference | Why It Matters |
|---|---|---|---|
| ChatGPT HTML Attributes | Critical | data-start, data-end | Proprietary HTML markup injected by ChatGPT's web interface during copy — the single strongest signal of AI origin, virtually impossible to find in human-authored text |
| Zero-Width Characters | Critical | U+200B, U+200C, U+200D | Invisible Unicode characters that AI chat UIs embed for cursor tracking and rich-text management; they split keywords, break JSON, and are flagged by AI detection tools |
| Byte Order Mark (BOM) | Strong | U+FEFF | Legacy encoding marker that should only appear at the start of a file — when found mid-text, it's a strong indicator of AI text processing pipeline artifacts |
| Word Joiner | Strong | U+2060 | Invisible word-binding character that prevents line breaks — rare in human text, commonly injected by AI chat interfaces during text rendering |
| Direction Markers | Strong | U+200E, U+200F | Left-to-right and right-to-left marks used for bidirectional text control — suspicious when found in monolingual English text from AI tools |
| Non-Breaking Spaces | Moderate | U+00A0 | Looks identical to a normal space but prevents line wrapping — AI tools frequently substitute regular spaces with NBSP, a detectable pattern |
| Soft Hyphens | Moderate | U+00AD | Invisible hyphenation hints that only render at line breaks — AI text rendering engines insert these for layout optimization, humans almost never type them |
| Smart Quotes | Weak | U+201C–U+201F | Curly quotation marks that AI tools auto-convert from straight quotes; common in formatted text but suspicious in quantity alongside other markers |
| En and Em Dashes | Weak | U+2013, U+2014 | En dashes (U+2013) and em dashes (U+2014), which AI models heavily favor over hyphens; a stylistic signature that becomes relevant when combined with technical markers |
| Unicode Ellipsis | Weak | U+2026 | Single-character ellipsis that AI tools auto-convert from three dots — a minor signal that adds weight when found with stronger evidence |
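The plain-text rows of the table translate naturally into a lookup structure. The sketch below is one way to express them in Python, under stated assumptions: the names `MARKERS` and `find_markers` and the category keys are hypothetical, while the code points and severities are copied from the table. The ChatGPT HTML attributes are left out because they live in the HTML layer, not in the text itself.

```python
# Category -> (severity, code points), copied from the table above.
MARKERS = {
    "zero_width": ("critical", "\u200b\u200c\u200d"),
    "bom": ("strong", "\ufeff"),
    "word_joiner": ("strong", "\u2060"),
    "direction_marks": ("strong", "\u200e\u200f"),
    "nbsp": ("moderate", "\u00a0"),
    "soft_hyphen": ("moderate", "\u00ad"),
    "smart_quotes": ("weak", "\u201c\u201d\u201e\u201f"),
    "dashes": ("weak", "\u2013\u2014"),
    "ellipsis": ("weak", "\u2026"),
}


def find_markers(text: str) -> dict:
    """Return every category present in text, with severity and positions."""
    findings = {}
    for name, (severity, chars) in MARKERS.items():
        positions = [i for i, ch in enumerate(text) if ch in chars]
        if name == "bom":
            # A BOM at the very start of the input is legitimate;
            # only mid-text occurrences count as a marker.
            positions = [i for i in positions if i != 0]
        if positions:
            findings[name] = {"severity": severity, "positions": positions}
    return findings


sample = "\ufeffHi\u200b there\u00a0\u2014 ok\ufeff"
findings = find_markers(sample)
```

Reporting positions rather than a bare yes/no lets a reviewer confirm each finding by inspecting the character at that index.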
Who Uses AI Text Detection
Educators & Academics
Verify student submissions with concrete technical evidence. Our detector complements NLP-based tools like Turnitin by finding proof rather than probability — zero-width characters don't lie, even when writing style appears human.
Content Teams & Editors
Check freelancer deliverables and contributed content for unedited AI output. Detect when text has been pasted directly from ChatGPT without any manual editing by scanning for the HTML fingerprints that copy operations preserve.
SEO Professionals
Find hidden characters that silently damage search performance. Zero-width spaces in title tags split keywords, break schema markup, and cause indexing anomalies that are invisible to the human eye but visible to search engine crawlers.
Developers & QA Teams
Detect invisible Unicode contamination in code snippets, API responses, JSON payloads, and configuration files sourced from AI tools. A single zero-width character can cause parsing failures that are nearly impossible to debug.
Technical Detection vs. NLP-Based AI Detectors
Traditional AI detectors like GPTZero, Originality.ai, and Copyleaks use natural language processing to analyze statistical patterns in writing — measuring perplexity (how predictable each word is), burstiness (variation in sentence complexity), and stylistic markers. While powerful, this approach has a fundamental limitation: it produces probability scores, not proof. A well-written human essay can score as "likely AI," and a lightly paraphrased AI text can score as "likely human."
Turnitin's AI detection module combines NLP analysis with its massive academic text database, giving it strong performance in educational contexts. However, it still relies on probabilistic scoring and has documented cases of false positives that have caused real harm to students.
GPT Cleanup's approach is orthogonal: we don't replace these tools, we complement them. Our detector finds concrete, verifiable evidence: a ChatGPT data-start attribute either exists in the text or it doesn't. A zero-width space is either present or absent. Whether a marker is present is never a matter of probability; what varies is how strongly each marker points to AI origin, which is what the severity weights and the overall score express.
The ideal workflow combines both approaches: use GPT Cleanup's AI Detector first for definitive technical evidence, then cross-reference with an NLP-based tool for stylistic analysis. If both methods agree, you have strong confidence. If they disagree, the technical evidence tells you what actually happened at the copy-paste level.
Understanding Your Detection Score
The AI Detector produces a weighted confidence score from 0 to 100, calculated by summing severity-weighted points for each detected marker category. The score maps to three verdict bands:
Human: No significant technical markers found. The text shows no evidence of AI tool origin at the clipboard level. Note: this doesn't guarantee the text wasn't AI-generated; it may have been cleaned or manually retyped.
Mixed: Some technical markers detected, but not enough for a definitive conclusion. This often means the text was partially edited after copying from an AI tool, or contains formatting artifacts that could have other sources.
AI: Strong technical evidence of AI tool origin. Multiple high-severity markers detected, such as ChatGPT data attributes, zero-width characters, or a combination of strong signals. This strongly indicates unedited copy-paste from an AI interface.
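The scoring step can be sketched as a weighted sum capped at 100. The point values and band thresholds below are placeholders chosen for illustration; the text does not state the real ones. The function names `score` and `verdict` are likewise hypothetical.

```python
# Hypothetical weights and thresholds: the actual values used by the
# detector are not given in the text.
SEVERITY_POINTS = {"critical": 40, "strong": 20, "moderate": 10, "weak": 3}
MIXED_FROM = 20  # assumed lower bound of the "Mixed" band
AI_FROM = 60     # assumed lower bound of the "AI" band


def score(detected: dict) -> int:
    """Sum points per detected category (name -> severity), capped at 100."""
    total = sum(SEVERITY_POINTS[severity] for severity in detected.values())
    return min(total, 100)


def verdict(points: int) -> str:
    """Map a 0-100 score to one of the three verdict bands."""
    if points >= AI_FROM:
        return "AI"
    if points >= MIXED_FROM:
        return "Mixed"
    return "Human"


weak_only = score({"smart_quotes": "weak", "nbsp": "moderate"})
one_critical = score({"zero_width": "critical", "nbsp": "moderate"})
many = score({
    "chatgpt_attributes": "critical",
    "zero_width": "critical",
    "bom": "strong",
    "word_joiner": "strong",
})
```

Each category counts once regardless of how many times it occurs, so a document full of smart quotes cannot reach the top band on weak signals alone under these assumed weights.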
Frequently Asked Questions
Everything you need to know about detecting AI-generated text
How does GPT Cleanup's AI Detector work?
Is this AI detector free?
What's the difference between this and GPTZero or Turnitin?
Can AI text pass this detector?
Is my text private?
What hidden characters does this detect?
What does each severity level mean?
Detects Fingerprints from Every AI Tool
Our detection engine identifies technical markers from every major AI writing platform. Each tool's chat interface injects its own combination of hidden Unicode characters and HTML attributes during copy operations. We scan for all known patterns across every platform.
Clean AI Text with GPT Cleanup
Found AI fingerprints? Remove all hidden characters, zero-width spaces, and ChatGPT watermarks in one click. Free, instant, and 100% private — no data leaves your browser.
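For readers who want to see what such a cleanup involves, here is a minimal Python sketch. It is an assumption about the general technique, not GPT Cleanup's implementation, and the names `INVISIBLE`, `CLEAN_TABLE`, and `clean_text` are hypothetical. It deletes the invisible characters listed in the marker table and turns non-breaking spaces into regular spaces.

```python
# Invisible code points from the marker table: zero-width characters, BOM,
# word joiner, direction marks, and soft hyphen.
INVISIBLE = "\u200b\u200c\u200d\ufeff\u2060\u200e\u200f\u00ad"

# Map each invisible character to None (delete) and NBSP to a plain space.
CLEAN_TABLE = str.maketrans({**{ch: None for ch in INVISIBLE}, "\u00a0": " "})


def clean_text(text: str) -> str:
    """Strip invisible characters and normalize non-breaking spaces."""
    return text.translate(CLEAN_TABLE)


cleaned = clean_text("key\u200bword\u00a0here\ufeff")
```

Note that this sketch removes U+200C and U+200D unconditionally. Those characters are legitimate in some scripts and in emoji sequences, so a production cleaner would need to treat multilingual text more carefully.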