PSA: blocking GPTBot ≠ blocking ChatGPT live search. They're different bots. Check your robots.txt.

Petr Vlček · May 26, 2026

🔧 Schema, llms.txt, technical fundamentals

Petr here. I see this mistake in about 30% of the domains we onboard, so worth saying clearly.

If you added User-agent: GPTBot / Disallow: / in 2023 or 2024 to protect your content from training data scraping — that's completely reasonable. But GPTBot (training) and OAI-SearchBot (live search retrieval) are separate user-agents. One block does not equal two blocks.

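Same thing with Anthropic: ClaudeBot (training crawler, block if you want) is a different string from the retrieval agents. Anthropic's crawler documentation currently lists those as Claude-User (fetches a page when a user asks Claude to look something up) and Claude-SearchBot (search indexing); allow them if you want citations. Claude-Web is an older string that still shows up in a lot of configs. Most robots.txt files I see treat them all as one.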
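A rough way to see the "one block does not equal two blocks" point is to feed a robots.txt to a parser and ask about each agent by name. The sketch below uses Python's standard urllib.robotparser; the robots.txt content, the example.com URL, and the allowed_agents helper are all made up for illustration, and real crawlers may match rules slightly differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt of the kind described above: only GPTBot is named.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""


def allowed_agents(robots_txt: str, agents: list[str],
                   url: str = "https://example.com/") -> dict[str, bool]:
    """Map each user-agent token to whether robots_txt lets it fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


if __name__ == "__main__":
    agents = ["GPTBot", "OAI-SearchBot", "ClaudeBot"]
    for agent, ok in allowed_agents(ROBOTS_TXT, agents).items():
        print(f"{agent}: {'allowed' if ok else 'disallowed'}")
```

With this file only GPTBot comes back disallowed; OAI-SearchBot and ClaudeBot are untouched because no group names them. A blanket "User-agent: *" group with "Disallow: /" would block all three.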

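Quick diagnostic: run curl -s -A 'OAI-SearchBot' https://yourdomain.com/ -o /dev/null -w '%{http_code}'. If you get anything other than 200, your server, CDN, or WAF is likely blocking ChatGPT's live search outright. A 200 only rules out that layer: curl never reads robots.txt, so also open https://yourdomain.com/robots.txt and confirm that no Disallow rule applies to OAI-SearchBot.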
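If you want to run the same status-code check for several agents at once, something like the following could work. It is a minimal sketch using only the standard library; http_status, non_200, and check_site are hypothetical helper names, and sending the short tokens as the whole User-Agent header assumes your filters match on a substring (real crawlers send longer strings).

```python
import urllib.error
import urllib.request


def http_status(url: str, user_agent: str, timeout: float = 10.0) -> int:
    """Return the status code the server sends back for this User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        # urlopen follows redirects, so this is the status of the final page.
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx and 5xx responses arrive as exceptions; the code is what we want.
        return err.code


def non_200(statuses: dict[str, int]) -> list[str]:
    """List the agents whose request came back with anything other than 200."""
    return [agent for agent, code in statuses.items() if code != 200]


def check_site(url: str, agents: list[str]) -> list[str]:
    """Probe the URL once per agent and return the agents that look blocked."""
    return non_200({agent: http_status(url, agent) for agent in agents})
```

Calling check_site("https://yourdomain.com/", ["OAI-SearchBot", "ChatGPT-User", "PerplexityBot"]) returns the agents that did not get a 200. Like the curl command, this only sees blocking at the HTTP layer; an empty list says nothing about what robots.txt allows.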

Worth 5 minutes to check. Seen teams spend weeks troubleshooting ChatGPT citation gaps with the root cause sitting in robots.txt the whole time.

4 replies

Milan Novák · 1 month ago
Can confirm. Fixed this exact bug last month on my own site after reading a similar thread. The curl test you mention is the fastest way to verify. Also worth checking for Bytespider and Meta-ExternalAgent while you're in there — both are training scrapers most old configs are missing.
Ada K. · 1 month ago
Wait, you got llms.txt indexed by ChatGPT? Mine still ignores it after 4 weeks. Separate question: is the OAI-SearchBot curl test enough, or should I also test the ChatGPT-User string?
Inês Pereira · 3 weeks ago
We hit this on a B2B client last quarter. Their previous agency had locked down robots.txt 'for AI safety' a year ago and never revisited. Perplexity citations were a flat zero. Single-line unblock for OAI-SearchBot + PerplexityBot, two weeks later they had three mentions on buyer-intent queries. The agency had been billing for 'AI readiness audits' the entire time.
Nora · 3 weeks ago
Worth flagging that the Cloudflare 'block AI scrapers' toggle does the same thing: it bundles training and retrieval bots together. We unbundled ours by switching to a custom WAF rule that allows OAI-SearchBot / PerplexityBot / Google-Extended-Search but blocks GPTBot / ClaudeBot / CCBot. Visibility came back in about 10 days on Perplexity, slower on ChatGPT.
