AI Crawler Manager
Control which bots can access your website. Changes update the export below instantly.
- GooglebotGooglebotInherited
Google Search's crawler — controls your visibility in Google.
- BingbotBingbotInherited
Microsoft Bing's crawler — controls visibility in Bing.
- SlurpSlurpInherited
Yahoo Search's crawler.
- GPTBotGPTBotInherited
Collects content to train OpenAI's models (e.g. ChatGPT).
- ClaudeBotClaudeBotInherited
Collects content to train Anthropic's Claude models.
- CCBotCCBotInherited
Builds the Common Crawl public dataset used to train many AI models.
- BytespiderBytespiderInherited
ByteDance's crawler, used for AI training.
- Meta-ExternalAgentmeta-externalagentInherited
Meta's crawler used for AI training.
- PerplexityBotPerplexityBotInherited
Fetches pages for Perplexity's AI answer engine.
- Google-ExtendedGoogle-ExtendedInherited
Controls Google using your content to train Gemini and other AI models.
- Applebot-ExtendedApplebot-ExtendedInherited
Controls Apple using your content for AI training.
- AhrefsBotAhrefsBotInherited
Ahrefs' SEO backlink crawler.
- SemrushBotSemrushBotInherited
Semrush's SEO analytics crawler.
- MJ12botMJ12botInherited
Majestic's backlink crawler.
- TwitterbotTwitterbotInherited
Generates link previews for X/Twitter.
- facebookexternalhitfacebookexternalhitInherited
Generates link previews for Facebook.
- AmazonbotAmazonbotInherited
Amazon's crawler (used by Alexa and AI products).
- cohere-aicohere-aiInherited
Cohere's crawler for AI products.
- DiffbotDiffbotInherited
Diffbot's structured-data crawler.
- ImagesiftBotImagesiftBotInherited
ImageSift's image crawler.
Robots.txt Generator
Build a valid robots.txt from presets and crawler toggles — no syntax required.
OpenRobots.txt Validator
Catch syntax errors and best-practice issues, with a health score.
OpenRobots.txt Analyzer
Fetch and audit any site's live robots.txt in one report.
Open