Skip to content

sefinek/trusted-ips-whitelist

Repository files navigation

🤖 Known Bots IP Whitelist

This repository contains up-to-date lists of IP addresses of known bots and crawlers, useful for whitelisting or filtering network traffic. They can also be used as blacklists.

Do you have any questions or want to receive notifications about important changes or new features in my repositories? Join my Discord server! If you don't use Discord, you can also open an issue on GitHub.

The project is released under the MIT license — you can do whatever you want with it.
If you like this repository, leave a star ⭐. Thank you!

🔄 Update frequency 📋 Changelog
Every 6 hours CHANGELOG.md

📘 Combined IP Lists

🔀 All (11,504 IPs, 5,438 CIDRs)

Format URL
TXT raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-safe-ips.txt
JSON raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-safe-ips.json
CSV raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-safe-ips.csv

🤖 Crawlers only (9,910 IPs, 3,599 CIDRs)

Format URL
TXT raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-crawlers-ips.txt
JSON raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-crawlers-ips.json
CSV raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-crawlers-ips.csv

🧠 AI only (0 IPs, 335 CIDRs)

Format URL
TXT raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-ai-ips.txt
JSON raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-ai-ips.json
CSV raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-ai-ips.csv

📡 Monitoring only (591 IPs, 82 CIDRs)

Format URL
TXT raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-monitoring-ips.txt
JSON raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-monitoring-ips.json
CSV raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-monitoring-ips.csv

🏗️ Infrastructure only (1,003 IPs, 1,422 CIDRs)

Format URL
TXT raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-infrastructure-ips.txt
JSON raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-infrastructure-ips.json
CSV raw.githubusercontent.com/sefinek/trusted-ips-whitelist/main/lists/all-infrastructure-ips.csv

🌍 Supported Services

🤖 Crawlers (9,910 IPs, 3,599 CIDRs)

Search engines, SEO tools and web testing bots.

Service IPs & CIDRs Sources Downloads
GoogleBot 0 - 311 developers.google.com TXTCSVJSON
Google Special Crawlers 0 - 266 developers.google.com TXTCSVJSON
Google User-Triggered Fetchers 0 - 1,504 Fetchers & Google TXTCSVJSON
BingBot 0 - 28 www.bing.com TXTCSVJSON
DuckDuckBot 0 - 319 duckduckgo.com TXTCSVJSON
YandexBot 0 - 16 yandex.com TXTCSVJSON
FacebookBot 0 - 1,041 RIPEstat & RADB TXTCSVJSON
Applebot 0 - 12 search.developer.apple.com TXTCSVJSON
Kagi 4 - 0 Custom list TXTCSVJSON
AhrefsBot 9,870 - 0 api.ahrefs.com TXTCSVJSON
Semrush 0 - 7 RIPEstat & RADB TXTCSVJSON
WebPageTest Bot 35 - 0 www.webpagetest.org TXTCSVJSON

🧠 AI (0 IPs, 335 CIDRs)

AI crawlers from large language model providers.

Service IPs & CIDRs Sources Downloads
ClaudeBot 0 - 20 claude.com TXTCSVJSON
GPTBot 0 - 21 openai.com TXTCSVJSON
OpenAI SearchBot 0 - 35 openai.com TXTCSVJSON
ChatGPT User 0 - 240 openai.com TXTCSVJSON
PerplexityBot 0 - 8 perplexity.ai TXTCSVJSON
Perplexity User 0 - 4 perplexity.ai TXTCSVJSON

📡 Monitoring (591 IPs, 82 CIDRs)

Uptime monitoring services and internet scanners.

Service IPs & CIDRs Sources Downloads
BetterStack 34 - 0 uptime.betterstack.com TXTCSVJSON
PingdomBot 152 - 0 IPv4 & IPv6 TXTCSVJSON
Pulsetic 48 - 2 IPv4 & IPv6 TXTCSVJSON
UptimeRobot 232 - 0 uptimerobot.com TXTCSVJSON
Censys 0 - 36 RIPEstat & RADB TXTCSVJSON
Modat Scanner 0 - 35 scanner.modat.io TXTCSVJSON
Shodan 95 - 6 Custom list TXTCSVJSON

🏗️ Infrastructure (1,003 IPs, 1,422 CIDRs)

CDN providers, hosting networks, DNS resolvers and web services.

Service IPs & CIDRs Sources Downloads
Cloudflare 0 - 22 IPv4 & IPv6 TXTCSVJSON
Bunny CDN 906 - 0 IPv4 & IPv6 TXTCSVJSON
Canonical 0 - 37 RIPEstat & RADB TXTCSVJSON
NASK PL 0 - 642 RIPEstat & RADB TXTCSVJSON
Palo Alto Networks 0 - 93 RIPEstat, RADB & Custom list TXTCSVJSON
DNS Resolvers 72 - 0 Custom list TXTCSVJSON
Stripe 15 - 0 stripe.com TXTCSVJSON
TelegramBot 0 - 14 core.telegram.org TXTCSVJSON
RSS API 1 - 1 rssapi.net TXTCSVJSON
Baidu 0 - 616 RIPEstat & RADB TXTCSVJSON

About

Collection of IP addresses used by legitimate crawlers and services: Googlebot, Bingbot, AhrefsBot, UptimeRobot, Pingdom, Cloudflare, Bunny CDN, Stripe, Shodan, FacebookBot, TelegramBot, and more.

Topics

Resources

License

Stars

Watchers

Forks

Contributors