Skip to content

Commit 67903f4

Browse files
authored
Merge pull request #10397 from internetarchive/robots-txt-crawl-delays
Add crawl delays to robots.txt
2 parents c8e2621 + e6436fa commit 67903f4

File tree

1 file changed

+32
-1
lines changed

1 file changed

+32
-1
lines changed

static/robots.txt

Lines changed: 32 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,13 +2,44 @@
22
User-agent: *
33
Disallow: /api
44
Disallow: /edit
5+
Disallow: /account
56
Disallow: /search
67
Disallow: /search/inside
8+
Disallow: */borrow*
9+
Disallow: /*.rdf$
710

811
Sitemap: https://openlibrary.org/static/sitemaps/siteindex.xml.gz
912

1013
User-agent: Baiduspider
11-
Crawl-delay: 0.5
14+
Crawl-delay: 10
1215

1316
User-agent: Googlebot
1417
Disallow: /*.rdf$
18+
Crawl-delay: 10
19+
20+
User-agent: AhrefsBot
21+
Crawl-delay: 10
22+
23+
User-agent: meta-externalagent
24+
Crawl-delay: 10
25+
26+
User-agent: anthropic-ai
27+
Crawl-delay: 10
28+
29+
User-agent: ClaudeBot
30+
Crawl-delay: 10
31+
32+
User-agent: openai
33+
Crawl-delay: 10
34+
35+
User-agent: ChatGPT
36+
Crawl-delay: 10
37+
38+
User-agent: bingbot
39+
Crawl-delay: 10
40+
41+
User-agent: Yandex
42+
Crawl-delay: 10
43+
44+
User-agent: *bot
45+
Crawl-delay: 10

0 commit comments

Comments
 (0)