OpenRobotsTXT (@openrobotstxt) 's Twitter Profile
OpenRobotsTXT

@openrobotstxt

OpenRobotsTXT is an open archive of the world’s robots.txt files.

ID: 1918316956043886594

linkhttp://OpenRobotsTxt.org calendar_today02-05-2025 14:50:28

4 Tweet

5 Followers

1 Following

OpenRobotsTXT (@openrobotstxt) 's Twitter Profile Photo

We're delighted to announce OpenRobotsTxt.org - a project to archive and analyse the world’s robots.txt files, kicking off with analysis of 595 million hostnames courtesy of a huge export of data from Majestic.

We're delighted to announce OpenRobotsTxt.org - a project to archive and analyse the world’s robots.txt files, kicking off with analysis of 595 million hostnames courtesy of a huge export of data from <a href="/Majestic/">Majestic</a>.
Glenn Gabe (@glenngabe) 's Twitter Profile Photo

Interested in user-agent data? @majestic has you covered -> Majestic Launches Robots.txt Archive 600M hostnames scanned and 36K user-agents found. "The project has been bootstrapped by a huge data export of robots.txt files collected by the Majestic crawler, MJ12bot. This has

Interested in user-agent data? @majestic has you covered -&gt; Majestic Launches Robots.txt Archive

600M hostnames scanned and 36K user-agents found. 

"The project has been bootstrapped by a huge data export of robots.txt files collected by the Majestic crawler, MJ12bot. This has