Crawlers

May 21, 2025, 4:11 PM Chill Annoying, Tech

One thing that nobody’s talking about:

It’s been long assumed that one of Google’s fundamental moats is their infrastructure: that it is hard to crawl, index, store, and digest the entire internet

Basically all the big labs are disproving this. Their crawlers are everywhere

— John Loeber 🎢 (@johnloeber) May 20, 2025

Only people who didn’t know anything about anything thought that was true.

I personally — and I mean with my own funds — could build enough infrastructure to crawl 90%+ of the web every few days and store the metadata. It’s just not that much data. It’d take about $200,000 of gear and connectivity. That’d buy me a dozen petabytes of storage and a couple dozen 10Gb links. And that’s enough. The software is all open source.
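
As a rough sanity check on those numbers, here is a back-of-envelope sketch in Python. The link count (24), the three-day window, and full link utilization are illustrative readings of "a couple dozen" and "every few days," not figures stated in the post, and the function name is hypothetical.

```python
def transfer_capacity_petabytes(
    links: int,
    gbit_per_link: float,
    days: float,
    utilization: float = 1.0,
) -> float:
    """Estimate how many petabytes a set of network links can move.

    All inputs are assumptions for illustration:
    - links: number of links (assumed 24 for "a couple dozen")
    - gbit_per_link: line rate of each link in gigabits per second
    - days: length of one crawl cycle (assumed 3 for "every few days")
    - utilization: fraction of line rate actually sustained (assumed 1.0)
    """
    # Convert gigabits per second to bytes per second across all links.
    bytes_per_second = links * gbit_per_link * 1e9 / 8 * utilization
    seconds = days * 86_400
    # 1 petabyte = 1e15 bytes (decimal units).
    return bytes_per_second * seconds / 1e15


capacity = transfer_capacity_petabytes(links=24, gbit_per_link=10, days=3)
print(f"{capacity:.2f} PB moved per crawl cycle")
```

Under these assumptions the links can move roughly 7.8 PB per three-day cycle, which fits within the dozen petabytes of storage mentioned above. Real sustained throughput would be lower than line rate, so treat this as an upper bound.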

Google’s moat has nothing to do with the ability to crawl or digest anything and more to do with their former search algo dominance.


