Crawlers

Only people who didnโ€™t know anything about anything thought that was true.

I personally โ€” and I mean with my own funds โ€” could build enough infrastructure to crawl 90%+ of the web every few days and store the metadata. Itโ€™s just not that much data. Itโ€™d take about $200,000 of gear and connectivity. Thatโ€™d buy me a dozen petabytes of storage and a couple dozen 10Gb links. And thatโ€™s enough. The software is all open source.

Googleโ€™s moat has nothing to do with the ability to crawl or digest anything and more to do with their former search algo dominance.

Leave a Reply

Your email address will not be published. Required fields are marked *