https://commoncrawl.org/

Common Crawl

Us
We build and maintain an open repository of web crawl data that can be accessed and analyzed by anyone.

You
Need years of free web page data to help change the world.

40+ Languages
Raw Data, Metadata, Text Data

We gather it. We aggregate it. You utilize it. And it's all free.

How big? We're talking BIG. Petabytes big.

Our Story
Billions of pages, Trillions of links

Access to data is a good thing, right? Please donate today, so we can continue to provide you and others like you with this priceless resource.

Don't forget, Common Crawl is a registered 501(c)(3) non-profit, so your donation is tax deductible!