Post B6O84ItqOxEKhcbals by kkarhan@c.im
(DIR) Post #B6O6Oqppjyw3pnOp28 by internetarchive@mastodon.archive.org
2026-05-17T12:26:17Z
1 likes, 0 repeats
As high-profile websites vanish, it’s a reminder that the web has no built-in archival layer. But some publishers are now blocking the Wayback Machine. What’s at stake if the web stops being archived? Our new FAQ explains: preserving the public record matters. 🌐📚 https://help.archive.org/help/faq-publishers-blocking-the-wayback-machine/
(DIR) Post #B6O84ItqOxEKhcbals by kkarhan@c.im
2026-05-17T12:31:54Z
0 likes, 0 repeats
@internetarchive such blockades of #InternetArchive should be outlawed!
(DIR) Post #B6ODtpyTgoXNp0AITY by moriel@chaosfem.tw
2026-05-17T13:04:46Z
0 likes, 0 repeats
@internetarchive As i understand it, mostly this is a byproduct of sites trying to block AI scrapers. Would it be possible for the archive to publish a list of the IP addresses used by its crawlers so that they can be explicitly whitelisted by sites that don't want to knock you out as collateral damage? Or is this already done, even? (If it is, i'd appreciate a link to the list so i can whitelist you on my sites, unimportant though they be.)
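A minimal sketch of the allowlisting idea raised in the post above, assuming a site operator has some published list of crawler address ranges to check against. The ranges here are reserved documentation networks (RFC 5737 and RFC 3849) used as placeholders, not addresses of any real crawler, and the names `ALLOWED_CRAWLER_NETWORKS` and `is_allowlisted` are hypothetical.

```python
import ipaddress

# Hypothetical allowlist. These are reserved documentation ranges that
# stand in for whatever ranges a crawler operator might publish.
ALLOWED_CRAWLER_NETWORKS = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("2001:db8::/32"),
]


def is_allowlisted(client_ip: str) -> bool:
    """Return True if client_ip falls inside any allowlisted network."""
    try:
        address = ipaddress.ip_address(client_ip)
    except ValueError:
        # Malformed addresses are never allowlisted.
        return False
    # An IPv4 address is never "in" an IPv6 network and vice versa,
    # so mixed-version comparisons simply evaluate to False.
    return any(address in network for network in ALLOWED_CRAWLER_NETWORKS)
```

A site could run a check like this ahead of its scraper-blocking rules, so that requests from listed ranges skip the block while everything else is still filtered.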
(DIR) Post #B6OTKq7Ykf1hPIkMFs by cobalt123@beige.party
2026-05-17T16:42:44Z
0 likes, 0 repeats
@internetarchive But these publishers and websites are not protecting the content, the content creators or, in the case of Reddit, the users of Reddit. They are protecting their own content so they can select which AI may have access, and whoever will pay them directly for content to get past their paywalls. The idea that it’s to protect users (as in humans) is just gaslighting. “[The] Internet Archive provides a service to the open web, but we’ve been made aware of instances where AI companies violate platform policies, including ours, and scrape data from the Wayback Machine,” a Reddit spokesperson told The Verge at the time. “Until they’re able to defend their site and comply with platform policies…we’re limiting some of their access to Reddit data to protect redditors.”