[HN Gopher] Deep dive into finding RSS feeds
       ___________________________________________________________________
        
       Deep dive into finding RSS feeds
        
       Author : domysee
       Score  : 49 points
       Date   : 2024-12-06 19:17 UTC (3 hours ago)
        
 (HTM) web link (lighthouseapp.io)
 (TXT) w3m dump (lighthouseapp.io)
        
       | superkuh wrote:
       | I generally try: /rss, /feed, /index.xml, /rss.xml, /feed.xml,
       | etc. And at various root or /directory/* locations.
       | https://blog.jim-nielsen.com/2021/feed-urls/ is a good article
       | with statistics on naming.
       | 
       | I've been adding to my feeds.opml since reddit started dying in
       | ~2015 and now I'm up to around ~1700 feeds and mostly independent
       | from aggregators; though I still collect new feeds from
       | HN/IRC/etc. Mostly I just always make a point to look for them
       | whenever I read something cool on the web.
        
       | ulrischa wrote:
       | Would be nice if it is implmented in freshrss
        
         | quaintdev wrote:
         | I think miniflux can do this. I just give it website address
         | and it almost always finds rss feeds
        
       | 1123581321 wrote:
       | It'd be neat for readers to seamlessly integrate with a scraper,
       | either self-hosted or commercial, if no feed is found. I believe
       | Inoreader allows scraping a few sites depending on the plan
       | level; most reader services don't.
        
       | HumblyTossed wrote:
       | Back when I was young, websites had this icon you could click
       | that would take you straight to their RSS feed. You young whipper
       | snappers have gone an fucked that up. Actually, I think it was
       | Google's fault. When they killed their RSS reader people
       | pronounced RSS dead so people just stopped publishing RSS feeds
       | or just didn't link to them.
       | 
       | * Yes, I know the article talks about the RSS icon, i'm just
       | soapboxing.
        
         | AndrewStephens wrote:
         | Even better, for a few months the browsers themselves would
         | highlight RSS feeds and allow you subscribe right in the
         | browser. It was too good to last.
         | 
         | RSS is great but it has one great flaw in that it doesn't scale
         | that well by itself. If 2 million people subscribe to your feed
         | and try to update it once an hour, that is 48 million requests
         | a day just for RSS.
         | 
         | What does work well (and how things have evolved) is to have a
         | service that polls RSS on behalf of its users. This was the
         | beauty of Google Reader but plenty of replacements exist.
        
           | rodary wrote:
           | > Even better, for a few months the browsers themselves would
           | highlight RSS feeds and allow you subscribe right in the
           | browser. It was too good to last.
           | 
           | Vivaldi browser still does that.
        
       | sodality2 wrote:
       | Tried out the feed finder on my blog again and I have another bug
       | to report - it seems the URLs on the page can cause a crash
       | within the web app! my blog (at matthew.science) uses Zola SSG,
       | and it seems the URLs are formatted with a preceding //: '<a
       | href="//matthew.science/posts/riscv/">Basics of the RISC-V
       | ISA</a>'
       | 
       | This causes the following error: TypeError: URL constructor:
       | //matthew.science/posts/riscv/ is not a valid URL.
        
         | jdougan wrote:
         | Theoretically, I think that should work. It (At least it used
         | to be) specified that //site/some/path should assume the uri
         | protocol of the current context. So if it was a link on an http
         | page, it should assume http, same with ftp and https etc. It
         | should work sorta like how a leading slash assumes the current
         | site context.
         | 
         | This was back before the Web became the one true way and is the
         | reason it uses 2 slashes, to distinguish protocol local from
         | site local.
        
       | camel-cdr wrote:
       | This is useful, I set up RSS on my website yesterday.
       | 
       | Turns out the feed finder couldn't find the feeds even though
       | I've linked to them using clickable RSS icons.
       | 
       | I didn't know about the autodiscovery feature so I'll add that
       | now.
        
       | zenlot wrote:
       | Came here through RSS link from miniflux, running on nvidia
       | jetson.
        
       ___________________________________________________________________
       (page generated 2024-12-06 23:00 UTC)