[HN Gopher] Show HN: Hacker News user blogroll
___________________________________________________________________
Show HN: Hacker News user blogroll
I saw this [0] pretty cool thread by user revskill, and wanted a
quicker way to search through it, but also to keep them all in one
place so I can read them at my leisure whenever I get time. Right
now is like 60 lines of Ruby using Nokogiri, but I will certainly
look into it further down the line and improve the list. There's a
cronjob checking the thread every 12 hours but I will eventually
shut that down and it will become static after that. There are
some really awesome blogs in there. I really recommend going
through the list, it made my day. [0] "Could you share your
personal blog here". https://news.ycombinator.com/item?id=36575081
Author : deathbypenguin
Score : 356 points
Date : 2023-07-05 19:06 UTC (3 hours ago)
(HTM) web link (dm.hn)
(TXT) w3m dump (dm.hn)
| LordDragonfang wrote:
| Since hn karma probably correlates to how much hn readers would
| enjoy a blog, I'd love a column/columns that includes the user's:
| - hn profile karma - total karma of posts from that domain
| - as above, but Sum(log(post_karma[i]))
|
| ...or something similar.
|
| Whatever is feasible. For a while I've wanted a list of
| "blogs/domains that hn likes" that isn't polluted by general-
| high-traffic domains.
| ploum wrote:
| That's an awesome idea, I would be really curious to see the
| result.
|
| (hoping that this does not backfire, for example encouraging
| people to spam HN with their own posts to gain some karma on
| the blogroll)
| LordDragonfang wrote:
| To address the exact situation in your parenthetical, I
| considered putting Sum(log(post_karma[i]-k)), for a k such
| that the expected log value is negative unless you get enough
| upvotes.
| joseferben wrote:
| Thanks for putting this together, love the name!
| WoodenChair wrote:
| Unfortunately I made a typo in writing my URL (should be
| https://observationalhazard.com/ not
| https://obervationalhazard.com/) and this has no way to update
| it.
| jefftk wrote:
| Neat! It looks like something is broken with unicode handling?
| For example the "smart" apostrophe in
| https://news.ycombinator.com/item?id=36594375 (U+2019, RIGHT
| SINGLE QUOTATION MARK) is being rendered as "a". Perhaps
| something is interpreiting utf-8 as latin1?
| deathbypenguin wrote:
| Indeed. I'm having a fight with that at the moment and the line
| breaks as well.
| oneeyedpigeon wrote:
| It's great. Is there any real point in sorting on description or
| url? I guess url does group http and https, which might be
| useful, but description definitely seems like it would be nicer
| if the sort option were removed.
| b8 wrote:
| Hmm, my blog wasn't added. Maybe when the data was scraped I
| hadn't posted it yet?
| deathbypenguin wrote:
| It will be picked up in a few hours.
| voigt wrote:
| I'm missing the good old time of webrings. This is very close :)
| PartiallyTyped wrote:
| You may be interested in [1], it's run by a few friends of
| mine. Hopefully it won't get hugged to death.
|
| [1] https://32bit.cafe
| jakebasile wrote:
| Look ma, I'm in an HN link! This is pretty neat.
| Moncefmd wrote:
| This is great! Would've been cool to also be able to sort by
| votes though.
| xwdv wrote:
| I'm going to train an LLM on all these blog posts, make a true HN
| AI.
| deathbypenguin wrote:
| Added feeds thanks to JSTucker. They are being fetched from the
| Gist. I think the cron ran, so there are more blogs now.
|
| json: https://dm.hn/blogroll.json (I'll add the feed to each item
| in a minute)
| astuyvenberg wrote:
| Oh neat - I saw mine on the list earlier but is now removed,
| maybe some kind of rate limiting? Anyway, neat project!
| levysoft wrote:
| I can't believe you had the same idea and necessity but you
| preceded me. Good job!
| zdwolfe wrote:
| Looks cool, thanks for making this.
| ghomem wrote:
| Clap clap clap. This is excellent public service @deathbypenguim.
| Yesterday I was scrolling through that enormous thread and using
| control+F to look for keywords of interest on the posted blog
| descriptions. Now it will be much easier to follow fellow
| bloggers. Thanks for having my blog on your list too.
| AndrewKemendo wrote:
| We reinvented web rings!
| I-M-S wrote:
| Can we do this for HN users' podcasts?
| swyx wrote:
| shameless plug for my pod: https://www.latent.space/podcast
| 1attice wrote:
| Naked self-promotion here, but I was late to the party on the
| original blogroll -- is there any way to add blogs post-ex-facto?
| Is there a submission mechanism?
| TOGoS wrote:
| I commented, but mine got missed, somehow. Maybe because my
| phone auto-capitalized the "H" in "http" and the script didn't
| account for funny capitalization. Sad!
| 1attice wrote:
| I did too, but I didn't include the protocol, just the bare
| domain :/
|
| FWIW it's https://lizmars.net
| eigenhombre wrote:
| Same, I think this is a great idea and would like to submit
| mine as well -- maybe an "add" feature on the page would make
| sense, or re-inquire here at intervals, maybe monthly/yearly?
| petercammeraat wrote:
| Brilliant. Easy to use as filter for subjects (if people
| described their blog)
| do-me wrote:
| That's awesome and so much more practical than scrolling through
| HN. It would also be possible to integrate semantic search so
| people don't necessarily need to know the keywords. If you're
| interested, feel free to ping me or take a look at
| https://github.com/do-me/SemanticFinder. In case I could just
| create a pre-indexed version based on your data dump which would
| be quite convenient to use.
| tenkabuto wrote:
| These threads make me wish that I had a blog, not just a regular
| website. :(
| nelsonfigueroa wrote:
| I took at look at your website and it seems like you could
| easily add blog posts to it!
| tenkabuto wrote:
| Hah, thanks. I've been hoping to do so, but still haven't
| gotten around to it. There's some quirks with the static site
| generator that I use[0] that lead me to keep postponing
| setting up blog-ish features, and I don't know enough python
| to fix them.
|
| [0]: https://github.com/gordonbrander/lettersmith_py
| nelsonfigueroa wrote:
| If you ever want to try a new static site generator, I use
| Hugo[0] to generate my site. There's a lot of pre-built
| themes[1] you can use. Most (if not all) have blogging
| functionality built in, all you need to do is drop in a
| Markdown file with your content. You may need to learn a
| little bit if Golang if you want to customize themes. Just
| throwing it out there as an option.
|
| [0]: https://gohugo.io/
|
| [1]: https://themes.gohugo.io/
| tenkabuto wrote:
| Thanks! In writing out my reply to you I realized that I
| should look into other generators (specifically looking
| into Hugo, as I think I've seen it used by people like
| myself who take notes in Obsidian). The key features I
| want are backlinks support and blogging features, along
| with Markdown support.
| victorbjorklund wrote:
| Would be cool to make an RSS feed that combined all the RSS feeds
| from all the blogs.
| amadeuspagel wrote:
| Here's an HTML feed: https://webloglist.com/hn
| samsquire wrote:
| Thanks for creating this. I will use this to go through people's
| blogs.
|
| I think my blog/journals hasn't been picked up yet
|
| https://news.ycombinator.com/item?id=36588927
|
| My blog is on GitHub, how do you parse the URLs?
| deathbypenguin wrote:
| It should show in a few hours when the cronjob triggers.
| Parsing: nokogiri
| ghoomketu wrote:
| Looks great and congrats on shipping. If it were up to me I'd
| still be deliberating the best framework and design to use for
| this, and how I can pipe the comments through chatgpt to extract
| the category, keywords and do things that make it the best
| blogroll ever.
|
| And then I would have just thought it's too much work for nothing
| and that'd be the end of it :P
| myth2018 wrote:
| Dude I can definitely relate
| deathbypenguin wrote:
| haha I had the exact same ideas, but then I was "bah! I'll put
| it out there and I'll add functionality over time"
| swyx wrote:
| did you just have the .hn TLD standing by? where is that
| from? must have cost a pretty penny
| smokel wrote:
| Great work, thank you for sharing this.
|
| I would prefer to see the entire list, so that I can easily
| search for keywords in the browser. Apparently, all data is
| available on the client side, but the table renderer seems to
| limit the table size to at most 100 entries.
| ryan-duve wrote:
| A workaround while you're waiting for this to be supported by
| OP is to go to inspector and change the last dropdown option to
| <option value="10000">10000</option>
|
| then select it in the UI.
| Aissen wrote:
| It's very nice, thanks ! It would be nice if descriptions had new
| lines; some aren't readable, while they work quite well on HN.
| deathbypenguin wrote:
| I'm on it.
| scarface_74 wrote:
| This is awesome
| abathur wrote:
| Hmm. Any idea why some wouldn't show up? I posted in
| https://news.ycombinator.com/item?id=36588940 but don't see it in
| the list.
| toyg wrote:
| Same for me. Maybe the scraper choked on pagination, maybe they
| just took a snapshot before we posted.
| JSTucker wrote:
| Heres an OPML with all the feeds I could detect from the list!
| https://gist.github.com/Josh-Tucker/030b8cba6557927a27f1c7e6...
| swyx wrote:
| if you share the code for OPML conversion maybe OP could
| incorporate it quickly
| re wrote:
| This made me think of "planets", which I feel had a heyday back
| in the late 2000s before Reddit and social media took over
| everything. Anyone want to take all the blogs with RSS/Atom feeds
| and build an HN planet? :)
|
| > In online media a planet is a feed aggregator application
| designed to collect posts from the weblogs of members of an
| internet community and display them on a single page
|
| https://en.wikipedia.org/wiki/Planet_(software)
| soegaard wrote:
| https://planet.scheme.org/
| ploum wrote:
| Yeah, planet were awesome. I'm proud to say that my blog was
| both on planet.gnome (the original one) and planet.ubuntu.
|
| Now, I feed that the most interesting planet is planet.debian,
| which offers lot of variety without being focused on Debian.
|
| The great feature I liked was that Planet were _not_ about a
| given project. It was about the people contributing to the
| project. Their life. Their interests.
|
| At some point, lot of planets started to ask only "on-topic"
| posts with a specific RSS feeds. Those planets became boring as
| it was mainly stuffs you could find on forum or any tech
| related websites.
| tenkabuto wrote:
| Yes! I've loved Planet Python[0] because it really lets you
| see that the Python community is quite varied, fun, and
| human.
|
| [0]: https://planetpython.org
| susam wrote:
| I still follow a few planets. For example:
|
| https://planet.lisp.org/
|
| https://planet.emacslife.com/
| amadeuspagel wrote:
| Here you go: https://webloglist.com/hn
| susam wrote:
| Very interesting! Thanks for sharing your project here. Out of
| curiosity, I did some searches with some interesting strings. At
| the time of posting this comment, here is what the search results
| look like:
|
| Vim: 8 entries
|
| Emacs: 7 entries
|
| Python: 24 entries
|
| Rust: 24 entries
|
| Lisp: 5 entries
|
| Clojure: 3 entries
|
| Haskell: 5 entries
|
| Zig: 5 entries
|
| Elixir: 4 entries
|
| Scheme: 0 entries
|
| Postgres: 4 entries
|
| MySQL: 0 entries
|
| SQLite: 3 entries
|
| Jekyll: 9 entries
|
| HTML: 40 entries
|
| Markdown: 6 entries
|
| LaTeX: 1 entry
|
| Hugo: 12 entries
|
| Next.js / Nextjs: 4 entries
|
| Gatsby: 2 entries
|
| Pelican: 0 entries
|
| .com: 495 entries
|
| .dev: 90 entries
|
| .net: 84 entries
|
| .io: 82 entries
|
| .me: 53 entries
|
| .org: 43 entries
|
| .xyz: 15 entries
|
| .page: 6 entries
|
| github.io: 46 entries
|
| medium.com: 18 entries
|
| blogspot.com: 8 entries
|
| wordpress.com: 4 entries
|
| livejournal.com: 0 entries
|
| tech: 178 entries
|
| programming: 66 entries
|
| random: 61 entries
|
| thought: 49 entries
|
| math: 16 entries
|
| musing: 12 entries
|
| blag: 1 entry
|
| favorite: 28 entries
|
| favourite: 9 entries
|
| Now all of these results are string search results, so there is
| always going to be a little bit of noise when we try to draw
| conclusions out of these results. For example, the results for
| ".dev" also contains results that look like "*dev*.com".
|
| Despite the noise, I found these results interesting. I remember
| in the early days when the blogosphere was being constructed 20
| km above the tag clouds, it was very fashionable to have blogs
| for random musings or random thoughts. So I am delighted to see
| that most blogs out here are tech blogs. Surprisingly there is
| only blag. I expected at least a few more.
|
| One of the Lisp entries is mine. Also, one of the Vim entries is
| mine. It is a bit ironical because I am actually an Emacs user.
| If I had known the comments we write on HN would become part of
| the search string in this blogroll, I might have chosen my words
| in my comment to the "Ask HN" port more judiciously! :)
| boricj wrote:
| reverse engineering: 5 entries
|
| Ghidra: 1 entry (mine)
|
| On one hand it does bring some level of perspective on the
| popularity of a particular topic you're into. My first reaction
| was "Just 0.5% for reverse-engineering? I guess I'm down in a
| deep dark rabbit hole..."
|
| On the other hand, I haven't seen the blogs of Ken Shirriff,
| Alex Ionescu or Raymond Chen on that list, which I know are
| quite popular and regularly make it to the Hacker News front
| page.
| cavalcade119 wrote:
| [dead]
| kiruio wrote:
| Cool, I forgot to add descriptions. Would be nice if I could fix
| it
| version_five wrote:
| Can you say what criteria you used to filter the thread into
| valid blogs?
| hyperific wrote:
| Thanks for doing this!
| verse wrote:
| Love this, thanks for building it!
| amadeuspagel wrote:
| The latest posts from these blogs: https://webloglist.com/hn
| jakebasile wrote:
| That's cool! Did you pull RSS from all the sites you could and
| use that to aggregate it?
| amadeuspagel wrote:
| Yes, webloglist uses RSS autodiscovery.
| darekkay wrote:
| It seems the autodiscovery didn't work for my blog (link in
| profile). I've posted something 2 days ago but it doesn't
| appear on your site. My feed is on the list from JSTucker,
| who also used some sort of autodiscovery.
| amadeuspagel wrote:
| Atom isn't supported yet. Working on it.
| rambambram wrote:
| Nice list! I was almost going to ask you if you have an OPML
| file with all the feeds, but then I decided to check the list
| manually for interesting latest posts and grab only their
| feeds. Thanks for the list!
| addandsubtract wrote:
| Now we just need ChatGPT to read them all and give us a daily
| update on the interesting ones.
| skilled wrote:
| Good job. I would honestly love this but with RSS feeds also, but
| I know it's a tough ask unfortunately. (Not for you, but in
| general)
| xoranth wrote:
| Most blogs that have RSS also have a `<link rel="alternate"
| type="application/rss+xml">` tag that redirects you to the RSS
| feed. If you pass the link to the homepage to a feed
| reader[^0], it will follow the link tag and find the RSS feed.
|
| [^0]: At least, Liferea on Linux, NetNewsWire and Vienna on
| Mac, do this. AFAIR NetNewsWire is even smarter than that, and
| can sometimes find the RSS feed even when there is no link tag.
| marginalia_nu wrote:
| A bit of a snag is that many CMSes generate multiple feeds,
| and there is no way I'm aware of for identifying which is the
| "canonical" feed.
| 1270018080 wrote:
| Conspiracy: That post was only made to harvest data for someone's
| model
| bachmeier wrote:
| Well, given that blogs are public and the whole point is for
| others to read them, I think that's okay.
| leejoramo wrote:
| This is great. An OPML version of this would be great to bulk
| IMPORT the RSS/ATOM feeds into your favorite feed reading app.
| JSTucker wrote:
| I've scraped what I could from the list and exported and opml
| here: https://gist.github.com/Josh-
| Tucker/030b8cba6557927a27f1c7e6...
| tommy_axle wrote:
| A feed is now added to https://codeinsider.dev
| (https://codeinsider.dev/rss.xml)
| JSTucker wrote:
| Have added your feed to the list
| [deleted]
| deathbypenguin wrote:
| Thanks man, it's been added: https://dm.hn/blogroll.json and
| next to each entry.
| rambambram wrote:
| Indeed! But I guess not every blog has a feed, or there's no
| quick way of letting people add one to the list after the fact.
| prepend wrote:
| I expect that every blog has an rss or atom feed. It would be
| strange for someone to go to the effort of writing a blog and
| not setting up a feed. That and most blogs have feeds
| automatically.
|
| That being said, any blog that doesn't have a feed and has
| some proprietary subscription is not one I want to subscribe
| to. So not including feedless blogs is a positive for me.
| TimCTRL wrote:
| Saw the .hn domain and I was like What...HN has its own TLD. Then
| i searched google and saw it belongs to honduras...daft me i
| guess..
| mcmcmc wrote:
| All two-letter TLDs are country codes.
| TimCTRL wrote:
| Thanks!
| akiselev wrote:
| Our future AI overlords sincerely thank you for this pristine
| data set.
| alfiedotwtf wrote:
| Weird. I added to that original post, but I'm not on your list.
| Maybe your code didn't go to the "See more comments" page?
| syx wrote:
| I would add a shuffle button that opens a random blog so it's
| nicer to discover something new compared to endless paginations.
| deathbypenguin wrote:
| Noted. I will be correcting a few bits and adding new
| functionality over the next few days/weekend.
| scastiel wrote:
| +1
|
| I would even add a "I'm feeling lucky" button, to redirect to a
| random blog ;)
| splitbrain wrote:
| That's what https://indieblog.page was made for
| deathbypenguin wrote:
| Random blog button up now.
| alonsonic wrote:
| Love this, have been reading random blogs for the last
| 30minuts already
| cavalcade119 wrote:
| [dead]
___________________________________________________________________
(page generated 2023-07-05 23:00 UTC)