[HN Gopher] Hacker News Data Map [180MB]
       ___________________________________________________________________
        
       Hacker News Data Map [180MB]
        
       Author : mooreds
       Score  : 55 points
       Date   : 2024-11-03 20:45 UTC (2 hours ago)
        
 (HTM) web link (lmcinnes.github.io)
 (TXT) w3m dump (lmcinnes.github.io)
        
       | avandekleut wrote:
       | That's a neat visualization. It took about 35 seconds to load for
       | me, and the actual loading progress appeared to get stuck at 15%
       | for most of the time, which tempted me to close before it was
       | ready.
        
       | lucb1e wrote:
       | Maybe add [180MB] to the title, similar to how videos or pdfs are
       | tagged? It starts loading that immediately when you open the
       | page, which would be 18% of my data bundle if I had been on
       | mobile
       | 
       | (This is actually transferred bytes btw, based on seeing ~12MiB/s
       | for ~15 seconds in the system monitor)
       | 
       | Edit: some people are saying they can't view it, especially on
       | mobile browsers. Here are some screenshots:
       | 
       | - Landing overview https://snipboard.io/YTQRZc.jpg
       | 
       | - Zooming into the center, hovering over an item that is too
       | small to see but the title shows in a tooltip:
       | https://snipboard.io/xOvA47.jpg
       | 
       | - Zoomed in further still, now an individual item can be targeted
       | easily and there are lines delimiting topics (looking like height
       | lines on a map): https://snipboard.io/P6UVAv.jpg
       | 
       | - Hovering over the year selector on the bottom left, same zoom
       | position for comparison: https://snipboard.io/VDW2JI.jpg
       | 
       | Clicking the year seems not to do anything, you can't lock into
       | that view. Clicking a title opens the page, not the discussion
       | thread.
       | 
       | ---
       | 
       | Looking into the corresponding GitHub repository (I wonder if
       | they have a bandwidth limit for repositories or if it will foot
       | any bill), <https://github.com/lmcinnes/datamapplot_examples>,
       | there's also a visualization for Wikipedia which is a bit less
       | heavy:
       | https://lmcinnes.github.io/datamapplot_examples/Wikipedia_da...
       | (screenshot <https://snipboard.io/M9GRQt.jpg>)
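
The size estimate in the comment above can be checked with a little arithmetic. The sketch below assumes a steady transfer rate, and it infers a roughly 1 GB bundle from the "18%" remark, which the commenter never states outright. The function names are illustrative only.

```python
MIB = 1024 * 1024  # bytes in one mebibyte


def transferred_mib(rate_mib_per_s: float, seconds: float) -> float:
    """Total data moved at a steady rate, in MiB."""
    return rate_mib_per_s * seconds


def implied_bundle_mb(download_mb: float, fraction_of_bundle: float) -> float:
    """Bundle size for which download_mb makes up the given fraction."""
    return download_mb / fraction_of_bundle


# ~12 MiB/s for ~15 seconds, as observed in the system monitor
total_mib = transferred_mib(12, 15)

# The same amount expressed in decimal megabytes (slightly more than 180)
total_mb = total_mib * MIB / 1_000_000

# Assumption: "18% of my data bundle" refers to the 180MB figure,
# which would imply a bundle of about 1000 MB
bundle_mb = implied_bundle_mb(180, 0.18)

print(total_mib, round(total_mb, 1), round(bundle_mb))
```

Both readings land near the 180MB in the title; the small gap between MiB and MB does not change the conclusion.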
        
         | walterbell wrote:
         | 180MB download per HN visitor isn't going to be fun for the
         | server either.
         | 
         | More civilized would be a photo snapshot + optional link to
         | 180MB download for interactive UX.
        
           | lucb1e wrote:
           | A photo could work as a quick preview indeed. For large content
           | that spikes in popularity, another option may be something like
           | WebTorrent, or whatever PeerTube uses.
           | 
           | Or a vector map, loading data as needed for the region you're
           | zooming into
        
           | odo1242 wrote:
           | 180MB is probably fine for most servers (especially CDNs), to
           | be honest. My M1 MacBook with 16 gigabytes of RAM is
           | struggling to load/display the data though.
        
         | tomthe wrote:
         | I made a similar map but with tiles that only load if you zoom
         | in far enough: tomthe.github.io/hackmap/ (Sorry for posting my
         | link so often.) That way it has to load only a few megabytes for
         | the first view.
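
The tile idea raised in the replies above (load only the region you zoom into) can be sketched roughly as follows. This is not how tomthe's map or datamapplot is implemented. It assumes a quadtree-style scheme where zoom level z splits a unit square into 2^z by 2^z tiles; `visible_tiles`, `TileLoader`, and the `fetch` callback are hypothetical names.

```python
import math
from typing import Callable, Dict, Iterator, List, Tuple

Tile = Tuple[int, int, int]  # (zoom, column, row)


def visible_tiles(x0: float, y0: float, x1: float, y1: float,
                  zoom: int) -> Iterator[Tile]:
    """Yield the tiles that intersect a viewport given in [0, 1] coordinates."""
    n = 2 ** zoom  # tiles per axis at this zoom level

    def clamp(i: int) -> int:
        return max(0, min(n - 1, i))

    col_lo, col_hi = clamp(math.floor(x0 * n)), clamp(math.floor(x1 * n))
    row_lo, row_hi = clamp(math.floor(y0 * n)), clamp(math.floor(y1 * n))
    for row in range(row_lo, row_hi + 1):
        for col in range(col_lo, col_hi + 1):
            yield (zoom, col, row)


class TileLoader:
    """Fetch each tile at most once and serve repeats from a cache."""

    def __init__(self, fetch: Callable[[Tile], bytes]) -> None:
        self._fetch = fetch  # hypothetical callback, e.g. an HTTP request
        self._cache: Dict[Tile, bytes] = {}

    def load_view(self, x0: float, y0: float, x1: float, y1: float,
                  zoom: int) -> List[bytes]:
        loaded = []
        for tile in visible_tiles(x0, y0, x1, y1, zoom):
            if tile not in self._cache:
                self._cache[tile] = self._fetch(tile)
            loaded.append(self._cache[tile])
        return loaded
```

With a scheme like this, the first view needs a single zoom-0 tile, and deeper zoom levels only pull the tiles under the current viewport.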
        
         | mooreds wrote:
         | > Maybe add [180MB] to the title, similar to how videos or pdfs
         | are tagged
         | 
         | Done.
        
       | rolfan wrote:
       | This website crashed my smartphone xD.
       | 
       | After loading some sections of the map, my screen turned into
       | digital garbage.
        
       | xyst wrote:
       | Broken on mobile with Safari; I'll check it later today.
        
       | deskr wrote:
       | Doesn't work on Firefox - white screen instead of a map.
        
       | anonu wrote:
       | I like how Web Development and User Experience grouping is way
       | outside the central bubble.
       | 
       | Nonetheless, great visualization of a lot of data. I need to
       | learn more about this:
       | 
       | UMAP: https://umap-learn.readthedocs.io/en/latest/
       | 
       | Nomic-Embed: https://www.nomic.ai/blog/posts/nomic-embed-text-v1
       | 
       | The visual groupings aren't perfect. For example, there are
       | quite a few COVID-19 tagged articles before 2020.
        
         | Cupprum wrote:
         | Is that necessarily a bad thing? Can't some posts be relevant
         | even if they were created before covid?
        
       | andrewmcwatters wrote:
       | Maybe browsers should have resource limits and ask the user if
       | they want to continue loading the page beyond some sort of
       | threshold...
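
The resource-limit suggestion above could look something like the sketch below. It illustrates the idea only and does not describe how any browser behaves: `load_with_budget` and the `confirm` callback are hypothetical, and the page is modelled as a plain stream of byte chunks.

```python
from typing import Callable, Iterable, Iterator


def load_with_budget(chunks: Iterable[bytes],
                     budget_bytes: int,
                     confirm: Callable[[int], bool]) -> Iterator[bytes]:
    """Pass chunks through, pausing once to ask before exceeding the budget.

    confirm receives the number of bytes received so far and returns
    True to keep loading or False to stop.
    """
    received = 0
    asked = False
    for chunk in chunks:
        if not asked and received + len(chunk) > budget_bytes:
            asked = True  # ask only once per page load
            if not confirm(received):
                return  # user declined: stop before the budget is exceeded
        received += len(chunk)
        yield chunk
```

For a 180MB page and a budget of, say, 20MB, the user would be prompted early in the download instead of finding out from the system monitor.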
        
       | codingdave wrote:
       | It is a cool visualization, so I don't want to diminish the
       | effort to make it in any way. And as an experiment in
       | visualization, it is interesting. (If a bit large and laggy.) But
       | if the authors expect people to use it to navigate content, it
       | has a few problems:
       | 
       | 1) The topics don't seem to be hierarchical, so as I drill down
       | on one area, I get all kinds of things that don't seem related. I
       | have no idea what I'm missing unless I zoom into the whole thing.
       | 
       | 2) I don't know where my browser is going when I click a link.
       | That is a security problem.
       | 
       | 3) I cannot tell how this data is sourced. Are these all the
       | links posted to HN? Just the ones that got upvotes? Something
       | else? Because while we have some great links here, we also get a
       | lot of stinkers.
       | 
       | 4) Much of the value of HN is the discussions. I didn't see a way
       | to navigate to discussions related to any of the links.
        
       ___________________________________________________________________
       (page generated 2024-11-03 23:00 UTC)