[HN Gopher] DeepDive in everything of Llama3: revealing detailed...
       ___________________________________________________________________
        
       DeepDive in everything of Llama3: revealing detailed insights and
       implementation
        
       Author : therealoliver
       Score  : 88 points
       Date   : 2025-02-21 16:57 UTC (6 hours ago)
        
        
       | kevmo314 wrote:
       | I like the use of the functional API here. I learned through a
       | similar route and it was very helpful for me compared to trying
       | to understand `torch.nn.Module`.
       | 
       | Here's a gist of my learning path if it's helpful to anyone:
       | https://gist.github.com/kevmo314/294001659324429bae6749062a9...
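
For readers unfamiliar with the distinction in the comment above, here is a minimal sketch of the same linear layer written both ways. It assumes "functional API" refers to calling `torch.nn.functional` (or plain tensor ops) on explicit weight tensors rather than wrapping them in a `torch.nn.Module`. The shapes and variable names are made up for illustration and are not taken from the linked repository or gist.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
x = torch.randn(2, 4)  # batch of 2 vectors with 4 features (arbitrary sizes)

# Module style: the layer object owns its weight and bias.
layer = nn.Linear(4, 3)
y_module = layer(x)

# Functional style: weights are ordinary tensors passed in explicitly.
weight = layer.weight.detach()  # shape (3, 4)
bias = layer.bias.detach()      # shape (3,)
y_functional = F.linear(x, weight, bias)

# The same computation spelled out as a matrix multiply.
y_manual = x @ weight.T + bias
```

All three results agree, which is the appeal when learning: every tensor that takes part in the computation is visible at the call site.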
        
       | simonw wrote:
       | I hadn't realized OpenAI's tiktoken Python library could work
       | with other models outside of the OpenAI family, that's really
       | useful: https://github.com/therealoliver/Deepdive-llama3-from-scratc...
        
       | aghilmort wrote:
       | great need; mulling over; shows up all the time in AI paradigms
        
       | FreebasingLLMs wrote:
       | Nice info, however the incessant anime shit is retarded.
        
         | jawr wrote:
         | If you've got nothing constructive to say... don't say
         | anything? OP brings a lot of value in a style they like, your
         | comment brings absolutely nothing.
        
       ___________________________________________________________________
       (page generated 2025-02-21 23:00 UTC)