[HN Gopher] DeepDive in everything of Llama3: revealing detailed...
___________________________________________________________________
DeepDive in everything of Llama3: revealing detailed insights and
implementation
Author : therealoliver
Score : 88 points
Date : 2025-02-21 16:57 UTC (6 hours ago)
(HTM) web link (github.com)
| kevmo314 wrote:
| I like the use of the functional API here. I learned through a
| similar route and it was very helpful for me compared to trying
| to understand `torch.nn.Module`.
|
| Here's a gist of my learning path if it's helpful to anyone:
| https://gist.github.com/kevmo314/294001659324429bae6749062a9...
| simonw wrote:
| I hadn't realized OpenAI's tiktoken Python library could work
| with models outside the OpenAI family. That's really useful:
| https://github.com/therealoliver/Deepdive-llama3-from-scratc...
| aghilmort wrote:
| great need; mulling over; shows up all the time in AI paradigms
| FreebasingLLMs wrote:
| Nice info, but the incessant anime imagery is really off-putting.
| jawr wrote:
| If you've got nothing constructive to say... don't say
| anything? OP brings a lot of value in a style they like; your
| comment brings absolutely nothing.
___________________________________________________________________
(page generated 2025-02-21 23:00 UTC)