[HN Gopher] Hands-On Large Language Models
       ___________________________________________________________________
        
       Hands-On Large Language Models
        
       Author : teleforce
       Score  : 134 points
       Date   : 2025-04-19 01:52 UTC (21 hours ago)
        
 (HTM) web link (github.com)
 (TXT) w3m dump (github.com)
        
       | relyks wrote:
       | If someone's familiar with this, what would you say are the
       | prerequisites?
        
         | saqrais wrote:
         | This is taken from the book as it is:
         | 
          | Prerequisites
          | 
          | This book assumes that you have some experience programming
          | in Python and are familiar with the fundamentals of machine
          | learning. The focus will be on building a strong intuition
          | rather than deriving mathematical equations. As such,
          | illustrations combined with hands-on examples will drive the
          | examples and learning through this book. This book assumes no
          | prior knowledge of popular deep learning frameworks such as
          | PyTorch or TensorFlow nor any prior knowledge of generative
          | modeling. If you are not familiar with Python, a great place to
          | start is Learn Python, where you will find many tutorials on
          | the basics of the language. To further ease the learning
          | process, we made all the code available on Google Colab, a
          | platform where you can run all of the code without the need to
          | install anything locally.
        
       | samchon wrote:
       | I came in thinking it was a free ebook lol
        
         | triyambakam wrote:
          | Well it can be... <<Anna's Archive>>
        
       | d_tr wrote:
       | I guess it wouldn't sell shit if it used a language suitable for
       | this type of work.
        
         | d_tr wrote:
         | I mean, what happened to "use the right tool for the job"?
         | There is Rust, C++, Julia, D, and certainly many more. Are they
         | hard or what? Are they harder than mastering the math and
         | algorithms that are relevant to an LLM? The code is actually
         | pretty simple, certainly simpler than many "boring" apps.
        
           | antononcube wrote:
            | I assume you mean the book's code shown in the Jupyter
            | notebooks in the repository. (Which I think is both simple
            | and messy.)
        
           | simonw wrote:
           | Arguing that Rust, C++, Julia or D are a better "right tool
           | for the job" than Python for a book that teaches people about
           | LLMs is a bit of an eyebrow-raiser.
        
             | d_tr wrote:
             | How so? Since when is Python a good language for numerical
             | computation? What if the reader wants to try something that
             | cannot be achieved by plumbing canned C++? They are out of
             | luck I guess.
             | 
             | Good job teaching the sloppy semantics of a scripting
             | language for numerics I guess.
        
               | simonw wrote:
               | "Since when is Python a good language for numerical
               | computation?"
               | 
               | 30 years. Numeric came out in 1995, then evolved into
               | NumPy in 2005. https://en.m.wikipedia.org/wiki/NumPy
               | 
               | Almost every AI researcher and AI lab does most of their
               | research work in Python.
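
A minimal sketch of the claim above, assuming only NumPy is installed: the reduction below is dispatched to NumPy's compiled BLAS backend rather than executed element-by-element in the interpreter, which is why short Python code is adequate for numerical work.

```python
import numpy as np

# The dot product runs in NumPy's compiled backend, not in the
# Python interpreter one element at a time.
a = np.arange(1_000_000, dtype=np.float64)
b = np.ones_like(a)
print(a @ b)  # 499999500000.0, i.e. sum(0..999999), exact in float64
```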
        
               | d_tr wrote:
               | I know all of these facts. Doesn't mean it is how it is
               | for the right reasons, and even if it is, it does not
               | imply that it is a good way to teach.
        
               | simonw wrote:
               | Taking constant side-quests into Rust memory management
               | during a class on LLMs doesn't sound like a productive
               | way to teach to me.
        
               | sokoloff wrote:
               | It is possible that the vast majority of AI researchers
               | are flat-out incorrect and need to be shown a better
               | direction by you.
               | 
               | It is also possible that your own fitness-for-purpose
               | coefficients are tuned differently than the majority of
               | the field and they've made a sensible choice for their
               | situation.
               | 
               | I'd wager on the latter.
        
           | belter wrote:
            | Your comment shows such a fundamental misunderstanding of how
            | modern AI/LLM stacks work that it is hard to be kind and
            | thoughtful....
           | 
           | Python is simply the orchestration layer. The heavy lifting
           | is done by low-level libraries used in the repo, written in
           | C++, CUDA, and Rust (e.g., PyTorch's C++ backend, Flash
           | Attention's CUDA kernels, FAISS's SIMD optimizations, or
           | Hugging Face's Rust-powered tokenizers).
           | 
           | Python's role is to glue these high-performance components
           | together with minimal overhead, while providing
           | accessibility. Claiming it's "unsuitable" ignores the entire
           | stack beneath the syntax.
           | 
            | The critique is like blaming the steering wheel for a car's
            | engine performance.
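
A quick sketch of the split described above, using NumPy as a stand-in for the compiled layer. The exact timings are machine-dependent; the point is the gap between an interpreter-level loop and the same reduction handed to a compiled kernel.

```python
import time
import numpy as np

xs = np.random.rand(1_000_000)

# Pure Python: every addition goes through the interpreter loop.
t0 = time.perf_counter()
total = 0.0
for x in xs:
    total += x
t_python = time.perf_counter() - t0

# Same reduction dispatched to NumPy's compiled C kernel.
t0 = time.perf_counter()
total_np = xs.sum()
t_numpy = time.perf_counter() - t0

print(f"python loop: {t_python:.4f}s   numpy kernel: {t_numpy:.4f}s")
```

Both produce the same sum (up to floating-point rounding); only the layer doing the arithmetic differs, which is the sense in which Python is "glue".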
        
             | d_tr wrote:
             | Again, I am extremely well aware of all of this. You
             | assumed too much.
        
       | qwertox wrote:
       | Official code repo for the O'Reilly Book - "Hands-On Large
       | Language Models"
       | 
       | No text of the book in there.
        
       ___________________________________________________________________
       (page generated 2025-04-19 23:02 UTC)