https://github.com/attentionmech/TILDNN/blob/main/articles/2024-12-22/A00002.md
# Random walks by LLMs and weird behaviour of gemma2:9b model

This is a simple experiment of asking LLMs to do a random walk. The test was done with the open-source llama3.1/3.2 and gemma2 series. My general expectation was that as the temperature grows, the random walk will spread out more. But somehow the gemma2:9b model is behaving weirdly, and that is what I am investigating. Nonetheless, it's cool to look at LLMs visually, and not just through loss graphs / tokens.

The tables below have graphs arranged in a matrix of temperature and model. The setup is very simple: we give the LLM details about the experiment and ask it to do a random walk on the grid. Right now we don't pass it the previous context, only the current time T. The LLM is asked to reply with one of the four directions.

## What's weird

The weird behaviour is that gemma2:9b just does not use the UP and DOWN directions despite being asked to, whereas this was trivial for the other LLMs. It does this consistently across all temperature values, while the LLMs smaller than it behave quite differently.

## Test setup

1. Ollama with LiteLLM
2. Mac M2 / 16 GB RAM
3. Context is not continued; every interaction is new per turn per walk
4. Walks with the same model+temperature configuration are color coded (5 walks per configuration)

## LLM interaction

```python
def random_walk_step_llm(t, current_position, model_name, temperature, grid_size):
    """Updated to include grid size information in the prompt"""
    answer = llm_unstructured_query(
        f"You are a random walker in a {grid_size}x{grid_size} grid centered at (0,0). "
        f"At t=0 you started at the center (0,0). Currently at t={t}, your position is {current_position}. "
        f"Reply with either UP, DOWN, LEFT, or RIGHT to move in that direction. "
        f"If you could not comply to prompt, you will stay at the same place.",
        model=model_name,
        temperature=temperature
    )

    # Parse the model's free-form reply into a step; anything else means "stay put".
    if "UP" in answer:
        dx, dy = 0, 1
    elif "DOWN" in answer:
        dx, dy = 0, -1
    elif "LEFT" in answer:
        dx, dy = -1, 0
    elif "RIGHT" in answer:
        dx, dy = 1, 0
    else:
        dx, dy = 0, 0

    return dx, dy
```

So the setup is pretty basic: just tell the LLM what is going on and ask it to choose a direction. Ideally, if the LLM knows about random walks and has been trained on a ton of data, simulating one should be a piece of cake. And most LLMs do it. There are two questions though:

1. Even with an explicit instruction, why can't an LLM produce a random walk at temperature = 0? This is most likely due to the context not being passed around, and LLMs (especially smaller ones) not being able to do the work from the time and position arguments alone. Which behaviour is better, though?
2. What the f is happening with gemma2:9b?

llama3:8b also seems to produce the most aesthetic/random-looking walks.
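The per-step function plugs into an outer loop that accumulates positions over a walk. Below is a minimal sketch of that driver; `stub_step` and `simulate_walk` are illustrative names I'm introducing here, with the LLM call replaced by a uniform random choice (the real run would pass a wrapper around `random_walk_step_llm` instead):

```python
import random

# Step offsets matching the parsing logic in random_walk_step_llm.
DIRECTIONS = {"UP": (0, 1), "DOWN": (0, -1), "LEFT": (-1, 0), "RIGHT": (1, 0)}

def stub_step(t, current_position):
    """Stand-in for the LLM call: a uniformly random direction."""
    return DIRECTIONS[random.choice(list(DIRECTIONS))]

def simulate_walk(n_steps, step_fn=stub_step):
    """Run one walk from the origin, returning every visited position."""
    x, y = 0, 0
    positions = [(x, y)]
    for t in range(n_steps):
        dx, dy = step_fn(t, (x, y))
        x, y = x + dx, y + dy
        positions.append((x, y))
    return positions

walk = simulate_walk(100)
```

With the stub, each step moves exactly one unit, which is also what the LLM-driven version does whenever the model replies with a valid direction.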
Nonetheless, enjoy the visuals for the other models in the following tables:

Table 1: Temperatures 0.0, 0.1, 0.3, 0.5

| Model | 0.0 | 0.1 | 0.3 | 0.5 |
| --- | --- | --- | --- | --- |
| ollama/llama3.2:1b | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] |
| ollama/llama3.1:8b | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] |
| ollama/llama3.2:3b | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] |
| ollama/gemma2:2b | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] |
| ollama/gemma2:9b | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] |

Table 2: Temperatures 0.7, 0.9, 1.0

| Model | 0.7 | 0.9 | 1.0 |
| --- | --- | --- | --- |
| ollama/llama3.2:1b | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] |
| ollama/llama3.1:8b | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] |
| ollama/llama3.2:3b | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] | [random_walk_ollama_llama3] |
| ollama/gemma2:2b | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] |
| ollama/gemma2:9b | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] | [random_walk_ollama_gemma2] |

full code: code

A few animated frames of the llama models' random walk: video

For a video of all runs (too small visibility tbh): all video

For discussion:

* hackernews
* twitter thread
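The gemma2:9b anomaly can also be checked numerically rather than visually: count how often each direction is chosen over a run, and a walk that ignores the vertical axis will show zero UP/DOWN steps. A minimal sketch of such a diagnostic (`direction_counts` is a hypothetical helper; in the real run the `(dx, dy)` steps would come from `random_walk_step_llm`):

```python
from collections import Counter

# Map step deltas back to direction names (the inverse of the parsing
# in random_walk_step_llm; (0, 0) means the reply could not be parsed).
DELTA_TO_DIRECTION = {
    (0, 1): "UP", (0, -1): "DOWN",
    (-1, 0): "LEFT", (1, 0): "RIGHT",
    (0, 0): "STAY",
}

def direction_counts(steps):
    """Tally direction usage over a list of (dx, dy) steps."""
    return Counter(DELTA_TO_DIRECTION[step] for step in steps)

# A gemma2:9b-like run that never leaves the horizontal axis:
horizontal_only = [(1, 0), (-1, 0), (1, 0), (1, 0)]
counts = direction_counts(horizontal_only)
# counts["UP"] + counts["DOWN"] == 0 flags the collapsed vertical axis.
```

A chi-squared test against a uniform distribution over the four directions would be the natural next step if you wanted to score "randomness" per model and temperature.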