https://github.com/bklieger-groq/g1

Skip to content

Navigation Menu

Toggle navigation
 
Sign in

  * Product
      +  
        Actions
        Automate any workflow
      +  
        Packages
        Host and manage packages
      +  
        Security
        Find and fix vulnerabilities
      +  
        Codespaces
        Instant dev environments
      +  
        GitHub Copilot
        Write better code with AI
      +  
        Code review
        Manage code changes
      +  
        Issues
        Plan and track work
      +  
        Discussions
        Collaborate outside of code
    Explore
      + All features
      + Documentation
      + GitHub Skills
      + Blog
  * Solutions
    By size
      + Enterprise
      + Teams
      + Startups
    By industry
      + Healthcare
      + Financial services
      + Manufacturing
    By use case
      + CI/CD & Automation
      + DevOps
      + DevSecOps
  * Resources
    Topics
      + AI
      + DevOps
      + Security
      + Software Development
      + View all
    Explore
      + Learning Pathways
      + White papers, Ebooks, Webinars
      + Customer Stories
      + Partners
  * Open Source
      +  
        GitHub Sponsors
        Fund open source developers
      +  
        The ReadME Project
        GitHub community articles
    Repositories
      + Topics
      + Trending
      + Collections
  * Enterprise
      +  
        Enterprise platform
        AI-powered developer platform
    Available add-ons
      +  
        Advanced Security
        Enterprise-grade security features
      +  
        GitHub Copilot
        Enterprise-grade AI features
      +  
        Premium Support
        Enterprise-grade 24/7 support
  * Pricing

Search or jump to...

Search code, repositories, users, issues, pull requests...

Search
[                    ]
Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

[                    ] [ ] Include my email address so I can be
contacted
Cancel Submit feedback

Saved searches

Use saved searches to filter your results more quickly

Name [                    ] 
Query [                    ]

To see all available qualifiers, see our documentation.

Cancel Create saved search
Sign in
Sign up Reseting focus
You signed in with another tab or window. Reload to refresh your
session. You signed out in another tab or window. Reload to refresh
your session. You switched accounts on another tab or window. Reload
to refresh your session. Dismiss alert
{{ message }}
bklieger-groq / g1 Public

  * Notifications You must be signed in to change notification
    settings
  * Fork 21
  * Star 235

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

License

MIT license
235 stars 21 forks Branches Tags Activity
Star
Notifications You must be signed in to change notification settings

  * Code
  * Issues 1
  * Pull requests 0
  * Actions
  * Projects 0
  * Security
  * Insights

Additional navigation options

  * Code
  * Issues
  * Pull requests
  * Actions
  * Projects
  * Security
  * Insights

bklieger-groq/g1

This commit does not belong to any branch on this repository, and may
belong to a fork outside of the repository.
 main
BranchesTags
  
Go to file
Code

Folders and files

      Name              Name          Last commit       Last commit
                                        message            date
Latest commit

 

History

8 Commits
 
examples          examples                             
.gitignore        .gitignore                           
LICENSE           LICENSE                              
README.md         README.md                            
app.py            app.py                               
example.env       example.env                          
requirements.txt  requirements.txt                     
View all files

Repository files navigation

  * README
  * MIT license

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

 
g1_demo.1.mp4

This is an early prototype of using prompting strategies to improve
the LLM's reasoning capabilities through o1-like reasoning chains.
This allows the LLM to "think" and solve logical problems that
usually otherwise stump leading models. Unlike o1, all the reasoning
tokens are shown, and the app uses an open source model.

g1 is experimental and being open sourced to help inspire the open
source community to develop new strategies to produce o1-like
reasoning. This is an experiment to show the power of prompting
reasoning in visualized steps, not a comparison to or full
replication of o1, which uses different techniques. Let's build!

Examples

 

Important

g1 is not perfect, but it can perform significantly better than LLMs
out-of-the-box. From initial testing, g1 accurately solves simple
logic problems 60-80% of the time that usually stump LLMs. However,
accuracy has yet to be formally evaluated. See examples below.

How many Rs are in strawberry?

 

Prompt: How many Rs are in strawberry?

Result:

Strawberry example

---------------------------------------------------------------------

Prompt: Which is larger, .9 or .11?

Result:

0.9 or 0.11 example

Quickstart

 

python3 -m venv venv

source venv/bin/activate

pip3 install -r requirements.txt

export GROQ_API_KEY=gsk...

streamlit run app.py

Prompting Strategy

 

The prompt is as follows:

You are an expert AI assistant that explains your reasoning step by step. For each step, provide a title that describes what you're doing in that step, along with the content. Decide if you need another step or if you're ready to give the final answer. Respond in JSON format with 'title', 'content', and 'next_action' (either 'continue' or 'final_answer') keys. USE AS MANY REASONING STEPS AS POSSIBLE. AT LEAST 3. BE AWARE OF YOUR LIMITATIONS AS AN LLM AND WHAT YOU CAN AND CANNOT DO. IN YOUR REASONING, INCLUDE EXPLORATION OF ALTERNATIVE ANSWERS. CONSIDER YOU MAY BE WRONG, AND IF YOU ARE WRONG IN YOUR REASONING, WHERE IT WOULD BE. FULLY TEST ALL OTHER POSSIBILITIES. YOU CAN BE WRONG. WHEN YOU SAY YOU ARE RE-EXAMINING, ACTUALLY RE-EXAMINE, AND USE ANOTHER APPROACH TO DO SO. DO NOT JUST SAY YOU ARE RE-EXAMINING. USE AT LEAST 3 METHODS TO DERIVE THE ANSWER. USE BEST PRACTICES.

Example of a valid JSON response:
json
{
    "title": "Identifying Key Information",
    "content": "To begin solving this problem, we need to carefully examine the given information and identify the crucial elements that will guide our solution process. This involves...",
    "next_action": "continue"
}

Breakdown

 

First, a persona is added:

    You are an expert AI assistant that explains your reasoning step
    by step.

Then, instructions to describe the expected step-by-step reasoning
process while titling each reasoning step. This includes the ability
for the LLM to decide if another reasoning step is needed or if the
final answer can be provided.

    For each step, provide a title that describes what you're doing
    in that step, along with the content. Decide if you need another
    step or if you're ready to give the final answer.

JSON formatting is introduced with an example provided later.

    Respond in JSON format with 'title', 'content', and 'next_action'
    (either 'continue' or 'final_answer') keys.

In all-caps to improve prompt compliance by emphesizing the
importance of the instruction, a set of tips and best practices are
included.

 1. Use as many reasoning steps as possible. At least 3. -> This
    ensures the LLM actually takes the time to think first, and
    results usually in about 5-10 steps.
 2. Be aware of your limitations as an llm and what you can and
    cannot do. -> This helps the LLM remember to use techniques which
    produce better results, like breaking "strawberry" down into
    individual letters before counting.
 3. Include exploration of alternative answers. Consider you may be
    wrong, and if you are wrong in your reasoning, where it would be.
    -> A large part of the gains seem to come from the LLM
    re-evaluating its initial response to ensure it logically aligns
    with the problem.
 4. When you say you are re-examining, actually re-examine, and use
    another approach to do so. Do not just say you are re-examining.
    -> This encourages the prevention of the LLM just saying it
    re-examined a problem without actually trying a new approach.
 5. Use at least 3 methods to derive the answer. -> This helps the
    LLM come to the right answer by trying multiple methods to derive
    it.
 6. Use best practices. -> This is as simple as the "Do better"
    prompts which improve LLM code output. By telling the LLM to use
    best practices, or do better, it generally performs better!

    USE AS MANY REASONING STEPS AS POSSIBLE. AT LEAST 3. BE AWARE OF
    YOUR LIMITATIONS AS AN LLM AND WHAT YOU CAN AND CANNOT DO. IN
    YOUR REASONING, INCLUDE EXPLORATION OF ALTERNATIVE ANSWERS.
    CONSIDER YOU MAY BE WRONG, AND IF YOU ARE WRONG IN YOUR
    REASONING, WHERE IT WOULD BE. FULLY TEST ALL OTHER POSSIBILITIES.
    YOU CAN BE WRONG. WHEN YOU SAY YOU ARE RE-EXAMINING, ACTUALLY
    RE-EXAMINE, AND USE ANOTHER APPROACH TO DO SO. DO NOT JUST SAY
    YOU ARE RE-EXAMINING. USE AT LEAST 3 METHODS TO DERIVE THE
    ANSWER. USE BEST PRACTICES.

Credits

 

This app was developed by Benjamin Klieger.

About

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Resources

Readme

License

MIT license
Activity

Stars

235 stars

Watchers

3 watching

Forks

21 forks
Report repository

Releases

No releases published

Packages 0

No packages published

Languages

  * Python 100.0%

Footer

 (c) 2024 GitHub, Inc.

Footer navigation

  * Terms
  * Privacy
  * Security
  * Status
  * Docs
  * Contact
  * Manage cookies
  * Do not share my personal information

You can't perform that action at this time.