[HN Gopher] General Reasoning: Free, open resource for building ...
       ___________________________________________________________________
        
       General Reasoning: Free, open resource for building large reasoning
       models
        
       Author : rglover
       Score  : 38 points
       Date   : 2025-02-21 19:01 UTC (3 hours ago)
        
 (HTM) web link (gr.inc)
 (TXT) w3m dump (gr.inc)
        
       | carimura wrote:
       | nginx not happy.
        
         | rosstaylor90 wrote:
         | Happier now, upgraded the backend :) (co-creator here)
        
       | emorning3 wrote:
       | LLMs cannot reason, they can only say things that sound
       | reasonable, there's a difference. Duh.
        
         | rosstaylor90 wrote:
         | What's your AIME 2025 score? https://gr.inc/RJT1990/AIME2025/
        
           | nyrikki wrote:
           | The is the point of the AIME, it is a 3 hour closed book
           | examination in which each answer is an integer number from 0
           | to 999 and should only depend on pre-calc...for a human with
           | no calculator, notes, or internet access.
           | 
           | The concepts are _heavily_ covered in the training corpus,
           | and if people were allowed to take it more than once, with
           | even a book let alone access to the internet it wouldn 't be
           | very hard.
           | 
           | Examples:
           | 
           | 1) Find the sum of all integer bases $b>9$ for which $17_b$
           | is a divisor of $97_b.$
           | 
           | In the corpus: https://www.quora.com/In-what-bases-b-
           | does-b-7-divide-into-9...
           | 
           | And one more:
           | 
           | 3) https://artofproblemsolving.com/wiki/index.php/2025_AIME_I
           | _P...
           | 
           | Is just the the number of ways to distribute k
           | indistinguishable balls (players) into n distinguishable
           | boxes (flavors, without exclusion, in such a way that no box
           | is empty.
           | 
           | Thus in the corpus for any courses that need to cover
           | combinatorial problems including physics, discreet math,
           | logistics etc...
           | 
           | IMHO these concept classes from a typical AIME are so common,
           | the scores you gave demonstrate that those models are doing
           | no "general reasoning" at all and are actually failing at
           | approximate retrieval.
        
         | nh23423fefe wrote:
         | jokes on you, they can't even speak. so obviously your sentence
         | is meaningless. arguing about definitions is very fruitful!
        
         | perching_aix wrote:
         | emorning3 cannot reason, he can only say things that sound
         | reasonable, there's a difference. Duh.
         | 
         | good luck. as a reminder, there are people who, with varying
         | degrees of certainty, think their loved ones have been replaced
         | by actors, as well as people who think they're actually the god
         | of the world around them, for it is just their imagination.
        
       ___________________________________________________________________
       (page generated 2025-02-21 23:00 UTC)