[HN Gopher] New Gemini model significantly outperforms others on...
       ___________________________________________________________________
        
       New Gemini model significantly outperforms others on Chatbot Arena
       (LMSYS)
        
       Author : zopper
       Score  : 33 points
       Date   : 2024-12-06 17:18 UTC (5 hours ago)
        
 (HTM) web link (lmarena.ai)
 (TXT) w3m dump (lmarena.ai)
        
       | impulser_ wrote:
       | Based on my testing, this model is significantly better than
       | other Gemini models especially with programming/math related
       | tasks. The current Gemini models are pretty useless for anything
       | related to programming/math, but this experiment model puts
       | Gemini ahead of GPT4o, and pretty close to Claude 3.5.
       | 
       | The major problem with Claude 3.5 is you can't have conversation
       | with a large amount of text because you will constantly hit rate
       | limits and it's very annoying.
       | 
       | This model with a 2 million context window is probably the best
       | model right now for programming.
        
       | leobg wrote:
       | Google has one moat that is often being overlooked: Googlebot.
       | They get to scrape content that is invisible to pretty much every
       | other crawler, thanks to Cloudflare and paywalls.
        
       | chenxi9649 wrote:
       | I feel like it's at the point where I'm not too sure how these
       | rankings impact the my choice of LLM. Every time a new model tops
       | the charts, I'll try them for a bit and go back to
       | claude-3.5-sonnet. Both for coding and day to day questions.
       | 
       | I don't know if I'm just getting used to the claude style of
       | response, or the orangy UI that I kind of find cozy, but I think
       | we need better ways to convey the difference between models.
        
       | ralfd wrote:
       | What is the new Gemini model? 1.5-pro-002?
        
         | og_kalu wrote:
         | Gemini Experimental 1206. It's on aistudio
        
         | alphabetting wrote:
         | Here is link to this latest one:
         | https://aistudio.google.com/app/prompts/new_chat?model=gemin...
         | 
         | 1.5 Pro-002 came out a couple months ago.
        
       ___________________________________________________________________
       (page generated 2024-12-06 23:01 UTC)