[HN Gopher] New Gemini model significantly outperforms others on...
___________________________________________________________________
New Gemini model significantly outperforms others on Chatbot Arena
(LMSYS)
Author : zopper
Score : 33 points
Date : 2024-12-06 17:18 UTC (5 hours ago)
(HTM) web link (lmarena.ai)
(TXT) w3m dump (lmarena.ai)
| impulser_ wrote:
| Based on my testing, this model is significantly better than
| other Gemini models especially with programming/math related
| tasks. The current Gemini models are pretty useless for anything
| related to programming/math, but this experiment model puts
| Gemini ahead of GPT4o, and pretty close to Claude 3.5.
|
| The major problem with Claude 3.5 is you can't have conversation
| with a large amount of text because you will constantly hit rate
| limits and it's very annoying.
|
| This model with a 2 million context window is probably the best
| model right now for programming.
| leobg wrote:
| Google has one moat that is often being overlooked: Googlebot.
| They get to scrape content that is invisible to pretty much every
| other crawler, thanks to Cloudflare and paywalls.
| chenxi9649 wrote:
| I feel like it's at the point where I'm not too sure how these
| rankings impact the my choice of LLM. Every time a new model tops
| the charts, I'll try them for a bit and go back to
| claude-3.5-sonnet. Both for coding and day to day questions.
|
| I don't know if I'm just getting used to the claude style of
| response, or the orangy UI that I kind of find cozy, but I think
| we need better ways to convey the difference between models.
| ralfd wrote:
| What is the new Gemini model? 1.5-pro-002?
| og_kalu wrote:
| Gemini Experimental 1206. It's on aistudio
| alphabetting wrote:
| Here is link to this latest one:
| https://aistudio.google.com/app/prompts/new_chat?model=gemin...
|
| 1.5 Pro-002 came out a couple months ago.
___________________________________________________________________
(page generated 2024-12-06 23:01 UTC)