[HN Gopher] GPT-5.1 for Developers
___________________________________________________________________
GPT-5.1 for Developers
Author : tedsanders
Score : 45 points
Date : 2025-11-13 18:46 UTC (4 hours ago)
(HTM) web link (openai.com)
(TXT) w3m dump (openai.com)
| felixbraun wrote:
| Already live in Cursor btw
| kevinkatzke wrote:
| This got only a single comment and 34 points in 3 hours. Crazy
| how the dynamics have changed around model releases in just a
| single year.
| throwup238 wrote:
| There was already an announcement post for 5.1 yesterday:
| https://news.ycombinator.com/item?id=45904551
| dang wrote:
| Thanks! Macroexpanded:
|
| _GPT-5.1: A smarter, more conversational ChatGPT_ -
| https://news.ycombinator.com/item?id=45904551 - Nov 2025 (672
| comments)
| amelius wrote:
| More of the same, I suppose.
|
| You have to be called Apple to get raving reviews for that.
| observationist wrote:
| This is the first low-key, silent feature rollout, treated like
| "just another software update", with no hype or buzz
| beforehand. Prior to this point, every other feature release
| was pumped for weeks or even months with "leaks" from insiders
| and deliberately getting people amped. I don't know if OpenAI
| changed marketing tactics, or if they're in a new chapter in
| some book, but this is a radical shift from what they were
| doing before.
| __jl__ wrote:
| The prompt caching change is awesome for any agent. Claude is far
| behind with increased costs for caching and manual caching
| checkpoints. Certainly depends on your application but prompt
| caching is also ignored in a lot of cost comparisons.
| pants2 wrote:
| Though to be fair, thinking tokens are also ignored in a lot of
| cost comparisons and in my experience Claude generally uses
| fewer thinking tokens for the same intelligence
| miohtama wrote:
| > On coding, we've worked closely with startups like Cursor,
| Cognition, Augment Code, Factory, and Warp to improve GPT-5.1's
| coding personality, steerability, and code quality.
|
| Why no GitHub?
| dweekly wrote:
| A few hours of playing around and I'm suitably impressed.
|
| Claude 4.5 Sonnet definitely struggles with Swift 6.2 Concurrency
| semantics and has several times gotten itself stuck rather badly.
| Additionally Claude Code has developed a number of bugs,
| including rapidly re-scrolling the terminal buffer, pegging local
| CPU to 100%, and consuming vast amounts of RAM. Codex CLI was
| woefully behind a few months ago and, despite overly conservative
| out-of-the-box sandbox settings, has quite caught up to Claude
| Code. (Gemini CLI is an altogether embarrassing experience, but
| Google did just put a solid PM behind it and 3.0 Pro should be
| out this month if we're lucky.)
|
| Codex with 5.1 high managed to thoughtfully paw through the
| documentation and source code and - with a little help pulling
| down parts of the Swift Book - managed to correctly resolve the
| issue.
|
| I remember getting the thread manager right being one of the
| harder parts of my operating systems course doing an undergrad in
| computer science; testing threaded programs has always been a
| challenge. It's a strange circle-of-life moment to realize that
| what was hard for undergrads also serves as a benchmark for
| coding agents!
___________________________________________________________________
(page generated 2025-11-13 23:01 UTC)