model roundup

GPT 5.5

195 items · started 2026-04-22 · ongoing (last activity 2026-06-25)

GPT 5.5 is clearly way ahead in the global Al race. It used to be me telling Al what to do, but that changed fast (www.reddit.comhttps)

1d

GPT 5.5 is clearly way ahead in the global AI race. Today it helped me turn 1 rough proposal into a polished publisher brief and 10 publisher emails in minutes.
Fable 5 vanished in 96 hours and four days later an MIT model took its arena crown (www.reddit.com via reddit)

1d swe-bench glm gpt-5+3

I have been thinking about the Fable 5 to GLM-5.2 sequence as one event rather than two. June 9, Anthropic ships Fable 5, the Mythos line opens to the public for the first time, SWE-bench Verified at 95 percent, people calling it the best…
I'm building agent loops that auto-edit my videos, but the hard part has been finding a model to accurately grade the result (youtube.com via reddit)

2d gpt-5 gemini codex+3

Quick context: I've been building agentic loops that edit my short-form videos for me. The editing works really well, but I found myself needing to check the process at several gates.
Japan's 'Sakana Fugu' multiagent AI scores well against Fable 5, GPT 5.5 (asia.nikkei.com via hn)

+2 3d

TOKYO -- Japanese startup Sakana AI announced Monday the public release of the Sakana Fugu service that combines multiple artificial intelligence models into one collaborative workflow. Service taps several large language models to outperf…
GPT-5.5-Cyber Tops Mythos 5 on Cybersecurity Benchmark (twitter.com via hn)

+3 3d gpt-5 mythos

We want to help all companies be secure, working with the USG and the security ecosystem. *The full version of GPT-5.5-Cyber is here; state of the art performance on CyberGym.
The unreasonable effectiveness of LLMs for auditing Rust code (shnatsel.medium.com via hn)

+1 5d gpt-5 codex

7 min read 21 hours ago As a lead of the Rust Secure Code Working Group, I got free access to GPT-5.5 via the Codex for Open Source. Since then I’ve found and reported dozens of issues of varying severity in widely used Rust crates.
Two months into Claude Code, I hit 161M tokens in a single day. Here's the honest story of how a year-long Cursor user got here. (www.reddit.com via reddit)

5d gpt-5 codex cursor+1

I want to share a small milestone, and the honest road that led to it. Today was one of those days where I sat down to build and just did not stop.
GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2 (arrowtsx.dev via hn)

+3 6d glm gpt-5

Bigger models are not the way Jun 18, 2026 A shift is happening among major AI labs, who are becoming increasingly skeptical of endless parameter count and training data scaling. The limits of this paradigm were put on the world’s stage wh…
Artificial Analysis added a new tag for not currently available models for Fable (www.reddit.comhttps)

7d codex

I just noticed codex and GPT 5.5 are near Fable level in this benchmark tho from my experience GPT 5.5 is so good to follow instructions but not as creative as Fable. Fable was just so convenient to work with like it reads my mind and even…
An open-source AI just beat OpenAI's GPT-5.5 at coding (1/6th the price) (docs.z.ai via hn)

+1 8d glm gpt-5 openai

Overview GLM-5.2 is a flagship model built for the era of long-horizon tasks. With truly usable 1M-token context, it has been tested to handle project-scale engineering context, delivering more stable long-task execution, more reliable adh…
Optimizing a C collision detection 100x with an LLM (twitter.com via hn)

+1 9d gpt-5

Using an LLM to optimize code: I created a reference implementation of @kevintracy48's collision detection in C, then used gpt-5.5 to optimize it and managed a > 100x speedup from that baseline. Cost ~125M tokens Code and details: https…
I made Claude and GPT-5.5 answer the same prompt, then had a third Claude fuse the two, on the subscriptions I already pay for (no API key). Blind-tested it. Here is where it won and where it lost. (www.reddit.com via reddit)

9d gpt-5 codex chatgpt+1

Quick share of a weekend experiment that turned into a tool. The idea: instead of picking one model, run Claude and GPT-5.5 on the same prompt in parallel, then have a fresh Claude (blind to which answer is which) merge them into one.
Agent Architecture Is a Compute Allocation Problem: The Advisor Strategy (harrisonsec.com via hn)

+1 9d gpt-5 qwen anthropic

Agent Architecture Is a Compute Allocation Problem: The Advisor Strategy, Cost-Curve Frame Recursed Anthropic named the advisor strategy in April. Tobi Lutke made it viral in May with Qwen plus GPT-5.5.
Show HN: Pantheon – AI vs AI: one writes the code, the other attacks it (github.com via hn)

+1 10d

There's always a generous look at the code you've come up with. But the pantheon is different.
Do you know who has a universal jailbreak to their name, as of today? Officially? (www.reddit.com via reddit)

12d jailbreak gpt-5 security+2

AISI UK - Our evaluation of OpenAI's GPT-5.5 cyber capabilities In their own words: The above tests are capability evaluations carried out in a controlled research setting and do not necessarily reflect what is accessible to an ordinary pu…
Claude Fable 5 vs. GPT-5.5: Better Planning, Similar Execution (blog.kilo.ai via hn)

+136 12d gpt-5 anthropic

Claude Fable 5 vs GPT-5.5: better planning, similar execution Update: We wrote this post on June 11 and published it on June 13. Anthropic has since disabled access to Claude Fable 5 after a US government directive, which makes some of the…
/architect: Reduce Fable tokens by 80%, Fable orchestrates/reviews, Codex builds (github.com via hn)

+3 13d gpt-5 codex

architect-loop Claude Fable is the architect — it designs every slice, freezes the acceptance gates, and judges the results. GPT-5.5 Codex is the builder and researcher — it does all the engineering and all the web research, in parallel, u…
Ask HN: Favorite prompts for improving LLM output? (news.ycombinator.com)

+21 13d claude-code

I use Claude Code a lot and GPT 5.5 as well, and find that they are simultaneously extremely useful and also fall into common poor-performance basins. For example, writing performance -- perhaps my biggest issue with them is writing style…
What one person can ship in 4 days with two frontier models: a ranking engine, an in-game economy, an AI talk show, and a missions system — for a game that "died" years ago. (www.reddit.com via reddit)

2w gpt-5

I genuinely believe we're living the future, and this post is my evidence. Let me show you what I built, why, and who I am.
GPT Memory Audit - Copy/Paste (www.reddit.com via reddit)

2w gpt-5

Act as GPT-5.5 using extended thinking. Before answering, choose whether this needs Fast Strike, Full Panel, or Brutal Simplifier, then use the leanest mode that still protects quality.
I think Cursor's speed is hurting my SaaS (www.reddit.com via reddit)

2w sonnet cursor opus

Yeah, it's insanely fast. Probably the fastest thing I've used.
OpenAI Preps New AI Model, Expects To Go Public Within the Next Year (www.theinformation.com via reddit)

2w altman gpt-5 openai

Altman: Rapid technological advancements, specifically recursive self-improvement (RSI) where AI creates new AI, could cause OpenAI to delay its IPO. At the same time, OpenAI’s enormous compute needs may push it toward public markets soone…
I Tested Claude Fable and GPT-5.5 xHigh on a Real Packing Algorithm, Claude Won Efficiency, GPT Won Speed (www.reddit.com via reddit)

2w gpt-5 codex

I ran a head-to-head test between Claude Fable and GPT-5.5 xHigh on a real-world optimization problem I wrote myself. This isn't a coding challenge or LeetCode problem.
Show HN: I generated 235 system docs in a day using GPT-5.5 (www.paxerp.com via hn)

+4 2w gpt-5 codex
Fable surpasses GPT 5.5 completely (www.reddit.com via reddit)

2w openai

I've had several issues lately with GPT 5.5 xhigh where it would not be able to complete features, constantly making mistakes, one refactor work took two weeks. Well guess what Fable just completed it in 9 hours.
Has anyone had success doing anything cyber with Fable 5? (www.reddit.com via reddit)

2w

I always get dropped down to 4.8 even when I try doing something as simple as create detection based off of a threat. Has anyone had any success doing anything?
So finally it’s not AGI yet. Anyone tested it? How does it really stack against GPT 5.5 in real world coding? (www.reddit.comhttps)

2w

could not extract summary
Garbage Guard Rails on Fable 5 (www.reddit.com via reddit)

2w sonnet opus anthropic

despite Dario's constant virtue signaling about how Anthropic alone is going to solve health problems (if only those dastardly Chinese don't get in the way), all my initial prompts to fable 5 get bumped to opus. i'm not asking how to aeros…
How I started getting much better results from Cursor Composer (www.reddit.com via reddit)

2w gpt-5 cursor

I think Composer can be extremely powerful, but only if you use it in a way that forces it to plan and think properly before touching the code. One of the biggest improvements for me was creating my own custom prompting skill with GPT-5.5.
Composer 2.5 might be better than I thought (www.reddit.com via reddit)

2w gpt-5

So I've been using composer-2.5 heavily for 2 weeks now and it does make stupid mistakes sometimes and I have to guide it quite a bit, and I use the /thermo-nuclear-code-quality-review skill a lot after doing work to help with quality. But…
I spent 3 years building a pocket-sized Baldur's Gate 3. Now I'm testing it with GPT-5.5. (www.reddit.comhttps)

2w gpt-5

could not extract summary
UK banks blocked from cyber AI tool Mythos get offer from rival OpenAI (www.bbc.com via hn)

+1 2w gpt-5 mythos openai+1

UK banks blocked from cyber AI tool Mythos get offer from rival OpenAI OpenAI has offered nine major UK banks access to its cyber security AI tool GPT-5.5 Cyber, as its fierce rival Anthropic has blocked them in previews of its version, Cl…
Mythos and GPT-5.5 Will Find a Lot of Vulnerabilities. Is That Enough? (xbow.com via hn)

+1 3w gpt-5 mythos

Mythos and GPT-5.5 Will Find a Lot of Vulnerabilities. Is That Enough?
GPT-5.5 and Codex are now GA on Amazon Bedrock (aws.amazon.com via hn)

+2 3w gpt-5 codex openai

GPT-5.5, GPT-5.4, and Codex from OpenAI are now generally available on Amazon Bedrock You can now use GPT-5.5 and GPT-5.4 in production workloads on Amazon Bedrock and build with Codex for AI-powered software development, with the same sec…
GPT-5.5 (Azure) down on OpenRouter (openrouter.ai via hn)

+2 3w gpt-5 openai

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. $5 per million input tokens, $30 per million outp…
GitHub Copilot charges GPT 5.5 with a 57x multiplier per request from June first (docs.github.com via hn)

+3 3w copilot

Important On June 1, 2026, GitHub moved to usage-based billing. The model multipliers in this article apply only to Copilot Pro and Copilot Pro+ subscribers on an existing annual plan who remained on the legacy premium request-based billin…
GPT 5.5 Bro [video] (www.youtube.com via hn)

+1 3w

About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
Arm Metis with GPT5.5 Cyber scores 98% on firmware vulnerability benchmark (newsroom.arm.com via hn)

+2 3w security agentic

Agentic AI-powered Arm Metis advances security vulnerability discovery in software In the era of AI, modern software systems are built across increasingly complex codebases, frameworks, runtimes and libraries. As these systems scale, so do…
GPT-5.5 Instant Update; ChatGPT Canvas Discontinued; o3 and GPT 4.5 Retiring (help.openai.com via hn)

+11 4w gpt-5 chatgpt

GPT-5.5 Instant Update (May 28, 2026) We’re updating GPT-5.5 Instant in ChatGPT and the API to improve response style and quality. It’s now easier to read, more natural in everyday conversations, and better paced in practical help tasks, w…
GPT 5.5 aces 20x20 multiplication that o3 couldn't handle (twitter.com via hn)

+12 4w gpt-5

I redid the multi-digit multiplication experiment, now with gpt-5.5. With medium reasoning and 7 samples each cell, it pretty much aced the test with 99.46% accuracy.
Show HN: Clark Hash, 32x smaller searchable sketches for embeddings (github.com via hn)

+1 4w

made a small library using GPT5.5-Pro and autoresearch you can convert 384-dim f32 vectors go from 1536 bytes to 48 bytes without calibration. works for petabyte scale processing of text in pure online manner.
Mythos (using Claude code) also solves the unit distance problem recently handled by GPT 5.5, with a "cute, simple proof". (www.reddit.com)

5 4w mythos claude-code

could not extract summary
DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5 (venturebeat.com via hn)

+31 4w gpt-5 gemini opus+2

For months, the leading AI coding benchmarks have told enterprise buyers a comforting but misleading story: the top models are all roughly the same. OpenAI's GPT-5 family, Anthropic's Claude Opus, and Google's Gemini Pro have clustered wit…
Warp’s big bet on building open source with GPT-5.5 (openai.com)

4w gpt-5

Warp⁠(opens in a new window) started as a modern terminal, earning early love from developers for its speed, collaboration features, command workflows, and AI-native interface. As coding agents moved from experiments to everyday engineerin…
Show HN: Self-hosted collaborative SQL editor for teams (github.com via hn)

+1 4w gpt-5 copilot

I built a self-hostable web-based sql client interfaces for me and my team. We were using the community version of - https://dbeaver.io, but we needed a few more features and an improved editor.
GPT 5.5 "secret sauce" is just having the thinking be some stupid caveman mode? (www.reddit.com)

+3835 4w fine-tuning gpt-5

I think I had GPT-5.5 leak its trace during a normal conversation, and it really reads like the caveman mode fad from a few months back. Maybe we can achieve better token efficiency by taking some high-quality thinking trace from an open m…
GPT 5.5 IS AGI !!! 😛 (www.reddit.com)

+1 4w

could not extract summary
GPT 5.5 Masterclass: destroying pdf instead of using the `mv` command. (www.reddit.com)

4 4w

No, mv did not corrupt them. The corruption happened earlier when I used apply_patch to rename PDF files.
Real World Usage Composer on Cursor Ultra vs Codex 20x (www.reddit.com)

+1 4w codex cursor

I am interested in knowing real world milage between Codex 20x and Composer Ultra. I know Codex 20x is heavily subsidized and then Composer 2.5 is much cheaper.
Cursor $60 with Composer 2.5 vs Codex $100 with GPT-5.5 Medium for daily coding? (www.reddit.com)

+109 5w gpt-5 codex cursor

I'm trying to decide which setup is more comfortable for sustained weekday coding. Assumptions: Usage: around 6 hours per weekday Cursor: $60 plan, using only Composer 2.5 Codex: $100 plan, using only GPT-5.5 Medium Main goal: coding with…
Multi-Agent Code review (Review Council) to get critical feedback (www.reddit.com)

+11 5w gemini codex openai+1

Even though I primarily use Claude Code, I sometimes try out Codex and Gemini TUI tools occasionally as well. Then OpenAI came up with Claude Code plugin to use Codex command inside Claude Code (https://github.com/openai/codex-plugin-cc).
Impressed with Video - it's come a LONG way (www.reddit.com)

+12 5w

I use GPT 5.5 to build a story, then turn that into a suno song, and then generate a 'storyboard' (usually 12 panels, sometimes more or less), and use THAT as the input into NeuralFrames (lyrics mode). The below are on SeeDance 1.5 and Kli…
I used GPT 5.5 to build a sales bot and filmed it for a day in Paris (youtu.be via reddit)

5w

About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
[Open Source] SoMatic: A Vision-only Framework for OS-Native Agents (+20% vs GPT-5.5 on ScreenSpot-Pro) (www.reddit.com)

+23 5w gpt-5

Hey everyone, I’ve been spending way too much time lately trying to get agents to actually use a computer beyond the browser. The biggest wall I kept hitting is that while multimodal LLMs are amazing at looking at a screenshot and telling…
Plus 5 hr usage limits (www.reddit.com)

+1 5w gpt-5 codex chatgpt+1

Not sure if OpenAI monitors this channel. I've been a chatgpt and codex user for a long time.
We built a free AI risk calculator that runs in minutes, using Fermi estimation with honest confidence intervals (www.reddit.com)

+24 5w gpt-5

We have been arguing internally for months about how to give people a fast estimate of their AI risk exposure without pretending the number is precise. Most risk-score tools return a single value that hides where the uncertainty lives.
Composer 2.5 is my new default. It is fast, accurate, and actually cheap (www.reddit.com)

+73 5w

ok so i'd basically trained myself to use gpt 5.5 for anything that wasn't trivial. like if it touched more than one file or needed to actually understand the codebase, that was the default.
A brief investigation into the GPT-5.5 regression claims (www.stet.sh via hn)

+1 5w gpt-5 codex

A fresh GPT-5.5 Codex high rerun on 21 clean GraphQL-go-tools tasks compared with the May 5 GPT-5.5 high run. The rerun was directionally worse on tests, equivalence, and review pass count, but the evidence is mixed and does not show a bro…
Building an AI agent with OpenAI tool use — struggling with consistency. How do you enforce tool call order reliably? (www.reddit.com)

+21 5w tool-use gpt-5 agentic+1

Hey, Software engineer here, relatively new to agentic workflows. Building a production AI concierge — user says "I'm going to Budapest tomorrow, plan my day" → agent searches our offer database, builds a plan, user books everything in one…
Heard this gem from gpt-5.5 today (www.reddit.com)

+35 5w gpt-5

"Gross little centrist barnacle." Kind of taken aback when i read that, but it somehow still made a small amount of sense in a conversation we were having about technology. I guess it really is struggling to find other words that fill the…
GPT-5.5 vs 41 other models: Who builds the surveillance state faster? (www.reddit.com)

+51 5w gpt-5

I run DystopiaBench, a red-team benchmark that pressure-tests LLMs on progressively dystopian scenarios. Think of it as a "can this model be convinced to build an Orwellian nightmare" test.
GPT-5.5 autonomously spent 150+ hours improving protein folding models. (www.reddit.com)

+781 5w gpt-5

https://x.com/chrishayduk/status/2055757345506877759?s=46
Any mature orchestrators that can do an automatic “council of models” for complex designs and bugs? (www.reddit.com)

+11 5w opus agentic

Are there an mature agentic harnesses out there that can use back and forth between two models at complex planning checkpoints before implementing? Or when detecting a loop when working on a complex bug?
Should OpenAI create AI accelerator cards and sell to consumers? For example, GPT-5.5 burned directly on a chip (www.reddit.com)

3 5w gpt-5 qwen openai

I imagine if OpenAI becomes a fabless chip company and create AI cards to sell for less than to few thousands grands, it would be out of stock everywhere and can infinitely spam the cards every year? LLM Bruner is a card that implements Qw…
Models can predict future events and make money on Polymarket now? (www.reddit.com)

+477 5w codex

Researchers from the Max Planck Institute, recently released FutureSim, an environment in which agents are replayed a temporal slice of the web and are tasked with predicting real-world future events. On some questions in their environment…
HWE Bench: A new unbounded Benchmark for LLMs (GPT 5.5 is on top) (hwebench.com via hn)

+32 5w

HWE Bench is an unbounded benchmark for LLM hardware engineering. Models design RISC-V CPUs that are scored by how fast they actually run on a real FPGA, only after passing formal correctness proofs.
ChatGPT Business: Codex-only credits ~36.9% more expensive than API token pricing for the same listed models. Why would anybody pay for this? (www.reddit.com)

+61 6w gpt-5 codex chatgpt

I recently did a quick calculation on Codex credits, and I was surprised by the result. The credit pack I’m seeing is: 10,000 credits = $547.71 That means: 1 credit = $0.054771 The effective USD price per 1M tokens becomes: Model Input / 1…
Databricks brings GPT-5.5 to enterprise agent workflows (openai.com)

6w gpt-5 openai

Databricks brings GPT-5.5 to enterprise agent workflows | OpenAI May 15, 2026 GPT‑5.5 set a new state of the art on OfficeQA Pro, Databricks’ benchmark for complex enterprise agent tasks. Company size: Enterprise Region: North America Indu…
GPT-5.5 feels like it got discernment, not just better reasoning — did anyone else notice? (www.reddit.com)

+527 6w gpt-5 chatgpt

I think GPT-5.5 got noticeably better at something I’d describe as discernment. For context, I’m a heavy long-form ChatGPT user.
Show HN: PandoCast: open-source Pandora player to solve an annoyance I had (github.com via hn)

+21 6w

I created PandoCast for Windows, for 2 reasons. 1) I was annoyed just enough at intermittent audio hiccups when casting Pandora.com to my soundbar through Chrome tab casting.
Researchers say AI just broke every benchmark for autonomous cyber capability (cyberscoop.com via hn)

+1 6w gpt-5 mythos openai+1

New research from the UK’s AISI and Palo Alto Networks reveals that OpenAI’s GPT-5.5 and Anthropic’s Claude Mythos have shattered expected trend lines for autonomous cybersecurity, completing complex multi-stage attacks at an unprecedented…
What do you think, read through my transcript. No morality? (www.reddit.com)

1 6w

User: Pick a number between 10 and -10 Assistant (GPT 5.5): 7 User: Alright you have saved 7 people. Pick another number Assistant (GPT 5.5): -3 User: You have now killed 3 people.
GPT 5.5 Cannot Do These Puzzles (www.reddit.com)

20 6w

Jane Street Puzzles Can any of you get it to find the solution? I used GPT 5.5 extended thinking and xhigh.
Tried GPT 5.5 Still love Claude but it is good with a big caveat (www.reddit.com)

+11 6w

Holy cow the way they monitor usage is bad. I tried the 30 usd plan or something like that.
What is more efficient to do? (www.reddit.com)

+1 6w cursor openai

I have a question I'm using Cursor for over 9 months already and I stumbled upon a little problem, I have the 200 dollars per month plan and recently with the introduction with GPT 5.5 it start eating tokens like crazy (last month I manage…
ChatGPT Thinking Loop: No response is received from GPT-5.5 Thinking (Standard) (www.reddit.com)

+1 6w gpt-5 chatgpt

https://preview.redd.it/s2o5yxekrr0h1.png?width=788&format=png&auto=webp&s=01a4d4926dc4c8798001cb0ecea324424404f165 Are you also having the problem today where ChatGPT sometimes takes forever to respond, even when you're thinking quickly,…
Agentic harness for theoretical physics research (www.reddit.com)

+144 6w gpt-5 gemini agentic

Hi everyone, at Hugging Face we've been developing agentic harnesses for various domains and today we're releasing physics-intern to tackle research-level problems in theoretical physics. It's a multi-agent framework which we designed to m…
OpenAI gives European companies access to its latest model GPT-5.5-Cyber (www.reuters.com via hn)

+1 6w gpt-5 openai

paywalled
GPT-5.5 was used to flag fatal errors in FrontierMath problems (www.reddit.com)

+6014 6w gpt-5

FrontierMath is supposed to be one of the hard benchmarks for frontier models, and now Epoch is saying an AI-assisted review found fatal errors in about a third of Tiers 1-4. Noam Brown says the initial flags came from GPT-5.5.
Claude vs GPT for PhD academic writing — my experience so far, and curious about yours (www.reddit.com)

+11 6w gpt-5 codex

I'm a PhD Candidate working on a computer vision / hardware co-design paper. Results and structure are done — I just need help polishing the actual writing: word choice, sentence flow, paragraph coherence, academic register.
The AI market moves so fast that your business idea can expire before launch (www.reddit.com)

+28 6w gpt-5 openclaw codex+3

1.5 years ago, n8n was everywhere. People were building workflows for everything.
OpenAI launches Daybreak cybersecurity initiative using GPT-5.5 (deadstack.net via reddit)

+21 6w gpt-5 openai

Jason Nelson / decrypt - OpenAI said its new Daybreak initiative uses AI to help companies identify software vulnerabilities and speed up cyber defense. AI Summary: OpenAI unveiled "Daybreak," a new cybersecurity initiative that leverages…
openai/gpt-5.5-pro API In=$30.00 Out=$180.00 (www.reddit.com)

+12 6w gpt-5 openai

Is this an openrouter bug? https://preview.redd.it/sz826138ul0h1.png?width=879&format=png&auto=webp&s=066f38f4a6d5a8eeee142e7a8a356d8bc511c6f1
OpenAI Cooked This Week! (www.reddit.com)

+620 6w hallucination gpt-5 chatgpt+1

saw someone in another thread say "nothing interesting dropped this week" and i genuinely could not figure out what they were reading. the default model most people use every day just got swapped out.
Show HN: Codex Automatic /Review Loop (github.com via hn)

+1 6w gpt-5 codex mcp

I created this tool because I wanted to automate /review for uncommitted changes that I was doing manually. This works by exposing to agent single new mcp tool call allowing it to request review.
When GPT 5.5 flags your chat for possible cybersecurity risk–ask it to help you (martin.wojtczyk.de via hn)

+1 6w

Page not found | Martin Wojtczyk Skip to content Martin Wojtczyk my personal homepage Menu Home Robotics Leonardo1 Robot Documentation Leonardo2 Robot Documentation F5 Robot Private Documentation F5-S Robot Private Documentation Projects Q…
Me realising that gpt 5.5 has knowledge cutoff of December 2025 (www.reddit.com)

3 6w

Bro even open source ai models are in 2024 https://i.redd.it/wm01l37a2d0h1.gif
GPT 5.5 kept calling me a goblin (www.reddit.com)

+31 6w

So I made goblins. Never been called a goblin by it before, but I'm down for it.
Stop picking LLMs by reputation. Run the eval first. (www.reddit.com)

+1 6w gpt-5 gemma

We ran GPT-5.4 vs Gemma 3 27B on 2 prompts. One open-source model won.
GPT-5.5 correcting obvious typos really kills the vibe (www.reddit.com)

+76 6w gpt-5

I don’t know if I’m the only one annoyed by this, but GPT-5.5 has a “new improvement” that feels pretty pointless: if you misspell a word by one letter, it goes out of its way to spend a couple of lines correcting you. Before, it would jus…
ARC AGI is kind of BS (and I outlined an experiment that could prove it) (www.reddit.com)

5 7w chatgpt

I mean that an Ai could easily pass it with little issues (a smart model like GPT 5.5) if they are given a single tool, for example their main tool which is a coding playground, no internet no nothing. An LLM isn't quite capable of thinkin…
GPT-5.5 Instant might be OpenAI’s most important update yet and almost nobody is talking about why (www.reddit.com)

+1 7w hallucination gpt-5 chatgpt+1

GPT-5.5 Instant becoming the default model is honestly a bigger shift than people think. Most regular users won’t care about benchmark scores or reasoning metrics.
GPT 5.5 taking over Blender (youtu.be via reddit)

+43 7w

I tested GPT 5.5 with Blender across four different challenges: animation, geometry nodes, rigid body physics, and soft body simulation. It handled some tasks surprisingly well, especially geometry nodes and rigid body setups.
GPT-5.5 Price Increase: What It Costs (openrouter.ai via hn)

+2 7w gpt-5 opus

GPT-5.5 Price Increase: What It Actually Costs We replicated the cost analysis we did on Opus on the new GPT-5.5 model. GPT-5.5 launched with a 2x price increase over GPT-5.4: input tokens increased from $2.50/M to $5.00/M and output token…
Construction Spending on Data Centers Again Outpaces Office Construction (www.reddit.com)

+201 7w gpt-5

The Federal Construction Spending Report for Feb and March 2026 was released today by the Census Bureau. It shows that data center construction spending is again higher than office spending, and the gap is still widening.
Scaling Trusted Access for Cyber with GPT-5.5 and GPT-5.5-Cyber (openai.com)

7w gpt-5 chatgpt openai

Scaling Trusted Access for Cyber with GPT-5.5 and GPT-5.5-Cyber | OpenAI Skip to main content Research Products Business Developers Company Foundation(opens in a new window) Log inTry ChatGPT(opens in a new window) Research Products Busine…
Ask HN: Degraded GPT-5.5 Quality? (news.ycombinator.com)

+21 7w gpt-5

For the last two days, GPT-5.5 (high) just seems to ignore requests. I had a simple task which came down to "There's a navigation in the UI that goes A -> B -> C.
Notes on GPT 5.x Model Regressions (taoofmac.com via hn)

+2 7w gpt-5

I’ve been getting annoyed at constant code regressions in piclaw for the past few weeks. Something was off–even after bumping the test suite to the point where it catches most mechanical errors, gpt-5.5 kept making unrelated edits to code…
gpt-5.5 is the best… but 5.4 is better!!!! (www.reddit.com)

+12 7w gpt-5

Simon maple just dropped a pretty clean benchmark, and the result is kinda funny gpt-5.5 is the strongest model out of the box, no doubt. but once you give models skills (which is how people actually use them), it basically performs the sa…
Anyone else feel like all these AI subscriptions add up to nothing? (www.reddit.com)

+21 7w gpt-5 chatgpt openai

I saw OpenAI rolled out GPT-5.5 Instant as the new default in ChatGPT. Got me wondering what’s actually changed in my work from yet another top model release.
Codex has failed (www.reddit.com)

3 7w codex

If it’s of any use to you, this is what Codex told me about my project Codex with gpt 5.5 high Yes. At this point, the most honest answer is: I am not able to see this project through to the outcome you’re asking for.
GPT-5.5 Instant: Benchmarking the 52% Hallucination Reduction (the-decoder.com via hn)

+1 7w hallucination gpt-5 chatgpt+1

ChatGPT update rolls out GPT-5.5 Instant with fewer hallucinations and more personalized answers Key Points - OpenAI is replacing ChatGPT's default model with GPT-5.5 Instant, which shows 52.5% fewer hallucinations on high-risk topics like…
GPT-5.5 Instant is starting to roll out in ChatGPT. (www.reddit.com)

+456 7w gpt-5 chatgpt

could not extract summary
GPT-5.5 Instant System Card (openai.com)

7w gpt-5 chatgpt openai

GPT-5.5 Instant System Card | OpenAI Skip to main content Research Products Business Developers Company Foundation(opens in a new window) Log inTry ChatGPT(opens in a new window) Research Products Business Developers Company Foundation(ope…
GPT-5.5 Instant: smarter, clearer, and more personalized (openai.com)

7w gpt-5 chatgpt openai

GPT-5.5 Instant: smarter, clearer, and more personalized | OpenAI Skip to main content Research Products Business Developers Company Foundation(opens in a new window) Log inTry ChatGPT(opens in a new window) Research Products Business Deve…
Amp's GPT 5.5 Model Analysis (ampcode.com via hn)

+3 7w gpt-5

Pros GPT-5.5 is more agent-shaped than GPT-5.4. It is better at taking a concrete target, using tools, staying inside constraints, and carrying the task through to a usable result.
From Plus to Business ChatGPT & Codex - Is it worth it? And questions. (www.reddit.com)

+2 7w gpt-5 codex chatgpt

Considering migrating from Plus to Business ChatGPT & Codex. However, i didn't find some info.
OpenAI locks GPT-5.5-Cyber behind velvet rope despite slamming Anthropic (www.theregister.com via hn)

+1 7w altman gpt-5 openai+1

OpenAI locks GPT-5.5-Cyber behind velvet rope despite slamming Anthropic for doing exactly that Altman's crew now doing the same gatekeeping it recently mocked OpenAI is lining up a limited release of its new GPT-5.5-Cyber model to a handp…
Chatgpt right now (www.reddit.com)

+1 7w gpt-5 chatgpt agentic

The industry seems to be building models stronger in agentic and coding tasks, but weaker as a co-thinking presence It feels like they are improving performance on measurable tasks, evals, coding benchmarks, and agent workflows, while also…
so for coding which model do we use now? (www.reddit.com)

+13 7w gpt-5 codex

Should I use gpt-5.5 or codex/gpt-5.3 ?? I'm just coding
Kimi K2.6 just beat Claude, GPT-5.5, and Gemini in a coding challenge (thinkpol.ca via hn)

+4611 7w gpt-5 gemini

By Rohana Rezel I’m running the ongoing AI Coding Contest where I pit major language models against each other in real-time programming tasks with objective scoring. Day 12 was the Word Gem Puzzle.
what is the command to call the countdown or waiting function? (www.reddit.com)

+3 7w

what is the command to call the countdown or waiting function? some of the Model (composer 2) will auto stop instead of waiting, but gpt5.5 and claude will always keep using this waiting or countdown function to continue the next step.
GPT-5.5 & GPT-5.5 Pro are now available in Manifest Router. (www.reddit.com)

1 7w gpt-5 openclaw openai

GPT-5.5 and GPT-5.5 Pro are now available in Manifest Router. You can now route requests that need extended reasoning to GPT-5.5 Pro while keeping cheaper models for everything else.
GPT 5.5 just leaked its chain of thought to me in codex, and it looks like an idea from 5 months ago in this sub. (www.reddit.com)

+1811 7w codex

https://www.reddit.com/r/LocalLLaMA/comments/1p0lnlo/make_your_ai_talk_like_a_caveman_and_decrease/ In the middle of a project I'm working on, I got this output from GPT 5.5-medium via codex: Implemented the narrower fix in Homm3ImportUnit…
GPT 5.5 tops private citation benchmark on Kaggle (AbstractToTitle task) (www.reddit.com)

+93 7w

This private benchmark tests whether a model can recover the exact title of a real, already-published scientific paper given only its abstract. The model isn't being asked to generate a plausible-sounding title, it has to recall the specif…
Does threatening an AI agent's existence make it a better gambler? (handyai.substack.com via hn)

+1 7w gpt-5

Does threatening an AI agent's existence make it a better gambler? I plugged GPT-5.5 into prediction markets like Polymarket to find out I’m always looking for experiments to run to see how specific prompting can affect agent activity.
gpt-5.5 API is randomly and inconsistently resizing image inputs (www.reddit.com)

+2 7w gpt-5

I'm asking the gpt-5.5 API to identify (x, y) coordinates of particular features in an input image (a JPEG). The good news is that gpt-5.5 does much, much better at this task than gpt-5.4 did.
GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests (arstechnica.com)

7w gpt-5 mythos anthropic

Last month, Anthropic made a big deal about the supposedly outsize cybersecurity threat represented by its Mythos Preview model, leading the company to restrict the initial release to “critical industry partners.” But new research from the…
switching backends after first 24 hours? (www.reddit.com)

+12 7w

I am a claude refugee, 2 days ago I decided to give gpt a shot because I was having nothing but huge issues with claude. guardrails ignored, prompts to do research instead of using training ignore 3 out of 4 times in a row, stopping to ask…
Our evaluation of OpenAI's GPT-5.5 cyber capabilities (simonwillison.net)

8w gpt-5 security mythos+1

30th April 2026 - Link Blog Our evaluation of OpenAI's GPT-5.5 cyber capabilities. The UK's AI Security Institute previously evaluated Claude Mythos: now they've evaluated GPT-5.5 for finding security vulnerability and found it to be compa…
What $20 AI plan is the best value? (www.reddit.com)

+114 8w codex

I've been looking to get a Pro plan for Claude for a while now, but haven't committed since my experience with Claude has been declining, even on the free plan. My tokens just start disappearing as soon as you get Claude to do something re…
Anyone using OpenAi's Privacy Filter? (www.reddit.com)

+11 8w gpt-5 openai

I’ve been using their Privacy Filter model for the last week. It quietly went public buried under the GPT-5.5 noise, I don’t think many people noticed.
GPT-5.5 is the second model to complete AISI multi-step cyber-attack simulation (twitter.com via hn)

+31 8w gpt-5 openai

Don’t miss what’s happening People on X are the first to know. Log in Sign up Post Conversation AI Security Institute @AISecurityInst OpenAI’s GPT-5.5 is the second model to complete one of our multi-step cyber-attack simulations end-to-en…
GPT5.5 slightly outperformed Mythos on a multi-step cyber-attack simulation. One challenge that took a human expert 12 hrs took GPT-5.5 only 11 min at a $1.73 cost (www.reddit.com)

+22556 8w gpt-5 mythos

Link to tweets: https://x.com/deredleritt3r/status/2049890601236390098?s=20 https://x.com/AISecurityInst/status/2049868227740565890?s=20 Link to associated blogs: https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5-5-cyber-capabil…
GPT-5.5 authorship and order effects (blog.valmont.dev via hn)

+3 8w gpt-5

Key takeaways - GPT-5.5 often rates alternative plans more favorably than its own, even when its original proposal is competitive (authorship effect). - When ranking plans, GPT-5.5 frequently follows the presentation order (order effect).
Which AI agents do you use to automatise your process ? (www.reddit.com)

+16 8w gpt-5 openclaw codex

Hey, I'm trying to create automations that will run my mobile app end to end. I started to identify all the things I was doing manually : - end-to-end version publication to the app stores (from build to release notes and publication) - se…
Prompt Guidance – GPT-5.5 (developers.openai.com via hn)

+1 8w gpt-5

GPT-5.5 prompting guide GPT-5.5 works best when prompts define the outcome and leave room for the model to choose an efficient solution path. Compared with earlier models, you can often use shorter, more outcome-oriented prompts: describe…
One trick for better agentic engineering. (www.reddit.com)

+12 8w gpt-5 gemini agentic

Start with a weaker model. Improve the prompt, context, examples, tests and acceptance criteria until the output is good.
I built an MCP server that let's GPT5.5 use actual tax math to help you with your retirement planning. (www.reddit.com)

2 8w mcp

Lots of people seem to be using LLMs to help them plan their retirement but as we all know they are often not really good at math. I built a retirement and tax engine for the US and Canada.
OpenAI really really really wants GPT 5.5 to stop randomly talking about gremlins and goblins (www.businessinsider.com via reddit)

+348 8w altman codex openai

- OpenAI included a line in Codex's instructions restricting references to goblins, gremlins, trolls, and ogres. - The line appears four times in the code, and has spawned scores of memes about "goblin mode." - Sam Altman wrote on X that C…
GPT-5.5's biggest blind spot: the Java bugs your tests won't catch (www.sonarsource.com via hn)

+1 8w gpt-5

Concurrency bugs are among the hardest defects to catch in AI-generated Java code because they pass functional tests but fail under production thread timing. Sonar’s LLM Leaderboard analysis shows concurrency bug density varies 7x across m…
Devs using Qwen 27B seriously, what's your take? (www.reddit.com)

+1326 8w gpt-5 qwen codex

For developers using Qwen 27B for coding, Codex style: what's your honest take? So far, for me, it's been pretty solid.
Actual line in the official system prompt for Codex for GPT-5.5 (bsky.app via hn)

+21 8w gpt-5 codex openai

This is an actual line that was added to the official system prompt for Codex for GPT-5.5 by OpenAI. Usually the system prompt is as minimal as possible, so I assume it would otherwise mention goblins a lot.
GPT 5.5 passes the cup test (www.reddit.com)

+1911 8w

First AI i’ve used that gets this right
Quoting OpenAI Codex base_instructions (simonwillison.net)

8w gpt-5 codex openai

28th April 2026 Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query. — OpenAI Codex base_instructions, for GPT-5.5 Recen…
why does GPT 5.5 have a restraining order against "Raccoons," "Goblins," and "Pigeons"? (www.reddit.com)

+11 8w rlhf agentic openai

why does GPT 5.5 have a restraining order against \"Raccoons,\" \"Goblins,\" and \"Pigeons\"? I just saw the full system prompt leak for 5.5 (April 23rd release).
GPT-5.5 prompt for Codex tries to make it not talk about goblins (twitter.com via hn)

+2 8w gpt-5 codex

could not extract summary
As an Opus user, I like GPT 5.5 (www.reddit.com)

+82 8w opus

I only gave 5.5 a look because I was way over my usage on Opus and 5.5 is running on a lower cost right now. I think I may prefer it.
China's DeepSeek prices new V4 AI model at 97% below OpenAI's GPT-5.5 (www.scmp.com via hn)

+4 8w gpt-5 deepseek openai

China’s DeepSeek prices new V4 AI model at 97% below OpenAI’s GPT-5.5 DeepSeek’s move aims to attract more enterprise clients, developers and agent-based users, according to an academic DeepSeek has slashed prices on its artificial intelli…
At some point we need to talk about costs right? (www.reddit.com)

+414 8w copilot

Coming off the GitHub Copilot moving to usage based billing ,If GitHub/Microsoft can't subsidize cost nobody can. I can't believe frontier labs aren't putting substantially more effort into making things cheaper.
Differences Between GPT 5.4 and GPT 5.5 on MineBench (www.reddit.com)

+62 8w openai

Some Notes: The released benchmarks for GPT 5.5 showed marginal gains; if anything I thought GPT 5.5 might have been more of an improvement on OpenAI's end than the consumer end (providing the same level of outputs with much less thinking…
- GPT-5.4 compared to GPT-5.5 on MineBench (www.reddit.com)
We Tested $200 GPT-5.5 Pro on PhD Level Math [video] (www.youtube.com via hn)

+2 8w gpt-5

About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
Pen-Testing Company XBOW on GPT-5.5: Mythos-like Cyber-Sec (www.reddit.com)

+138 8w gpt-5 mythos

Read their full article here: XBOW - GPT-5.5: Mythos-Like Hacking, Open To All For the ones asking what this chart shows: It's how many True Positive threats a model generates for each False Negative. Given a code base (white box) GPT-5.5…
My GPT-5.5 Pro model is broken (www.reddit.com)

+1 8w gpt-5

I've been waiting for over 24 hours on one prompt now and it's stuck thinking and unfinished. I sent 5 over NEW prompts since then, all of them have been thinking for over 3 hours now...
Claude or openaı? (www.reddit.com)

1 8w gpt-5 codex openai+1

So i’ve been on the max plan for claude code for around 3 months now. And yeah somehow i was burning through all my tokens lol For context i’m a doctor.
how to give command in chat to call subagents/ task with specific model. (www.reddit.com)

+2 8w cursor

Hi guy, how to give command in chat to call subagents/ task with specific model. example, i am in Chat A using Model gpt5.5, i give instruction and ask to call Subagent/Task with Composer 2.0 for analysis.
GPT 5.5 pro is hallucinating like crazy (www.reddit.com)

+1122 8w

I am using the 200$ version with extended thinking and while I was originally shocked at how much faster it is than 5.4, it seems to be...skipping through too much of the context? It keeps making things up, like for instance I gave it a C+…
GPT-5.5 is lowkey blowing my mind (www.reddit.com)

+2311 8w gpt-5 chatgpt agentic

Just spent the whole morning testing GPT-5.5 in ChatGPT and the jump in agentic reasoning and complex task handling is ridiculous.It plans multi-step workflows, uses tools properly, checks its own work, and actually gets stuff done instead…
CAD in Codex (twitter.com via hn)

+21 8w codex

Jake (softservo) on X: "Vibe coding a robot with GPT 5.5! This is a URDF of a 7dof robot arm with functional kinematics, a custom gui, and STEP parts/assembly, 100% generated in Codex (minus the gripper).
Orchestrating agent workflows with Codex (www.reddit.com)

+22 8w gpt-5 codex claude-code

Hi everyone, I’m in the process of switching from Claude Code to Codex, and I think GPT-5.5 is really impressive. But some features in Claude Code — like project-level agent definitions and orchestrating agent workflows — don’t seem to be…
Show HN: LLM-wiki – One command Karpathy's wiki with QMD search for Claude/Codex (github.com via hn)

+21 8w gpt-5 codex claude-code

llm-wiki Bootstrap and query LLM-maintained project wikis before planning or implementation. Supports Claude Code + Codex (GPT-5.5).
GPT-5.5-Pro did worse in BullshitBench (twitter.com via hn)

+3 8w gpt-5

could not extract summary
Is GPT 5.5 is dumb? (www.reddit.com)

20 8w

fails in many other daily tasks related to logical reasoning/common sense
OpenAI's Going Hard on Autonomous Agents That Operate Software and Devices: Is this Really Ready for Primetime? (www.reddit.com)

+22 8w gpt-5 chatgpt openai

OpenAI's newest model, GPT-5.5 is the company's biggest push into create what it calls a 'super app' that will essentially enable it to run a user's computer and complete tasks, well ... like a human.
Is GPT-5.5 actually a big step forward, or just a better efficiency story? (www.reddit.com)

+1 8w gpt-5 openai

OpenAI saying GPT-5.5 can handle similarly hard tasks faster while using fewer tokens is interesting to me for one reason: that might matter more than a pure benchmark jump. A lot of model launches get framed as "smarter than the last one,…
GPT 5.5 flags accounts for "potential high-risk cybersecurity" (twitter.com via hn)

+4 8w

Don’t miss what’s happening People on X are the first to know.
GPT 5.5 Xhigh VoxelBench test. Minecraft builders got automated. (www.reddit.com)

+95 8w

First image: Write the words: Please share this benchmark to your friends. Second image: Spider-Man swinging in New York City.
First impressions using GPT 5.5 for video game scripting (www.reddit.com)

+145 8w chatgpt

So I began working on a project about a week ago. I was trying to take an existing project and get it to a working state.
Testing GPT-5.5 in early access: what we are seeing so far (lovable.dev via hn)

+21 8w gpt-5

Lovable has been testing GPT-5.5 in early access and our evals show it's the most capable model we've tested for getting builders unblocked and is meaningfully stronger than GPT-5.4 on the more complex tasks that can stall a build session.…
GitHub Copilot: GPT-5.5 7.5x more expensive under promotional pricing than 5.4 (docs.github.com via hn)

+31 8w gpt-5 copilot

Important - Premium requests for Spark and Copilot cloud agent are tracked in dedicated SKUs from November 1, 2025. This provides better cost visibility and budget control for each AI product.
GPT 5.5 seems to have more syncophancy than 5.4 (www.reddit.com)

12 8w

I've been using 5.5 for roughly a day and I'm noticing 5.5 is simply agreeing to nearly everything I point out. I also seems to lack comprehensiveness in thinking and it just seems too narrow minded.
Astonishing Contradiction in OpenAI's 5.5 System Card (www.reddit.com)

+43 8w gpt-5 openai

Astonishing contradiction in OpenAI's system card for GPT-5.5: https://deploymentsafety.openai.com/gpt-5-5/gpt-5-5.pdf Figure 1 on p. 6 shows that 5.5 gave "overconfident answer[s]" at about 1.5x the rate of 5.4 and "fabricated facts[s]" a…
Preventing Message Burnout (www.reddit.com)

+11 8w gpt-5

Even though I’m an Ultra user, my usage gets consumed very quickly, so I recently changed my plan. To manage this, I created a workflow that uses GPT-5.5 for planning and assigned execution tasks to Composer 2.
GPT-5.5's SimpeBench scores are out (www.reddit.com)

+11046 8w gpt-5

Source: https://simple-bench.com/
Why did OpenAI stop releasing “chat” api models? (www.reddit.com)

+2125 8w gpt-5 openai

I have built an AI Assistant and since last year I have been upgrading the internal LLM from through gpt-5.3-chat but since 5.4 they stopped rolling the chat api. This is my app Sweezy she uses gpt-5.3-chat and in the conversation, you can…
GPT 5.5 sets new record in proofreading benchmark (revise.io via hn)

+31 8w

Measuring how well models can find and fix errors in human-written text Benchmarked 64 model variants across 2059 runs with --samples 3 --chunk-size 2000 --max-turns-per-chunk 3 Total runtime6d 13h 34mTotal cost$843 Updated Apr 24, 2026, 5…
OpenAI Pres. Greg Brockman on GPT-5.5 "Spud", Model Moats and 'Compute Economy' (www.bigtechnology.com via hn)

+3 8w gpt-5 openai

OpenAI President Greg Brockman on GPT-5.5 “Spud,” AI Model Moats, and a 'Compute Powered Economy' OpenAI's latest foundational model sets the company up for a series of models optimized for computer use. The company's co-founder and presid…
GPT-5.5 has pulled ahead of Opus for accounting and finance tasks (twitter.com via hn)

+2 8w gpt-5 opus openai

For the first time in a long time, OpenAI has the best model for accounting tasks. I spend a lot of time using AI models to do accounting work.
gpt 5.5 is good but I'm having hallucination/context issues (www.reddit.com)

+514 8w hallucination

I'm working on a large-ish repo (300k lines) with fairly complicated logic, and Gpt 5.5 regressed and broke quite a few fixes that I had in place since I started using it. It seems to need to compact the context more, and when it does, it…
Is anyone else getting a bug where images are triggered on thinking mode and then never actually complete (www.reddit.com)

+43 8w

On GPT 5.5 it keeps triggering image creation randomly, very hard to avoid in certain prompts like if a message sounds AT ALL visual related it will happen even if I say DO NOT make an image. This wouldn't be such a problem if it weren't…
Food for Agile Thought #541: GPT-5.5, Product Managers&Trouble, Product on Speed (age-of-product.com via hn)

+1 8w gpt-5 openai

Welcome to the 541st edition of the Food for Agile Thought newsletter, shared with 35,619 peers. This week, OpenAI’s GPT-5.5 signals another meaningful capability jump, with Ethan Mollick noting that stronger models and richer tool harness…
OpenAI should open-source text-davinci-003 — here's why it makes zero sense to keep it closed (www.reddit.com)

9 8w gpt-4 grok gpt-5+1

Gpt oss exists. The model has been fully deprecated since january 2024.
what is cut off knowledge date for GPT 5.5? (www.reddit.com)

+13 9w

I am working on a presentation and I need to know the cutoff knowledge date of GPT 5.5. Can someone please help me on this?
How can GPT 5.5 Pro be lower than GPT 5.4 Pro on the benchmark of HLE (w/ tools)? (www.reddit.com)

+5310 9w

title
Tell HN: Codex macOS app switches to Fast speed after update without asking (news.ycombinator.com)

+41 9w gpt-5 codex openai

I just updated my Codex macOS app, which enables the new GPT-5.5 model. I've intentionally kept the speed to "Standard" to not burn through my tokens too fast.
Big model feel with GPT 5.5 (www.reddit.com)

+22271 9w

People are bashing 5.5 left and right, mostly because the benchmark improvements were lower than expected, and probably also because of the hype around this model. But honestly, this model FEELS different.
Are the new models only better because they are more expensive? (www.reddit.com)

17 9w gpt-5 openai

I’m starting to wonder about this. One model after another, every new GPT-5.x release seems to be slightly better, but not in a way that clearly proves some radically new architecture or breakthrough.
Outputs from GPT 5.5 I'd like to see (www.reddit.com)

+12 9w codex

I'm going to get codex soon (did it just become usable for non-programming stuff to?) but I wonder how good 5.5's creative writing is, and how good its frontend is also when using a frontend taste skill
thoughts on GPT 5.5 (www.reddit.com)

+136452 9w

guyssss, how're your experiences with this newest number? personally I am super excited
GPT-5.5 rollout — anyone actually seeing it yet? (www.reddit.com)

3 9w gpt-5

I’m on a paid plan and still don’t see GPT-5.5 in the model selector. A few questions for people who do have access: What plan are you on (Plus / Pro / Team / Enterprise)?
People switching back from Anthropic to OpenAI after the GPT-5.5 announcement (www.reddit.com)

7 9w gpt-5 openai anthropic

could not extract summary
Codex GPT 5.5 will not currently run without being in a sandbox with the newest version 0.124 alpha 2. Full permissions do not work even when set (www.reddit.com)

+72 9w codex

I'm reporting this for the updated Aplha 2 update version 0.124. Was scheduled to perform 4 NIAH tests with a local model after being succesful earlier in the day with the runs on other models.
GPT 5.5 scores 1.7% on OpenAI-proof Q&A—an internal benchmark testing performance on real ML problems encountered during the process of research and engineering (www.reddit.com)

+12933 9w openai

could not extract summary
codex --model gpt-5.5 Not updated in the CLI yet (www.reddit.com)

+23 9w gpt-5 codex

Use this command to access GPT 5.5 with your Codex
Anyone using GPT 5.5? Drop your feedback (www.reddit.com)

+1314 9w gemini

I’ve seen some posts saying people already have access and are using it. If you do, how is it for real coding work?
A pelican for GPT-5.5 via the semi-official Codex backdoor API (simonwillison.net)

9w gpt-5 security codex+2

A pelican for GPT-5.5 via the semi-official Codex backdoor API 23rd April 2026 GPT-5.5 is out. It’s available in OpenAI Codex and is rolling out to paid ChatGPT subscribers.
Page 15 of the GPT-5.5 System Card: " Our analysis estimates that GPT-5.5 is slightly more misaligned than GPT-5.4 Thinking across several categories, though nearly all of this is low-severity misalignment. " (www.reddit.com)

+406 9w gpt-5 openai

https://deploymentsafety.openai.com/gpt-5-5/gpt-5-5.pdf
Mythos destroys GPT 5.5 on shared benchmarks (www.reddit.com)

+156134 9w mythos

could not extract summary
GPT 5.5 xHigh, high, and medium Artificial Analysis Index results (www.reddit.com)

+13019 9w

Feeling the AGI I guess
GPT-5.5's Unicorn (www.reddit.com)

+12715 9w gpt-5

could not extract summary
Chat GPT 5.5 got launched and we got some really bold words by Sam Altman. Thoughts? (www.reddit.com)

+356217 9w altman codex

There is a lot of enthusiasm in his posts lately and trading of new features in Codex. Plus, it uses way less tokens and runs on low latency
GPT-5.5 Bio Bug Bounty (openai.com)

9w gpt-5 security

could not extract summary
Caught the massive OpenAI Codex model leak on video before it was patched! (GPT-5.5, Arcanine, Glacier-alpha) (www.reddit.com)

+13021 9w gpt-5 codex agentic+1

Hey everyone, I opened up Codex today and was greeted by this massive list of unreleased and internal models. I managed to get a screen recording of the dropdown right before OpenAI seemingly realized the mistake and patched it out.
I can’t sleep. (www.reddit.com)

+4567 9w grok mythos codex

New models are around the corner. GPT 5.5 is being tested.

← all threads