David Ondrej · YouTube

OpenAI just shipped the Mythos killer (GPT 5.5)

32:33 · English · 126 segments · 5,182 words · 26 min read


TL;DR

In this video, David Ondrej reviews OpenAI's GPT 5.5, discussing its capabilities in web development and comparing it to previous models and competitors.

OpenAI GPT 5.5 review, GPT 5.5 capabilities, AI model comparison, web development AI, Codex performance, Anthropic Opus comparison, AI model benchmarks, GPT 5.5 features


Transcript

Okay, so OpenAI just dropped GPT 5.5 minutes ago. So, we're going to look at it. Supposedly, it's on the level of Mythos. I already have it in ChatGPT. Pro users got it first; on my Team account, I don't have it. So, we're going to look at how good it is

and test it inside of the Codex app to see what it can build, because supposedly it's insane at SVG graphics and 3D and any type of web development and front end. So this is OpenAI's response to Claude Mythos. The difference is Anthropic's

Mythos is not available. GPT 5.5 is available now. So let's go through the article first. "Introducing GPT 5.5: a new class of intelligence for real work." Okay. So, a 90-second video. Let's look at it. It is different in the sense that it understands what I'm trying to

tell it to do. I see. Okay. That's a big issue with Opus 4.7. Opus 4.7 does not understand what you're telling it to do. In fact, I would even say that Opus 4.7 is a clear sign of Anthropic's compute crunch. This is why

2026 could be the year of OpenAI, because Anthropic is running out of compute. Opus 4.7 is the first model Anthropic has released in the last two years that people think is worse. This is completely unprecedented. Anthropic has, in the last three weeks,

suffered massive reputation hits, first because they didn't release Mythos, but then there were user-spotted regressions in Opus 4.6, and 4.7 dropped and the vibes are off. Sure, on the benchmarks it's better than 4.6, but if you used it, and I use it

every single day, I still prefer to use Opus 4.6. In fact, if I open up a new terminal, boom, and I open Claude Code. Enter. Guess what model is selected? 4.6 fast. Now, mainly it's because 4.7 doesn't have fast mode, but still

4.6 just listens to your instructions. It just does what you want. 4.7 sometimes gives you tuning insight, but most of the time it's just... So, OpenAI has a great opportunity to not only catch up to

Anthropic, but overtake them once again, because they have secured more compute. Dario Amodei, the CEO of Anthropic, played it very safe with how much compute Anthropic has invested in, and because they had insane growth in the first quarter of 2026, now

we're seeing that Anthropic is running out of compute and users are spotting massive degradations, whether it's inside of Claude the app or Claude Code, or just usage limits: people are hitting usage limits super fast. So, OpenAI has a real opportunity because

they invested way more money into compute, infrastructure, and data centers than Anthropic. And this year, they're going to have way more compute, which will allow them to make better models. So, let's see if GPT 5.5 is the first

hint of this OpenAI comeback. Comes up with potentially multiple options of how we could do it. And then... So obviously we will test that. This guy is an engineer from Ramp, which is the finance credit card company. But we will test that at the end. Okay. After

we go through the benchmarks and the main info about the model, I'll jump straight into Codex, which, by the way, got an update a few minutes ago. It was a little late because I was like, did they add it to Codex? Did they not add it to Codex? But anyway, you can see that the UI is a bit different. You can see GPT 5.5 here. Speed: fast. We're going to do

all kinds of tests. But first I want to learn what this model is about and how good it is. And in the meantime, I'm just going to copy the full page. I'm going to go into ChatGPT. I'm going to do page, just page XML tags. Definitely not Pro mode. Let's do

thinking, normal. And I'm going to say: give me a concise summary of the most interesting points about this new GPT 5.5 model, and especially what is unusual or new about this model compared to other cutting-edge AI model releases. Be very concise. Now, obviously, we are

using Thinking with GPT 5.5 already selected here. So, we're going to have GPT 5.5 summarize this page about 5.5. And it did, I would say, a 98% job all by itself. And I buttoned some stuff up and it was done. It was able to tr...

The problem with these testimonials or these clips is that I think Opus 4.6 probably could have done them. She says she had a bunch of bugs. I don't know if GPT 5.4 or Opus 4.6 could have fixed these bugs. So,

yeah, this is not really valuable. Let's read through this text to see what's interesting here. Understands what you're trying to do faster. The gains are especially strong in agentic coding, computer use, knowledge work, and early scientific research. This is a

direct jab at Anthropic. Look at this. Larger, more capable models are often slower to serve, but GPT 5.5 matches GPT 5.4 per-token latency in real-world serving while performing at a much higher level of intelligence. It also

uses significantly fewer tokens to complete the same correct task. Another hit, another jab at Anthropic, because if you remember my Opus 4.7 video, it has a new tokenizer which burns more tokens for the same

task. So, OpenAI is flexing their compute. Here: we are releasing GPT 5.5 with our strongest set of safeguards to date, designed to reduce misuse.

So, it's even more censored. Yikes. Today GPT 5.5 is rolling out to Plus, Pro, Business, and Enterprise users. So nothing for the free users, guys. This is why you need to pay for AI. I don't care if it's ChatGPT, Claude, or Perplexity. Just pay for some

account, okay, to use the latest and greatest models. Okay, some benchmarks. Let's look at it. Also, they included 5.5 Pro as well, which we can also test out. I have it, obviously; I'm on the Pro plan. We have Terminal-Bench, where it absolutely demolishes Opus 4.7. Expert

SWE. So this is very risky. This is dirty from OpenAI: they did not include SWE-bench Verified, where Opus is better. They just used some benchmark that Opus doesn't have a score for. This is very strange. GDPval, so this is

economically valuable tasks. 5.5 wins. OSWorld-Verified, barely wins. Toolathlon, another one Opus doesn't have. BrowseComp. Okay, much better. FrontierMath, much better. Okay, so on math it

absolutely destroys Opus. And CyberGym, it's probably some cybersecurity stuff, it destroys. But this is a very small list of benchmarks. If you look at Opus, they included a way larger list in their release, way more benchmarks. So, shady from OpenAI, but let's keep going. Let's look at the

summary from 5.5 to see what's interesting here. It is positioned less as a smarter chatbot and more as an agentic work model. The big claim is not just better answers; it is better at taking a messy task, planning, using this. Okay, the coding jump looks real, especially

long-horizon coding. Yeah, SWE-Bench Pro is way behind. Not way behind, but quite behind Opus 4.7. Yeah, fewer tokens while doing better. GPT 5.5 improved the infra serving GPT 5.5. So these are hints of self-improvement, you

know, recursive self-improvement. Obviously not on the level of training the models, but we're getting there. Very long context is now materially better: GPT 5.5 supports a 1 million token context in the API and performs much better than GPT 5.4 or 512. Okay.

All right. That's a huge jump on MRCR. GPT 5.5 Pro seems to be aimed at high-accuracy professional work. Yeah, this takes... if you're in ChatGPT, Pro queries take 20 minutes to answer. So obviously that's not for typical tasks or everyday

chatting. Scientific research capability. I think this is just OpenAI wanting to have good PR. Google DeepMind. Cyber, biochemistry. API not... okay, this is interesting: API not available immediately. Let's check OpenRouter. GPT... Yeah, it's not here. Wow.

ChatGPT and Codex get it first. API access very soon. Damn. They just want to have more people using Codex, which, again, we're going to test in a second. Now, because we just went through it: bottom line, the unusual angle is that it's not

marketed mainly as getting better at answering questions. It's marketed as persistent tool calling. First of all, I have to say this answer was much better than 5.4. Something weird was happening with GPT 5.4 before, where the answers were hard to understand. This was a pretty easy

read and was nicely formatted. So conversationally, just from this one answer, I can say it's already feeling better than GPT 5.4. But what we care about is whether you can build anything with this model. So what I'm going to do: I'm going to open a new project, existing folder.

There we go. Create a new folder, GPT 5.5. And let's go here. So obviously, the standard setting inside of Codex is full access. Don't have it on default permissions; full access for sure. Then model: obviously use

GPT 5.5. There's no reason you would use any other model. I don't know why they even make them available. Use 5.5. Okay, now, medium or high are good base points. Extra high is for fixing bugs and big refactors. So I think we can start with medium and then probably go to high.

And then speed: make sure to do fast, because it's a lot faster inference. It consumes the limits a bit faster, but hey, I'm on the $200 a month plan. I don't care. So, let's see what people have been building with this. Some of the most impressive not societies

unicorn test. Okay. So, I'm just going to screenshot this big Zcode. I don't know what that is, but let's try to recreate it. Recreate this unicorn exactly as in the attached image.
