Learning Library

← Back to Library

Altman Targets Model Overload with Orion

7m • Unknown Channel • ai-ml • news • intermediate • Watch on YouTube ↗

Key Points

Sam Altman’s recent blog post outlines Open AI’s roadmap, mentioning the upcoming GPT‑5 and a previously leaked internal project called “Orion,” now slated for release as GPT 4.5.
Altman criticizes the current ChatGPT UI for offering an overwhelming and confusing list of model options, arguing that intelligent systems should not require users to navigate a complex dropdown menu.
He categorizes language models into three tiers: fast “reasoning” models that respond instantly, medium‑speed “thinking” models that operate under a minute, and deep‑inference models that take longer than a minute to yield more nuanced, higher‑quality results (e.g., 01 Pro).
The emergence of “deep research” models—capable of extensive inference and producing detailed reports—has begun to surface online, demonstrating the tangible benefits of the third‑class, deep‑inference approach.

Sections

00:00:00 Altman's Roadmap & Model Overload - The speaker summarizes Sam Altman's new blog post detailing a bewildering array of GPT models in the app’s dropdown, his criticism of that complexity, and the upcoming launch of a streamlined Orion/GPT‑4.5 version that won’t be a reasoning‑focused model.

Full Transcript

# Altman Targets Model Overload with Orion **Source:** [https://www.youtube.com/watch?v=BA-TyC-ZTUs](https://www.youtube.com/watch?v=BA-TyC-ZTUs) **Duration:** 00:07:03 ## Summary - Sam Altman’s recent blog post outlines Open AI’s roadmap, mentioning the upcoming GPT‑5 and a previously leaked internal project called “Orion,” now slated for release as GPT 4.5. - Altman criticizes the current ChatGPT UI for offering an overwhelming and confusing list of model options, arguing that intelligent systems should not require users to navigate a complex dropdown menu. - He categorizes language models into three tiers: fast “reasoning” models that respond instantly, medium‑speed “thinking” models that operate under a minute, and deep‑inference models that take longer than a minute to yield more nuanced, higher‑quality results (e.g., 01 Pro). - The emergence of “deep research” models—capable of extensive inference and producing detailed reports—has begun to surface online, demonstrating the tangible benefits of the third‑class, deep‑inference approach. ## Sections - [00:00:00](https://www.youtube.com/watch?v=BA-TyC-ZTUs&t=0s) **Altman's Roadmap & Model Overload** - The speaker summarizes Sam Altman's new blog post detailing a bewildering array of GPT models in the app’s dropdown, his criticism of that complexity, and the upcoming launch of a streamlined Orion/GPT‑4.5 version that won’t be a reasoning‑focused model. ## Full Transcript

0:00I'm here to tell you all about Chad gp5 0:02and know that is not just clickbait we 0:04actually have real news on it so Sam 0:07mman published a blog post uh a couple 0:09of hours ago that lays out his personal 0:12Vision or roadmap for where they're 0:14going next and yes it includes talk of 0:16GPT 5 it includes what uh they call 0:18internally Orion which got leaked all 0:20over the internet in the fall and then 0:22never materialized let me tell you about 0:24it net net what Sam wanted to emphasize 0:28in his blog post is that he has become 0:31very concerned about the number of 0:33models in the drop down and how complex 0:36they are to use just to give you a 0:38couple examples if I hit my drop-down 0:40now in the chat GPT app I can see one 0:44two three four five six models that are 0:47visible and more models hiding in a 0:49carrot plus a temporary chat icon that's 0:51too many like if you can see 4 40 with 0:54scheduled task 01 03 mini 03 mini High 0:5701 PR mode and by the way none of those 1:00is operator none of those is deep 1:03research I have to go to chat gp.com to 1:05use deep research I can't even use the 1:07app right now even though the results 1:09appear in the app it's a very very 1:11confused 1:13experience and so Sam is basically 1:15saying this is not what we want the 1:17magic of intelligence to be it should 1:19not be this ridiculously complex picker 1:22where you don't even know what the what 1:24the heck you're 1:25picking and so what he said was they are 1:28going to release Orion or GP 2 4.5 it is 1:31supposed to be very good but it is not a 1:33reasoning model which means it doesn't 1:36use inference time compute and I want to 1:38be really precise about this because 1:40people are kind of equating the cheaper 1:44more widely available reasoning models 1:46with deep inference reasoning models and 1:49I think that we need to get better at 1:50naming you have models like GPT 4 o 1:54right now or Claude that just come back 1:59with something very quick 2:01you also have models that claim to be 2:04thinking models but that do not think 2:06for minutes at a time they think for 2:08under 60 seconds generally I would argue 2:11that uh the flash thinking model from 2:14Google is a good example I think deep 2:15seek is a good example uh there's a 2:18number of other examples as well and 2:21then you have models in a third class 2:23that think for more than 60 seconds at a 2:26time and do deep inference 01 Pro is a 2:29great example there there aren't a lot 2:31of other examples and it's unfortunate 2:33that 01 Pro is not more widely available 2:35because I think if people used it they 2:37would understand there is that third 2:38class and the results are definitely 2:41different and meaningfully better if you 2:43give the model that much time I think 2:45the closest that people are starting to 2:47see is when they're starting to see deep 2:50research and starting to see the results 2:52of deep research published on the 2:53internet deep research is another 2:55thinking model it takes a very long time 2:57to infer and come up with something but 2:58it's very compelling report and that's 3:00leaking all over the internet now people 3:02are seeing the results and are rightly 3:05amazed so we have at least three classes 3:07of model people are generally conflating 3:09them into two I think that's not 3:11helpful and what Sam is saying is 4.5 or 3:16oion is really in that first class it's 3:18a fast response model it's just very 3:20very 3:21good now that being said he has made it 3:25clear that that is the last Model that 3:27they are going to release as a separate 3:29model 3:31it will be named 4.5 probably and it 3:33will appear in the UI and that will be 3:35that since he gave a hint previously it 3:39would be during the winter I think that 3:41gives him about one more month to 3:42release it and then he says he's moving 3:45on what is he moving to chat 3:48gp5 so he's keeping the GPT branding but 3:53every other capability they have built 3:56is getting wrapped under that name label 3:59and brand which is a good branding 4:01decision Chad gp5 will have deep 4:03research Chad gp5 will have operator 4:06mode Chad gp5 can think for minutes at a 4:08time or think for a second it will 4:11decide and it will decide based on the 4:13task that you give it as someone uh on 4:16Twitter said very snarkily chat GPT is 4:19the friends we made along the way 4:20basically it's all these models along 4:22the way and we bring them together and 4:25we wrapped them into one clean consumer 4:27bow so the consumer can use them now 4:30that all depends on a lot of really 4:31smart interpretation of user intent so 4:34we will have to see how well they do 4:36with that history suggests they're going 4:37to do pretty 4:39well and in terms of timing he didn't 4:41give a time but I'm strongly expecting 4:44GPT five to come out uh probably before 4:48the fall if I had to guess given their 4:50Pace they've been doing major releases 4:52every two or three months so I would 4:53expect sometime in the summer is my 4:56current guess and that is a guess I do 4:57not have private unknowable information 5:00it's just a 5:02guess so there you go that's what we 5:04know about the road map I am glad 5:06they're making the effort but I do have 5:09a fairly big question around 03 because 5:12we were promised 5:1303 I we were promised 03 Pro I don't 5:17know where that is now I don't know 5:19where it is so we will just have to see 5:23um it may get wrapped in as GPT 5 and we 5:27just have to tolerate that I thought we 5:29were on a slightly faster time frame 5:31with it but maybe what Sam felt is that 5:33at the end of the day 5:36it's too complicated for people to add 5:40yet another number in model and so he 5:42decided to change it one more tidbit for 5:45you this is not about Sam mman it's not 5:46about open AI uh I just learned this the 5:51owners of ai.com which currently 5:54redirects to deep seek it's not deep 5:56seek I thought it was deep seek it's in 5:59pardonable that I think it's deep seek 6:01because it goes to deep seek so I 6:02thought deep seek bought it that's 6:04normally how domains work but no it's 6:06apparently a for profit separate entity 6:11run by run out of 6:13Malaysia um and someone bought that 6:16domain back in 1993 I wish i' bought 6:19that domain back in 1993 and they now 6:22make a mint renting the domain to 6:26whoever pays them and it appears that 6:27deep seek is currently using some their 6:29five and a half million or whatever they 6:31use to train the model to uh buy access 6:36or redirects from ai.com to deep seek 6:41so it's just a tidbit I have nothing 6:44more to follow up on other than the 6:46internet is weird and you can make money 6:47in lots of ways and someone has figured 6:49out out of Malaysia how to rent ai.com 6:52for what is presumably a pretty penny I 6:56am sure it is not cheap to do that so 6:58there you go that's your tidbit lots 7:00more on GPT 5 hope you enjoyed it