Learning Library

← Back to Library

Super‑Exponential AI Timeline Explained

10m • AI News & Strategy Daily | Nate B Jones • ai-ml • deep-dive • intermediate • Watch on YouTube ↗

Key Points

MER, a nonprofit model‑evaluation and threat‑research group, tracks how long AI agents can perform tasks compared to humans, using success‑rate thresholds (50 % and 80 %).
Because the task‑relative metric has no upper limit, unlike fixed‑scope benchmarks, it reveals that AI progress is not merely exponential but super‑exponential.
The latest Opus 4.5 results show AI achieving roughly five hours of human‑equivalent work at a 50 % success rate (and 2,728 min at 80 %), indicating a doubling of capability roughly every 4–4½ months.
Projections suggest AI will handle 10 h of work by Q1, 20 h by mid‑year, and 40 h by year‑end, a self‑reinforcing flywheel that makes 2025 the “last normal year” and points to dramatically accelerated, AI‑driven development from 2026 onward.

Sections

Full Transcript

# Super‑Exponential AI Timeline Explained **Source:** [https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D) **Duration:** 00:10:23 ## Summary - MER, a nonprofit model‑evaluation and threat‑research group, tracks how long AI agents can perform tasks compared to humans, using success‑rate thresholds (50 % and 80 %). - Because the task‑relative metric has no upper limit, unlike fixed‑scope benchmarks, it reveals that AI progress is not merely exponential but super‑exponential. - The latest Opus 4.5 results show AI achieving roughly five hours of human‑equivalent work at a 50 % success rate (and 2,728 min at 80 %), indicating a doubling of capability roughly every 4–4½ months. - Projections suggest AI will handle 10 h of work by Q1, 20 h by mid‑year, and 40 h by year‑end, a self‑reinforcing flywheel that makes 2025 the “last normal year” and points to dramatically accelerated, AI‑driven development from 2026 onward. ## Sections - [00:00:00](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D&t=0s) **Super‑Exponential AI Timeline Explained** - The speaker clarifies MER’s benchmark graph, showing how AI agents increasingly outperform human task times without a ceiling, indicating a super‑exponential growth trend in AI capabilities. - [00:03:20](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D&t=200s) **Accelerating AI Task Delegation** - The speaker argues that swiftly learning to identify and assign high‑quality, week‑long tasks to AI will create a power‑law advantage in a super‑exponential future, leaving late adopters far behind. - [00:07:00](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D&t=420s) **Future Work: Managing AI Agents** - The speaker predicts 2026 will require workers to relinquish traditional career models and become outcome‑obsessed leaders of delegated AI agents. ## Full Transcript

0:00We are on the super exponential timeline 0:02for AI agents and I want to explain what 0:04that means and why it's super important 0:05that we all pay attention to it. MER is 0:08the model evaluation and threat research 0:12company. It's a nonprofit. It's 0:14dedicated to understanding how models 0:16perform and they are famous for 0:18producing a graph that shows how long 0:22models can do useful agentic work for at 0:24a time. It's a little bit of a confusing 0:26graph to understand, so I'm going to 0:27explain it really simply. Basically, 0:29they take a task and they measure how 0:32long a human takes to do that work task. 0:35And then they want to find out if the AI 0:37can do that task with at least a 50% 0:41likelihood of success. Why 50%, because 0:45they had to pick a number somewhere. 0:46They also measure it at 80%. And we'll 0:48get to that. PTR is important because it 0:52does not top out. And so if you have a 0:54lot of these these benchmarks like 0:56Swebench is an engineering one, it tops 0:58out at 100% and we're already in the 1:01like way way up at the top it doesn't 1:03matter like you can go from 91 to 93 and 1:06you don't really get a sense of how the 1:07models change. TR is different because 1:10that graph has no top end. It can just 1:12keep doing more and more work and that 1:14allows it to show super exponential 1:17progress. And one of the biggest debates 1:19of 2025 was are we on an exponential 1:23time scale with AI or are we on a super 1:26exponential where it's increasing faster 1:28than exponentially. It seems like we're 1:30on the super exponential trend line. And 1:32one of the things that made us think 1:33that is this latest result from Opus 4.5 1:36which shows over 4 hours 4 hours and 45 1:40minutes almost 5 hours of human 1:42equivalent work done at a 50% likelihood 1:45of success. Now the 80% mark is also 1:48measured and it is 2728 minutes for Opus 1:524.5 which you might think oh that's not 1:54that far but keep in mind it was not 1:56that long ago that we were 1 minute 2 1:58minute 10 minute 30 minutes and now 2:01we're up to almost 5 hours and that is 2:03the point of a super exponential curve. 2:06We are on a doubling rate every 4 to 4 2:10and 1/2 months right now. And so if the 2:12number is 50% complete, but the time 2:15horizon is four almost 5 hours, we're 2:18going to be at 10 hours by the end of 2:21Q1, we'll be at 20 hours by the end of 2:23Q2 into Q3, and we may be at 40 hours by 2:28the end of the year or past. And that is 2:31why we have to pay attention to this. 2:33Super exponential gains suggest that we 2:36have hit a selfreinforcing 2:39flywheel with AI. And that is indeed 2:41what we hear out of model makers and 2:43that is why 2025 was the last normal 2:47year. We are going to see really really 2:49weird progress from AI in 2026 and every 2:54year after because AI itself is starting 2:57to reinforce AI systems. We're bringing 2:59AI in to help train AI systems. Now that 3:02is going to become more and more 3:03automated. We are going to have 3:05capabilities that AI itself helps to 3:08grow speeding up the whole process and 3:10all of that is going to allow us to 3:12continue to make progress on these tough 3:15tasks that don't have an upper limit. 3:18And this matters because really our 3:20ability to do meaningful work is going 3:22to be determined by whether or not we 3:24can define useful high taste highquality 3:27work that an AI can do over a period of 3:29time. Do you have something for an AI 3:32that would take you a week to do? Maybe 3:34it's your taxes. I don't know. But 3:36that's going to increasingly become the 3:38question. And if you don't, then the 3:40question is going to be what does it 3:41take for you to get there? What does it 3:43take for you to gain the skill to assign 3:46that work? Because in a super 3:48exponential world, the skill we need to 3:50learn is also super exponential. The 3:53people who figure out how to assign 3:56agents work now in January and February 3:59and March are going to have a much 4:01easier time learning how to continue 4:03assigning agents work when the agents 4:06can do much harder stuff. Whereas if you 4:08wait and say, "I'm going to catch up. 4:09I've scheduled this for Q2 or Q3 next 4:12year. That's my AI quarter." Good luck 4:14with that. Like it doesn't work that 4:16way. There will be people who are 4:17running circles around you because they 4:18can assign their agents a week's worth 4:20of work. And once you can assign your 4:22agents a week's worth of work and spend 4:23up two or three of them, look at how 4:25much more productive that makes you. 4:26You're going to be running circles 4:27around people. And that that is the 4:30power law distribution world we're going 4:31to live in. Super exponentials create 4:33power laws. So power law is that the 4:35idea that the world we live in is not 4:38normally distributed. A normally 4:39distributed world, most people are on 4:41the average, a few people are on the 4:42tails. Einstein's way over here, right? 4:44But in a power law world, just a few 4:47people are going to be able to do a 4:50tremendous, tremendous amount. And it's 4:51not because they're necessarily going to 4:53have lots of money to do it. It's 4:55because they have the skills to do it. 4:57AI is going to disproportionately 5:01reward skill development where it's 5:03related to artificial intelligence and 5:06everything else. People are going to 5:08start to lose traction. If you are 5:11looking to make a dent in your career, I 5:14would look less in 2026 at your job 5:17family's traditional requirements and 5:19look more at where can an agent do a 5:22meaningful amount of work for a week in 5:24this traditional job family area and how 5:26can I make sure I set myself up so I 5:29know how to define and assign that work, 5:31know how to hold it accountable, know 5:32how to put good taste down so I know 5:34what excellent looks like in that work, 5:36know how to intervene, keep the agent on 5:38track, have the technical foundations 5:39necessary to define and set up an 5:41agentic system. This is going to become 5:43more and more relevant for all of us. 5:45The technical skill sets are going to 5:48spread across the job families. The 5:50non-technical skill sets are also going 5:52to spread across the job families. 5:54Engineers who traditionally just had to 5:55do code are going to have to have some 5:57business fluency and customer fluency 5:59now because they have to be the ones 6:00with good taste when they're 6:02architecting systems. And frankly, they 6:03now have to architect systems that 6:05non-technical people can contribute code 6:07to. So just that one shift, that ability 6:11of agents to do work over time is going 6:15to multiply the impacts across all of 6:18the rest of us. Having agents that work 6:20longer means all of our jobs are going 6:23to change forever. And you might think 6:26I'm like a hype person. This is not 6:29being me being hypy. This is me just 6:31talking about the reality that we are on 6:33a super exponential curve. Humans are 6:36bad at estimating super exponential 6:38curves. And so I just want to make it 6:39really concrete for you. I do think 6:40there is no way that work will not 6:43change for everybody if we are in a 6:45place where it's 5 hours and doubling 6:48every four months. Because you look at 6:50it by April you're going to be at 10 6:52hours. By what July September you're 6:54going to be at 20 hours by December 6:56you're going to be at 40 hours maybe. 6:58Right? Maybe it's not even like it's 7:00just it's going to be crazy. Are you 7:02able to delegate a week's worth of work? 7:03That is that is the question of 2026. We 7:06will all have to let go of a lot. We 7:10will have to let go of our traditional 7:11understandings about career progression. 7:13We'll have to let go of our traditional 7:14understandings about job families. What 7:16job families know and what they don't. 7:18We are going to have to be outcome 7:20obsessed, ownership obsessed. The work 7:23of the future is going to reward people 7:25who are ownership and outcome obsessed 7:27because that's where human value shows 7:29up. It's when we make sure that what's 7:31made is actually relevant for people, is 7:34actually useful, is actually good. It's 7:35not just vibecoded slop. There will be 7:37lots of vibecoded slop. In fact, I would 7:40expect it to 100x in 2026 because you 7:43can ask your agent to do a lot of 7:44terrible, terrible things. It's going to 7:46be up to you to decide that the agent's 7:49work is worth it. That you are assigning 7:50the agent and the agent is doing good 7:52work to get meaningful work done that 7:54compounds over time. The strategy 7:57rewards used to acrue to leaders. 8:00Strategy is now an individual thing 8:02because you are effectively a strategic 8:04manager of a team of agents or you will 8:06be in 2026. You can make them yourself. 8:09There will probably be startups that 8:10market them to you. But either way, you 8:13will end up with a team of agents 8:15working for you. Do you know how to 8:16manage them? Do you know how to lead 8:18them? Do you know how to drive them to 8:20develop compounding advantage over time? 8:22That used to be a question for directors 8:23and above. It's not for if now it's for 8:26everybody. Everyone will need to be able 8:27to do this and the people who can are 8:30going to look like they can do anything 8:32like that. The span is going to be 8:34incredible because they're able to 8:36leverage their own domain expertise and 8:39expand their scope of impact from there. 8:41I do not mean that you can do anything 8:45that requires deep domain expertise that 8:48you do not have. There are still going 8:50to be real value that you can't get to. 8:53there's going to be real value you can't 8:55get to just by adding agents. So, for 8:58example, if you are a lawyer and you 9:00have decades of experience, agents are 9:02going to transform the legal profession 9:04and how you work, but it it's not going 9:06to transform it to the point where I, as 9:08a non-awyer, can come in and do work for 9:11a white shoe law firm and get exactly 9:14the same quality of work done at the end 9:16as the lawyer who's got decades of 9:18experience. there's going to be a reward 9:21for understanding the business deeply 9:24that will show up in your ability to 9:26direct AI agents toward useful ends. And 9:28so as much as it may seem like I'm 9:30saying the agent can do work, we won't 9:32do any. What I'm really saying is our 9:34domain expertise is worth more and more, 9:37but boy do we have to be smart and 9:39leverage it really, really differently 9:41to get where we need to go in 2026. And 9:44that's going to change all of our skill 9:45sets. We're all going to have to learn 9:47together. We've never gone through this 9:48workflow and workforce transformation 9:50before. So, we're all going to have to 9:52just jump in and figure out how to do it 9:53together. But, I do think it's real. I 9:55do think it's coming. And I do think the 9:56key is that super exponential graph. 9:58Opus 4.5 was just the latest getting to 10:015 hours. It won't be the last. It's not 10:03like Claude has a special, you know, 10:05Claude doesn't have a special monopoly 10:07on this, right? We're going to see this 10:08from Gemini. We're going to see this 10:10from Chad GPT. We'll see it from other 10:12model makers as well. We will continue 10:14to see exponential gains from agent 10:16working time in 2026 and that will 10:19change the way all of us have to do our 10:21work.