Learning Library

← Back to Library

Super‑Exponential AI Timeline Explained

Key Points

  • MER, a nonprofit model‑evaluation and threat‑research group, tracks how long AI agents can perform tasks compared to humans, using success‑rate thresholds (50 % and 80 %).
  • Because the task‑relative metric has no upper limit, unlike fixed‑scope benchmarks, it reveals that AI progress is not merely exponential but super‑exponential.
  • The latest Opus 4.5 results show AI achieving roughly five hours of human‑equivalent work at a 50 % success rate (and 2,728 min at 80 %), indicating a doubling of capability roughly every 4–4½ months.
  • Projections suggest AI will handle 10 h of work by Q1, 20 h by mid‑year, and 40 h by year‑end, a self‑reinforcing flywheel that makes 2025 the “last normal year” and points to dramatically accelerated, AI‑driven development from 2026 onward.

Full Transcript

# Super‑Exponential AI Timeline Explained **Source:** [https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D) **Duration:** 00:10:23 ## Summary - MER, a nonprofit model‑evaluation and threat‑research group, tracks how long AI agents can perform tasks compared to humans, using success‑rate thresholds (50 % and 80 %). - Because the task‑relative metric has no upper limit, unlike fixed‑scope benchmarks, it reveals that AI progress is not merely exponential but super‑exponential. - The latest Opus 4.5 results show AI achieving roughly five hours of human‑equivalent work at a 50 % success rate (and 2,728 min at 80 %), indicating a doubling of capability roughly every 4–4½ months. - Projections suggest AI will handle 10 h of work by Q1, 20 h by mid‑year, and 40 h by year‑end, a self‑reinforcing flywheel that makes 2025 the “last normal year” and points to dramatically accelerated, AI‑driven development from 2026 onward. ## Sections - [00:00:00](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D&t=0s) **Super‑Exponential AI Timeline Explained** - The speaker clarifies MER’s benchmark graph, showing how AI agents increasingly outperform human task times without a ceiling, indicating a super‑exponential growth trend in AI capabilities. - [00:03:20](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D&t=200s) **Accelerating AI Task Delegation** - The speaker argues that swiftly learning to identify and assign high‑quality, week‑long tasks to AI will create a power‑law advantage in a super‑exponential future, leaving late adopters far behind. - [00:07:00](https://www.youtube.com/watch?v=X_EJi6yCuTM&pp=ugUEEgJlbg%3D%3D&t=420s) **Future Work: Managing AI Agents** - The speaker predicts 2026 will require workers to relinquish traditional career models and become outcome‑obsessed leaders of delegated AI agents. ## Full Transcript
0:00We are on the super exponential timeline 0:02for AI agents and I want to explain what 0:04that means and why it's super important 0:05that we all pay attention to it. MER is 0:08the model evaluation and threat research 0:12company. It's a nonprofit. It's 0:14dedicated to understanding how models 0:16perform and they are famous for 0:18producing a graph that shows how long 0:22models can do useful agentic work for at 0:24a time. It's a little bit of a confusing 0:26graph to understand, so I'm going to 0:27explain it really simply. Basically, 0:29they take a task and they measure how 0:32long a human takes to do that work task. 0:35And then they want to find out if the AI 0:37can do that task with at least a 50% 0:41likelihood of success. Why 50%, because 0:45they had to pick a number somewhere. 0:46They also measure it at 80%. And we'll 0:48get to that. PTR is important because it 0:52does not top out. And so if you have a 0:54lot of these these benchmarks like 0:56Swebench is an engineering one, it tops 0:58out at 100% and we're already in the 1:01like way way up at the top it doesn't 1:03matter like you can go from 91 to 93 and 1:06you don't really get a sense of how the 1:07models change. TR is different because 1:10that graph has no top end. It can just 1:12keep doing more and more work and that 1:14allows it to show super exponential 1:17progress. And one of the biggest debates 1:19of 2025 was are we on an exponential 1:23time scale with AI or are we on a super 1:26exponential where it's increasing faster 1:28than exponentially. It seems like we're 1:30on the super exponential trend line. And 1:32one of the things that made us think 1:33that is this latest result from Opus 4.5 1:36which shows over 4 hours 4 hours and 45 1:40minutes almost 5 hours of human 1:42equivalent work done at a 50% likelihood 1:45of success. Now the 80% mark is also 1:48measured and it is 2728 minutes for Opus 1:524.5 which you might think oh that's not 1:54that far but keep in mind it was not 1:56that long ago that we were 1 minute 2 1:58minute 10 minute 30 minutes and now 2:01we're up to almost 5 hours and that is 2:03the point of a super exponential curve. 2:06We are on a doubling rate every 4 to 4 2:10and 1/2 months right now. And so if the 2:12number is 50% complete, but the time 2:15horizon is four almost 5 hours, we're 2:18going to be at 10 hours by the end of 2:21Q1, we'll be at 20 hours by the end of 2:23Q2 into Q3, and we may be at 40 hours by 2:28the end of the year or past. And that is 2:31why we have to pay attention to this. 2:33Super exponential gains suggest that we 2:36have hit a selfreinforcing 2:39flywheel with AI. And that is indeed 2:41what we hear out of model makers and 2:43that is why 2025 was the last normal 2:47year. We are going to see really really 2:49weird progress from AI in 2026 and every 2:54year after because AI itself is starting 2:57to reinforce AI systems. We're bringing 2:59AI in to help train AI systems. Now that 3:02is going to become more and more 3:03automated. We are going to have 3:05capabilities that AI itself helps to 3:08grow speeding up the whole process and 3:10all of that is going to allow us to 3:12continue to make progress on these tough 3:15tasks that don't have an upper limit. 3:18And this matters because really our 3:20ability to do meaningful work is going 3:22to be determined by whether or not we 3:24can define useful high taste highquality 3:27work that an AI can do over a period of 3:29time. Do you have something for an AI 3:32that would take you a week to do? Maybe 3:34it's your taxes. I don't know. But 3:36that's going to increasingly become the 3:38question. And if you don't, then the 3:40question is going to be what does it 3:41take for you to get there? What does it 3:43take for you to gain the skill to assign 3:46that work? Because in a super 3:48exponential world, the skill we need to 3:50learn is also super exponential. The 3:53people who figure out how to assign 3:56agents work now in January and February 3:59and March are going to have a much 4:01easier time learning how to continue 4:03assigning agents work when the agents 4:06can do much harder stuff. Whereas if you 4:08wait and say, "I'm going to catch up. 4:09I've scheduled this for Q2 or Q3 next 4:12year. That's my AI quarter." Good luck 4:14with that. Like it doesn't work that 4:16way. There will be people who are 4:17running circles around you because they 4:18can assign their agents a week's worth 4:20of work. And once you can assign your 4:22agents a week's worth of work and spend 4:23up two or three of them, look at how 4:25much more productive that makes you. 4:26You're going to be running circles 4:27around people. And that that is the 4:30power law distribution world we're going 4:31to live in. Super exponentials create 4:33power laws. So power law is that the 4:35idea that the world we live in is not 4:38normally distributed. A normally 4:39distributed world, most people are on 4:41the average, a few people are on the 4:42tails. Einstein's way over here, right? 4:44But in a power law world, just a few 4:47people are going to be able to do a 4:50tremendous, tremendous amount. And it's 4:51not because they're necessarily going to 4:53have lots of money to do it. It's 4:55because they have the skills to do it. 4:57AI is going to disproportionately 5:01reward skill development where it's 5:03related to artificial intelligence and 5:06everything else. People are going to 5:08start to lose traction. If you are 5:11looking to make a dent in your career, I 5:14would look less in 2026 at your job 5:17family's traditional requirements and 5:19look more at where can an agent do a 5:22meaningful amount of work for a week in 5:24this traditional job family area and how 5:26can I make sure I set myself up so I 5:29know how to define and assign that work, 5:31know how to hold it accountable, know 5:32how to put good taste down so I know 5:34what excellent looks like in that work, 5:36know how to intervene, keep the agent on 5:38track, have the technical foundations 5:39necessary to define and set up an 5:41agentic system. This is going to become 5:43more and more relevant for all of us. 5:45The technical skill sets are going to 5:48spread across the job families. The 5:50non-technical skill sets are also going 5:52to spread across the job families. 5:54Engineers who traditionally just had to 5:55do code are going to have to have some 5:57business fluency and customer fluency 5:59now because they have to be the ones 6:00with good taste when they're 6:02architecting systems. And frankly, they 6:03now have to architect systems that 6:05non-technical people can contribute code 6:07to. So just that one shift, that ability 6:11of agents to do work over time is going 6:15to multiply the impacts across all of 6:18the rest of us. Having agents that work 6:20longer means all of our jobs are going 6:23to change forever. And you might think 6:26I'm like a hype person. This is not 6:29being me being hypy. This is me just 6:31talking about the reality that we are on 6:33a super exponential curve. Humans are 6:36bad at estimating super exponential 6:38curves. And so I just want to make it 6:39really concrete for you. I do think 6:40there is no way that work will not 6:43change for everybody if we are in a 6:45place where it's 5 hours and doubling 6:48every four months. Because you look at 6:50it by April you're going to be at 10 6:52hours. By what July September you're 6:54going to be at 20 hours by December 6:56you're going to be at 40 hours maybe. 6:58Right? Maybe it's not even like it's 7:00just it's going to be crazy. Are you 7:02able to delegate a week's worth of work? 7:03That is that is the question of 2026. We 7:06will all have to let go of a lot. We 7:10will have to let go of our traditional 7:11understandings about career progression. 7:13We'll have to let go of our traditional 7:14understandings about job families. What 7:16job families know and what they don't. 7:18We are going to have to be outcome 7:20obsessed, ownership obsessed. The work 7:23of the future is going to reward people 7:25who are ownership and outcome obsessed 7:27because that's where human value shows 7:29up. It's when we make sure that what's 7:31made is actually relevant for people, is 7:34actually useful, is actually good. It's 7:35not just vibecoded slop. There will be 7:37lots of vibecoded slop. In fact, I would 7:40expect it to 100x in 2026 because you 7:43can ask your agent to do a lot of 7:44terrible, terrible things. It's going to 7:46be up to you to decide that the agent's 7:49work is worth it. That you are assigning 7:50the agent and the agent is doing good 7:52work to get meaningful work done that 7:54compounds over time. The strategy 7:57rewards used to acrue to leaders. 8:00Strategy is now an individual thing 8:02because you are effectively a strategic 8:04manager of a team of agents or you will 8:06be in 2026. You can make them yourself. 8:09There will probably be startups that 8:10market them to you. But either way, you 8:13will end up with a team of agents 8:15working for you. Do you know how to 8:16manage them? Do you know how to lead 8:18them? Do you know how to drive them to 8:20develop compounding advantage over time? 8:22That used to be a question for directors 8:23and above. It's not for if now it's for 8:26everybody. Everyone will need to be able 8:27to do this and the people who can are 8:30going to look like they can do anything 8:32like that. The span is going to be 8:34incredible because they're able to 8:36leverage their own domain expertise and 8:39expand their scope of impact from there. 8:41I do not mean that you can do anything 8:45that requires deep domain expertise that 8:48you do not have. There are still going 8:50to be real value that you can't get to. 8:53there's going to be real value you can't 8:55get to just by adding agents. So, for 8:58example, if you are a lawyer and you 9:00have decades of experience, agents are 9:02going to transform the legal profession 9:04and how you work, but it it's not going 9:06to transform it to the point where I, as 9:08a non-awyer, can come in and do work for 9:11a white shoe law firm and get exactly 9:14the same quality of work done at the end 9:16as the lawyer who's got decades of 9:18experience. there's going to be a reward 9:21for understanding the business deeply 9:24that will show up in your ability to 9:26direct AI agents toward useful ends. And 9:28so as much as it may seem like I'm 9:30saying the agent can do work, we won't 9:32do any. What I'm really saying is our 9:34domain expertise is worth more and more, 9:37but boy do we have to be smart and 9:39leverage it really, really differently 9:41to get where we need to go in 2026. And 9:44that's going to change all of our skill 9:45sets. We're all going to have to learn 9:47together. We've never gone through this 9:48workflow and workforce transformation 9:50before. So, we're all going to have to 9:52just jump in and figure out how to do it 9:53together. But, I do think it's real. I 9:55do think it's coming. And I do think the 9:56key is that super exponential graph. 9:58Opus 4.5 was just the latest getting to 10:015 hours. It won't be the last. It's not 10:03like Claude has a special, you know, 10:05Claude doesn't have a special monopoly 10:07on this, right? We're going to see this 10:08from Gemini. We're going to see this 10:10from Chad GPT. We'll see it from other 10:12model makers as well. We will continue 10:14to see exponential gains from agent 10:16working time in 2026 and that will 10:19change the way all of us have to do our 10:21work.