Trends in AI Task Duration


  • View profile for Gajen Kandiah

    AI-First CEO | Scaling Global Tech | Ex-President & COO, Hitachi Digital

    20,622 followers

    One fiscal quarter is the equivalent of a full year of AI agent advancement. An 18-month roadmap equates to about a decade of capability shifts. And systems designed for today’s AI may already be outdated by the time they go live.

    These are some of the insights extrapolated from a new research paper from METR, “Measuring AI Ability to Complete Long Tasks,” which may provide the clearest explanation yet of how rapidly AI agents are transforming the nature of knowledge work. https://lnkd.in/eB8KPNtb

    Instead of narrow benchmarks, the study asks a broader and more useful question: How long can an AI system work on a real-world task before it breaks down? The answer: the “task completion horizon” has been doubling every 3 to 7 months since 2019. If your planning frameworks are built around human time scales, it’s worth recognizing that AI is evolving in dog years, or faster.

    This presents a strategic challenge most enterprise leaders haven’t encountered before: We are being asked to plan for something that is unknowable in its specifics, but inevitable in its trajectory. There’s no steady-state to optimize around. No predictable plateau. There’s just an exponential curve that is already reshaping what’s possible in software development, cybersecurity, reasoning, and long-horizon task automation. The temptation to wait for maturity is understandable, but with this rate of change, waiting creates risk, and inaction becomes a liability.

    So what’s the alternative? Enterprises that thrive in this environment will embrace adaptive strategy, grounded in action today and built for flexibility tomorrow:

      • Design workflows and systems that can scale with rising agent capabilities
      • Rethink governance as a dynamic, living framework
      • Embed feedback loops, experimentation, and modularity
      • Focus on readiness, not perfection

    The METR team is careful not to overstate the trend’s longevity, but the current data is clear: AI agents are now reliably completing hour-long tasks. Full-day or week-long task automation is no longer speculative; it’s within reach.

    We may not know the precise timeline. But we know where we’re headed. And we know the speed is unfamiliar. AI’s future is unknowable. Its impact is inevitable. And it’s unfolding on a clock most organizations weren’t built to manage. The question isn’t whether to act. It’s whether your organization can learn, adapt, and lead at the speed of change.

    #AI #GenAI #EnterpriseAI #AIAgents #DigitalTransformation #Leadership #Strategy #AIReadiness #FutureOfWork #MooresLaw #UnknowableButInevitable
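
The planning arithmetic behind a doubling trend can be checked with a few lines of Python. This is a minimal sketch, assuming the task horizon grows as a clean exponential with a fixed doubling time; the function name `growth_factor` is hypothetical, and the 3- and 7-month values are simply the two ends of the range quoted in the post, not figures taken from the paper itself.

```python
def growth_factor(elapsed_months: float, doubling_months: float) -> float:
    """Return how many times longer the task horizon becomes after
    elapsed_months, assuming it doubles every doubling_months (assumption:
    a fixed, constant doubling time)."""
    if doubling_months <= 0:
        raise ValueError("doubling_months must be positive")
    return 2.0 ** (elapsed_months / doubling_months)


if __name__ == "__main__":
    # Compare one fiscal quarter (3 months) and an 18-month roadmap
    # under the fast and slow ends of the quoted doubling range.
    for doubling in (3, 7):
        per_quarter = growth_factor(3, doubling)
        per_roadmap = growth_factor(18, doubling)
        print(
            f"doubling every {doubling} months: "
            f"x{per_quarter:.2f} per quarter, x{per_roadmap:.1f} over 18 months"
        )
```

Under these assumptions the spread between the two ends of the range is large: over 18 months a 3-month doubling time compounds to 64x, while a 7-month doubling time compounds to a little under 6x, so any roadmap built on this trend is sensitive to which doubling time is assumed.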

  • View profile for John Bailey

    Strategic Advisor | Investor | Board Member

    15,692 followers

    Researchers at METR @METR just published a new paper that shows that the length of tasks AI agents can complete autonomously has been doubling every 7 months since 2019, essentially revealing a "Moore's Law" of sorts that can help us better understand the trajectory of AI capabilities.

    Key Takeaways:

      - To measure AI progress in a way that compares to humans, the study introduces a new metric: the 50%-task-completion time horizon. This represents the longest task an AI agent can complete correctly half the time, based on how long it usually takes a human expert to finish the same task.
      - AI’s ability to complete long, complex tasks has been doubling every 7 months since 2019.
      - If this trend continues, AI agents could independently handle tasks that take humans a month by 2028-2031.
      - The biggest drivers of improvement: Better reasoning, tool use, and adaptability, not just bigger models. It will be interesting to see how approaches like OpenAI and Google's Deep Research impact this.
      - AI still struggles with messy, real-world tasks that require intuition, judgment, and seeking out missing information.

    Paper: https://lnkd.in/d7bW6RTV
    METR thread: https://lnkd.in/dp4-Y64v
    Great thread on the background of the paper from Elizabeth (Beth) Barnes: https://lnkd.in/dfuvi6ZY
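
A rough way to see where a date range like 2028-2031 could come from is to solve the doubling formula for time: months = doubling_time × log2(target / current). The sketch below is illustrative only. The 1-hour starting horizon, the 40-hour work week, and the 167-hour work month are assumptions chosen for the example, not values quoted from the paper, and `months_to_reach` is a hypothetical helper name.

```python
import math


def months_to_reach(current_hours: float, target_hours: float,
                    doubling_months: float) -> float:
    """Months until the task horizon grows from current_hours to
    target_hours, assuming it doubles every doubling_months."""
    if current_hours <= 0 or target_hours <= 0 or doubling_months <= 0:
        raise ValueError("all arguments must be positive")
    return doubling_months * math.log2(target_hours / current_hours)


if __name__ == "__main__":
    CURRENT_HOURS = 1.0       # assumption: roughly hour-long tasks today
    WORK_WEEK_HOURS = 40.0    # assumption: one human work week
    WORK_MONTH_HOURS = 167.0  # assumption: one human work month
    DOUBLING_MONTHS = 7.0     # doubling time quoted in the post

    for label, target in (("week", WORK_WEEK_HOURS), ("month", WORK_MONTH_HOURS)):
        months = months_to_reach(CURRENT_HOURS, target, DOUBLING_MONTHS)
        print(f"{label}-long tasks: about {months:.0f} months ({months / 12:.1f} years)")
```

With these assumed inputs, month-long tasks come out at roughly 52 months, a bit over four years, from the starting point. The result shifts with the assumed starting horizon and doubling time, which is one reason such forecasts are given as a range rather than a single year.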

  • View profile for Mrukant Popat

    💥 Igniting Innovation in Engineering | CTO | AI / ML / Computer Vision, OS - operating system, Platform firmware | 100M+ devices running my firmware

    5,101 followers

    AI’s Ability to Complete Long Tasks is Doubling Every 7 Months

    A groundbreaking study reveals that AI models are rapidly improving in their ability to autonomously complete long, complex tasks. The length of tasks (measured by how long they take human experts) that generalist AI agents can complete with 50% reliability has been doubling every ~7 months for the last 6 years. If this exponential trend continues, AI could independently complete week-long tasks within 2-4 years, a shift that could redefine automation, productivity, and workforce dynamics.

    🔹 Key insights:

      ✅ Current AI models can solve short, structured tasks but struggle with substantive projects.
      ✅ Performance has skyrocketed on benchmarks, yet real-world utility remains limited.
      ✅ Measuring AI by the length of tasks it can complete provides a better forecast of its true capabilities.
      ✅ If trends hold, AI capable of autonomously handling month-long projects could emerge before 2030.

    The implications are massive, from software development to executive assistance to scientific research. But with great power comes great responsibility. How do we prepare for an AI-driven future where machines can handle complex, multi-week tasks?

    📄 Full paper & GitHub repo in comments.

    💡 Are we ready for AI that works on projects lasting weeks or even months?

    #AI #Automation #ArtificialIntelligence #FutureOfWork #MachineLearning #TechTrends
