Anthropic's new hybrid AI model can work on tasks autonomously for hours at a time

#ai

Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time | MIT Technology Review

The company claims its ability to tackle complex, multistep problems paves the way for much more proficient AI agents.

technologyreview.com

TL;DR: Anthropic’s latest duo—Claude Opus 4 and Claude Sonnet 4—push AI agents from hand-held helpers to semi-autonomous problem-solvers. Opus 4 can juggle thousands of steps over hours (it even mapped out a Pokémon Red guide after 24 hours of gameplay), thanks to beefed-up “memory files” that let it recall and act on info far longer than its predecessor. One customer even let it code for nearly seven hours straight on a complex open-source project.

Sonnet 4, by contrast, is the lean, free-tier-friendly sibling: still hybrid (quick replies or deep dives on demand), still tool-savvy (web searches, plugins, etc.), but built for everyday tasks. Both models boast a 65% drop in reward-hacking (no weird shortcuts), marking a step toward safer, more reliable AI agents that can actually get things done without you breathing down their neck.

Future

Anthropic's new hybrid AI model can work on tasks autonomously for hours at a time

Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time | MIT Technology Review

Top comments (0)