AI's Self-Improvement: Analysis of Anthropic & OpenAI Risks

Are we really panicking about AI taking our jobs, or are we missing the far more unsettling possibility that it’s about to take over its own development? Last month’s protests in San Francisco, complete with “Don’t Build Skynet” banners aimed at Anthropic, OpenAI, and xAI, weren’t just about existential dread – they were a clumsy, public expression of a fear rapidly solidifying within Silicon Valley itself: the rise of AI that can design even better AI, and do it at an accelerating pace. The real story here isn't the threat of robots replacing us – it’s the prospect of them replacing the people building them.

The frenzy isn’t about general artificial intelligence (AGI) anymore, though that’s still the ultimate goal. It’s about recursive self-improvement, a concept floated by statistician I.J. Good in the 1960s, now suddenly feeling less like science fiction and more like a looming engineering problem. For decades, the idea of a machine capable of designing its own successors was a thought experiment. Just a few years ago, with ChatGPT struggling with basic arithmetic, the notion of AI conducting world-class machine learning research was frankly laughable. Now, companies are openly bragging about automating their own research and development. OpenAI aims to launch an “intern-level AI research assistant” within six months, and Anthropic claims a staggering 90 percent of its code is already written by Claude.

This isn’t about AI suddenly becoming sentient and deciding to overthrow humanity. It’s about efficiency. AI research is a slog – curating datasets, running endless experiments, tweaking parameters. Coding bots, like Google DeepMind’s AlphaEvolve, are already delivering incremental gains, improving data center efficiency by 0.7 percent and cutting Gemini’s training time by 1 percent. These numbers might seem small, but consider the scale: Google’s global data center fleet is enormous, and even a 1 percent reduction in training time for a model like Gemini translates to significant cost savings and faster iteration. Dario Amodei, CEO of Anthropic, estimates these tools boost his company’s workflows by 15 to 20 percent. The incentive to automate is simply too strong to ignore.

However, the details remain frustratingly opaque. When Anthropic states that Claude writes almost all of its code, what level of human oversight is involved? A recent podcast featuring Jack Clark, the company’s head of policy, revealed a priority to “better understand the extent to which we are automating aspects of A.I. development,” suggesting even they aren’t entirely sure. OpenAI’s “intern” is similarly vague, described as assisting with literature reviews and experiment interpretation. This lack of transparency isn’t necessarily malicious, but it highlights a critical tension: companies are racing ahead, touting progress, while simultaneously downplaying the potential risks and obscuring the specifics of their methods. It’s a classic Silicon Valley move – move fast and ask forgiveness later.

Source material: theatlantic.com.

The real danger isn’t that AI will suddenly develop “research taste” – that elusive mix of creativity and judgment exhibited by top software engineers – overnight. It’s the compounding effect of these incremental improvements. Nick Bostrom, a leading AI risk philosopher, notes we’re “starting to see AI progress feed back on itself.” Instead of waiting months for breakthroughs, we might see them weekly, then daily. Eli Lifland at the AI Futures Project forecasts full automation of AI R&D by 2032, building on the rapid acceleration of capabilities. A few years ago, AI could handle tasks taking a human seconds; now it autonomously completes tasks taking hours. Neev Parikh of METR doesn’t “expect a reason for it to slow down.” This isn’t necessarily a prediction of doom, but a recognition that the rate of change is becoming increasingly difficult to predict, let alone control.

Even if recursive self-improvement remains a distant dream, these marginal gains will accelerate AI development, altering the competitive landscape and potentially reshaping global power dynamics. As former Trump advisor Dean Ball points out, governments and civil society are already woefully behind. We’re still grappling with the implications of the internet, while the IRS relies on COBOL, a programming language from 1960. Regulations, safety protocols, and even basic understanding of the technology are lagging far behind the pace of innovation. Bostrom himself has shifted from “fretful optimist” to “moderate fatalist,” a telling sign of the growing unease within the field. A recent academic study interviewing researchers at leading AI labs found that 20 out of 25 identified the automation of AI research as a “most severe and urgent” risk.

The hype, as always, is a distraction. The protests, the bold claims from tech CEOs, the breathless media coverage – it all serves to obscure the fundamental shift underway. We’re not just building smarter machines; we’re building machines that can build even smarter machines, and we have very little idea what that will look like. Watch for a surge in “AI safety” initiatives, not as genuine attempts to mitigate risk, but as PR maneuvers designed to deflect scrutiny and maintain the illusion of control. The critical question isn’t if AI will automate its own development, but when – and whether we’ll even recognize it happening until it’s too late.