Anthropic Proposes AI Pause Mechanism: Why Developers Are Talking About Control Over Self-Improving Models
Anthropic believes that leading artificial intelligence developers should agree in advance on a mechanism for slowing down or temporarily stopping the development of AI systems. Such a step may be needed if models begin improving themselves faster than society can control them. This was reported by Reuters. Time for Action looked into why this statement matters not only for the technology sector. This is about an issue that is becoming increasingly difficult to postpone: who should control the development of systems capable of taking part in creating their own next versions, and how that control should work.
AI Is No Longer Just a Tool
Anthropic warns that the emergence of systems capable of independently creating their own improved versions could become one of the most important technological breakthroughs in history. But along with this, the risk increases that humans could lose real control over the speed of such development. The problem is not the mere appearance of more powerful AI. The problem is that technology may begin developing faster than companies, governments, and society can assess the consequences. If models can write code, find solutions for their own improvement, and speed up the work of engineers, ordinary internal checks may no longer be enough. That is why Anthropic is not talking about an emotional ban, but about a mechanism prepared in advance. The company proposes defining the conditions under which the development of models could be slowed down or temporarily stopped.
Why One Company Will Not Be Able to Stop the Risks
Anthropic separately emphasizes that a unilateral pause by one developer would change almost nothing. If one laboratory suspends its work while others continue development, the overall level of safety will not improve. That is why the company proposes a coordinated agreement between several major AI laboratories. Such an agreement should clearly define three things:
- under what conditions a slowdown or temporary pause is introduced;
- what must happen for development to be resumed;
- who will monitor the implementation of these rules, and how.
This is an important detail. Anthropic effectively acknowledges that without common rules, competition may push companies to move faster even when risks are already becoming obvious.
The Claude Figure Shows How Quickly Development Is Changing
As an example, Anthropic cited its own experience. As of May, more than 80% of the code added to the company’s codebase was written by its Claude model. This fact shows that AI is already deeply involved in the process of creating software. The model does not only respond to user requests, but also helps build the company’s own digital infrastructure. This creates a new level of responsibility. Code written by AI must be checked not only for whether it works. It is important to understand whether it contains hidden errors, vulnerabilities, or decisions that may reveal themselves later. The more models are involved in development, the more important monitoring, testing, and control of their behavior become.
The Issue Goes Beyond Business
Anthropic plans to hold consultations involving politicians, scientists, civil society organizations, and other AI developers. Among the topics for discussion are the risks of artificial intelligence self-improvement and the creation of international coordination mechanisms in the field of AI safety. This means that the issue can no longer be left only inside technology companies. AI affects programming, cybersecurity, government processes, the judicial system, elections, and the work of democratic institutions. If such systems become more powerful, the rules for their development must be discussed not only by those who create them. This is also connected to Anthropic’s creation of a new AI & Rule of Law team. The division will study the impact of artificial intelligence on democratic institutions, the judicial system, elections, and the rule of law.
Why Anthropic’s Position Carries Weight
Anthropic is an American artificial intelligence company founded in 2021 by former OpenAI researchers. Its main product is the Claude chatbot. According to the company itself, Claude is the first model available on the world’s three largest cloud platforms: Amazon Web Services, Google Cloud, and Microsoft Azure. AWS remains Anthropic’s main cloud partner. The company is also working on Mythos-class models, which are capable of automatically finding critical vulnerabilities in software code. Such systems can be useful for cybersecurity, but at the same time they require especially strict control. If a model is capable of finding weak points in code, it is important to understand who has access to such capabilities and how they are used. In early June, Anthropic also filed a confidential statement with the U.S. Securities and Exchange Commission for an initial public offering. This adds another dimension to the issue major AI companies are dealing with safety risks, investors, government structures, and the global market at the same time.
The Main Question Is Not Speed, but Control
Anthropic’s statement shows that AI developers are already preparing for a stage where the main problem will not only be the creation of more powerful models. The question is whether humans will be able to control systems that are increasingly involved in their own improvement. If rules appear only after a crisis, they may come too late. That is why Anthropic proposes agreeing in advance: defining red lines, the conditions for a pause, the criteria for returning to work, and oversight mechanisms. This discussion is not about future fiction, but about the real direction in which the industry is already moving. AI writes code, helps build software systems, takes part in finding vulnerabilities, and becomes part of technological production. That is why safety can no longer be reduced to how well a chatbot answers a user. The stakes are much higher whether companies, governments, and society will be able to agree on rules before AI development becomes faster than the ability to control it.












