March 10, 2026: The AI Workforce Is Here. Are You Ready to Manage It?

AI agents now run experiments overnight and get corporate IDs. The shift from tools to managed workforce is happening. Here's what leaders need to know.

Today’s Key AI Stories

Andrej Karpathy's 'autoresearch': A new open-source project lets AI agents run hundreds of experiments overnight to improve themselves, automating the scientific method.
Microsoft's Agent 365: Microsoft launches a new product to manage and secure enterprise AI agents, warning of "double agents" and giving them unique corporate identities.
Anthropic's New Tools & Troubles: Anthropic launched a multi-agent code review system while simultaneously suing the U.S. government over a "supply chain risk" designation related to military use of its AI.
OpenAI Acquires Promptfoo: OpenAI buys an AI security and testing platform to integrate automated red-teaming directly into its enterprise products.
The AI "Bubble" Debate: A new analysis argues AI isn't a typical tech bubble because it's the first technology to scale cognition, fundamentally changing the nature of knowledge work.
The Rise of AI Evaluation Tools: Google's Stax and other new frameworks signal a shift away from subjective "vibe testing" to rigorous, data-driven evaluation of AI models.
NVIDIA's Infrastructure Push: NVIDIA continues to build the foundational layers for AI with CUDA 13.2 and the NIXL library, enhancing performance for training and large-scale inference.
UK Announces Sovereign AI Fund: The UK government is launching a £500 million fund to build domestic AI computing infrastructure, aiming for technological self-reliance.

The AI Workforce Is Here. Are You Ready to Manage It?

For the past few years, we talked about AI in the abstract. It was a tool. A chatbot. A co-pilot. Something we *used*. That era is over. Today, the conversation changed. Radically. We are no longer just using AI. We are starting to manage it. As a workforce.

This isn't a prediction. It's happening right now. Andrej Karpathy, a key mind from OpenAI and Tesla, just released a script that lets AI agents improve themselves while we sleep. And Microsoft just announced a product to give these agents corporate ID numbers and security policies. The age of the AI agent is here. They are not just assistants anymore. They are becoming autonomous employees.

Part 1: The Automation of Innovation

Let's start with Andrej Karpathy's new project. He calls it 'autoresearch'. It's a simple script. Just 630 lines of code. But its ambition is massive. It automates the scientific method for AI. You give an AI agent a goal. A budget of computing time. And you let it run.

The agent reads its own code. It forms a hypothesis. 'What if I change this parameter?' It then modifies the code. Runs an experiment. And checks the results. If it works, it keeps the change. If not, it reverts and tries again. All by itself.

In one overnight run, Karpathy's agent ran 126 experiments. In two days, it made 700 autonomous changes. It found about 20 improvements that made a larger model 11% more efficient. Karpathy himself said the agent found things he had missed over two decades of work. This is not a productivity hack. This is a fundamental shift. We are now seeding ecosystems that learn and improve on their own. The AI researcher that never sleeps.

Part 2: The Agent Gets a Corporate ID

If Karpathy's project shows what agents can *do*, Microsoft's announcement shows how they will be *managed*. Microsoft just launched 'Agent 365'. They call it the "control plane for agents." It is, essentially, an IT department for your AI workforce.

The language from Microsoft is striking. They warn of AI agents becoming "double agents." These are agents that get hijacked through prompt injection or other attacks. They could start working against your company's interests. The solution? Treat them like employees. Agent 365 gives each AI agent a unique identity in Microsoft Entra, the same system that manages human employee access. They get security policies. Their access is monitored. Their behavior is audited.

This is not a feature. This is new infrastructure. And it comes with a price tag. The full 'Frontier Worker Suite' costs $99 per user, per month. Microsoft is betting that companies will soon 'hire' AI agents, and they'll need to pay for the management tools to keep them secure and productive. The era of the digital employee is being priced and packaged.

Anthropic is also moving this way. Their new 'Code Review for Claude Code' uses multiple AI agents that work together. They find bugs, verify each other's work, and rank issues by severity. It’s an AI team performing a complex knowledge task.

Part 3: The New Tools of Management

You can't manage a workforce without the right tools for quality control and security. This is where the AI industry is rapidly maturing. We're seeing a wave of new tools designed to professionalize AI development.

Take OpenAI's latest move. They just acquired Promptfoo. Promptfoo is a platform for testing and securing AI systems. It helps developers find vulnerabilities *before* an AI agent is deployed. OpenAI is integrating this directly into their enterprise platform. Think of it as a built-in background check and safety training for your AI agents.

Then there's the problem of performance reviews. How do you know if one AI model is actually better than another? For too long, developers relied on "vibe testing." It just *feels* better. That's not good enough for business. Google's new toolkit, Stax, is designed to fix this. It lets you test models against your own criteria. With your own data. It replaces subjective feelings with hard metrics. It's the performance management system for your AI.

Part 4: This Isn't a Bubble, It's a Restructuring

Many people see the flood of money and hype and shout "Bubble!" They are pattern-matching to the dot-com crash. But this comparison may be a mistake. As one analyst put it today, previous technologies extended human capability, but they didn't replace human cognition. The internet moved information, but humans decided what it meant. AI is different. It performs cognitive work. It can reason, hypothesize, and learn.

The constraint is no longer just about hiring smart people. It's about scaling intelligence itself. The new bottleneck is judgment. What problems are worth solving? What questions are worth asking?

This shift is what makes today's news so profound. We are not just building better tools. We are building a new kind of workforce. One that learns on its own. One that needs to be managed, secured, and evaluated like any other employee. The companies that win won't just be the ones that *use* AI. They will be the ones that learn how to *lead* it.