We’re excited to introduce two highly effective improvements in Azure AI Foundry.
AI brokers are reworking industries by automating workflows, enhancing productiveness, and enabling clever decision-making. Companies are leveraging AI brokers to course of insurance coverage claims, handle IT service desks, optimize provide chain logistics, and even help healthcare professionals in analyzing medical information. The potential is huge, and we’re excited to introduce two highly effective improvements in Azure AI Foundry:
- Responses API: A robust API enabling AI-powered purposes to retrieve info, course of knowledge, and take motion seamlessly.
- Pc-Utilizing Agent (CUA): A breakthrough AI mannequin that navigates software program interfaces, executes duties, and automates workflows.
Collectively, these capabilities empower companies to reimagine AI not simply as an assistant—however as an lively digital workforce. Enterprise prospects will quickly acquire entry to those improvements driving automation, effectivity, and intelligence at scale.
Enhancing AI Brokers with the Responses API
The Responses API is the important thing to unlocking agentic AI in Azure AI Foundry, reworking how enterprises harness AI for real-world influence. It’s the new basis for leveraging Azure OpenAI Service’s highly effective built-in instruments, combining the simplicity of the Chat Completions API with the superior capabilities out there by way of Assistants API and Azure AI Agent Service. The Responses API allows seamless interplay with instruments like CUA, code interpreter, perform calling, and file search—all in a single API name. This API allows AI programs to retrieve knowledge, course of info, and take actions—seamlessly connecting agentic AI with enterprise workflows.
How the Responses API Works
The Responses API supplies a structured response format that permits AI to work together with a number of instruments whereas sustaining context throughout interactions. It helps:
- Software calling in a single easy API name: Now, builders can seamlessly combine AI instruments, making execution extra environment friendly.
- Pc use: Use the pc use device inside the Responses API to drive automation and execute software program interactions.
- File search: Work together with enterprise knowledge dynamically and extract related info.
- Code interpreter: Create and execute Python code effortlessly inside AI-powered purposes.
- Operate calling: Develop and invoke customized capabilities to boost AI capabilities.
- Chaining responses into conversations: Preserve monitor of interactions by linking responses collectively utilizing distinctive response IDs, guaranteeing continuity in AI-driven dialogues.
- Enterprise-grade knowledge privateness: Constructed with Azure’s trusted safety and compliance requirements, guaranteeing knowledge safety for organizations.
By consolidating retrieval, reasoning, and motion execution right into a single API, the Responses API simplifies AI agent growth, decreasing the complexity of orchestrating a number of AI instruments inside an automation pipeline.
This scalability makes it well-suited for enterprise use circumstances throughout industries reminiscent of customer support, IT operations, finance, and provide chain administration, the place AI-powered automation can streamline workflows and enhance effectivity. For even better flexibility and management, organizations can discover Azure AI Agent Service, which presents further instruments and fashions for growing and scaling AI brokers. Azure AI Agent Service integrates with Semantic Kernel and AutoGen, enabling seamless multi-agent orchestration for extra complicated situations requiring a number of brokers to collaborate on duties.
Empowering AI Brokers with the Pc-Utilizing Agent
The Pc-Utilizing Agent (CUA) is a specialised AI mannequin in Azure OpenAI Service that permits AI to work together with graphical consumer interfaces (GUIs), navigate purposes, and automate multi-step duties—all by way of pure language directions. In contrast to conventional automation instruments that depend on predefined scripts or API-based integrations, CUA can interpret visible parts, adapt dynamically, and take motion primarily based on on-screen content material.
What makes the Pc-Utilizing Agent distinctive?
- Autonomous UI navigation: Can open purposes, click on buttons, fill out types, and navigate multi-page workflows.
- Dynamic adaptation: Interprets UI adjustments and adjusts actions accordingly, decreasing reliance on inflexible automation scripts.
- Cross-application job execution: Operates throughout web-based and desktop purposes, integrating disparate programs with out API dependencies.
- Pure language command interface: Customers can describe a job in plain language, and CUA determines the proper UI interactions to execute.
With in the present day’s announcement, builders can begin constructing further agentic capabilities instantly with CUA. As enterprises look to deploy this expertise at scale, we’re evaluating integration with Home windows 365 and Azure Digital Desktop to allow CUA automation to run seamlessly in a managed host surroundings on Cloud PCs or digital machines (VMs), guaranteeing constant efficiency whereas sustaining enterprise compliance and safety requirements.
Making certain safe and reliable AI automation
As AI programs grow to be extra autonomous, guaranteeing safety, reliability, and alignment with human intent is vital. The CUA mannequin is likely one of the first agentic AI fashions able to straight interacting with software program environments, bringing new challenges in misuse prevention, unintended actions, and adversarial dangers. To handle these, Microsoft and OpenAI have applied a multi-layered security method spanning the mannequin, system, and deployment ranges.
The CUA mannequin is developed with safeguards to refuse dangerous duties, reject unauthorized actions, and forestall misuse. On the system stage, Microsoft implements enterprise-grade content material filtering and execution monitoring to assist detect and forestall coverage violations. To attenuate unintended actions, CUA is designed to request consumer confirmations earlier than executing irreversible duties and to limit high-risk actions reminiscent of monetary transactions.
Microsoft’s Reliable AI framework additional ensures real-time observability, logging, and compliance auditing for enterprise deployments. Automated and human-in-the-loop detection programs monitor execution patterns, figuring out anomalous behaviors and imposing governance insurance policies. These safeguards are constantly refined primarily based on inside red-teaming, exterior audits, and real-world testing to strengthen safety in opposition to immediate injections, adversarial manipulations, and unauthorized entry. Given the present reliability stage of the CUA mannequin—notably in non-browser environments—human oversight stays strongly really helpful for delicate operations.
As AI brokers evolve, Microsoft is dedicated to transparency, safety, and ongoing threat mitigation. By combining CUA’s built-in safeguards with Azure’s enterprise compliance and governance instruments, organizations can deploy AI-powered automation with confidence, guaranteeing secure and accountable AI adoption at scale.
Getting began with CUA and Responses API
Azure AI Foundry continues to push the boundaries of AI-powered automation. Enterprise prospects will acquire entry to the Responses API and CUA in Azure OpenAI Service within the coming weeks.
We’re excited to see how builders and companies innovate with these new capabilities.