Close Menu
    What's Hot
    Weekly Breaking AI News

    Weekly AI News: Global AI Developments from June 7–13, 2026

    By Art RyanJune 14, 20260

    This Weekly AI News June 7–13 2026 round-up covers the most important artificial intelligence developments…

    KKR, Kuwait, NVIDIA and Vistra Launch $10 Billion AI Infrastructure Venture Helix

    June 14, 2026

    UAE Launches First National Experts Programme AI Cohort to Strengthen Future AI Leadership

    June 14, 2026

    Xiaomi Launches MiMo Code, an Open-Source AI Coding Assistant Built to Remember Longer Projects

    June 14, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Breaking AI News
    Sunday, June 14
    • Home
    • Events
    • Videos
      • Machine Can Think Summit 2026
      • Step Dubai Conference 2026
    • Technology & Innovation

      KKR, Kuwait, NVIDIA and Vistra Launch $10 Billion AI Infrastructure Venture Helix

      June 14, 2026

      UAE Launches First National Experts Programme AI Cohort to Strengthen Future AI Leadership

      June 14, 2026

      Xiaomi Launches MiMo Code, an Open-Source AI Coding Assistant Built to Remember Longer Projects

      June 14, 2026

      Meta AI Transformation: Zuckerberg Admits Mistakes as Company Pushes Deeper Into AI

      June 14, 2026

      NVIDIA Blackwell Sets New Standard for Agentic AI Performance

      June 14, 2026
    • Business & Marketing

      KKR, Kuwait, NVIDIA and Vistra Launch $10 Billion AI Infrastructure Venture Helix

      June 14, 2026

      Meta AI Transformation: Zuckerberg Admits Mistakes as Company Pushes Deeper Into AI

      June 14, 2026

      British Gas AI Job Cuts Spark Debate Over Automation in Customer Service

      June 13, 2026

      Anthropic’s Vertical AI Tools Could Shake Up the Enterprise AI Market

      June 13, 2026

      Jeff Bezos’ AI Startup Prometheus Raises $12 Billion to Build the Future of Engineering

      June 13, 2026
    • Industry Applications

      Xiaomi Launches MiMo Code, an Open-Source AI Coding Assistant Built to Remember Longer Projects

      June 14, 2026

      Anthropic’s Vertical AI Tools Could Shake Up the Enterprise AI Market

      June 13, 2026

      Anthropic Launches Claude Corps, a $150M AI Fellowship for U.S. Nonprofits

      June 13, 2026

      Dubai Plans to Equip 295,000 Companies With Agentic AI Within Two Years

      June 12, 2026

      UAE Launches 90-Day Agentic AI Sprint Across 50 Federal Entities

      June 12, 2026
    • Trends & Insights

      NVIDIA Blackwell Sets New Standard for Agentic AI Performance

      June 14, 2026

      NVIDIA Confidential Computing Helps Apple Expand Private Cloud Compute for Apple Intelligence

      June 12, 2026

      Rio Aims to Become Latin America’s Next AI Capital as Web Summit Rio Opens

      June 10, 2026

      Anthropic Launches Claude Fable 5, Its Most Powerful Public AI Model Yet

      June 10, 2026

      China’s $295B AI Infrastructure Push Targets Quantum Computing

      June 10, 2026
    • AI in Travel

      Dubai Uses AI to Improve Real-Time Bus Management and Cut Emissions

      June 10, 2026

      Breaking News: Xiamen Airlines to Host 83rd IATA AGM in 2027

      June 8, 2026

      Middle East Disruptions and High Fuel Prices Hit Airlines

      June 8, 2026

      Willie Walsh Report Warns Airline Profits to Halve in 2026

      June 8, 2026

      IATA AGM 2026: China’s Aviation Market Sees Major Growth

      June 7, 2026
    Breaking AI News
    Home » Researchers Train AI Agents to Share Complex Tasks
    Technology & Innovation

    Researchers Train AI Agents to Share Complex Tasks

    Art RyanBy Art RyanNovember 27, 2025No Comments3 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Researchers at Imperial College London and Ant Group, part of the Chinese conglomerate Alibaba Group, introduced a new method for training groups of artificial intelligence (AI) agents to work together on complex tasks, presenting a framework that coordinates a main agent that plans steps and sub-agents that operate tools. The team detailed the approach, called M-GRPO, in a paper released this month and evaluated the system across three real-world benchmarks that measure multi-step reasoning and tool use.

    Single Agent Systems Face Coordination Limits

    Most current tools using AI systems rely on a single agent to handle planning, reasoning and tool execution. They reported that these systems struggle with tasks that require long decision chains because one model must determine what to do, when to do it, which tool to use, and how to combine outputs. According to the paper, errors made early in a sequence often affect subsequent steps when all decisions run through a single model.

    The study tested an alternative structure in which several agents share responsibility. A main agent produces a plan, delegates steps, and checks outputs, while sub-agents run tool operations that may involve several turns. The authors described this structure as a vertical multi-agent setup that mirrors how multistage tasks unfold in real environments where an AI system must search, analyze and retrieve information from external tools.

    In one example, the main agent selected a reasoning tool and issued instructions while sub-agents carried out web navigation or retrieval steps. The researchers noted that this structure differed from single-agent attempts, in which the same component tried to perform every action.

    New Training Method Introduces Decoupled Pipeline

    The researchers developed M-GRPO as an extension of the earlier GRPO method, a training method that evaluates an agent’s output against the average performance of other outputs in the same group and updates the policy based on that relative score.

    The framework adapts GRPO to a structure with a single main agent and multiple sub-agents operating at different frequencies. The paper identifies three challenges in training such systems. The first is that the main agent operates on every turn, while sub-agents engage only when a tool is needed. The second is that tasks may require different numbers of sub-agents. The third is that rollouts may be generated on separate servers.

    To address these issues, the researchers created a decoupled training pipeline. The system collects rollouts from the main agent and all sub-agents and stores them in a shared buffer. Each agent is then evaluated on its contribution to the final answer. The method computes group-relative advantages by comparing an agent’s performance with the average performance of similar agents, allowing updates even when agents participate at different rates.

    The paper states that this design enables coordination between the main agent’s planning behavior and each sub-agent’s tool-execution behavior. The authors wrote that M-GRPO supports scenarios in which sub-agents must run multi-turn tool calls, retrieve external information, or navigate through several steps before returning results.

    Meeting Benchmarks

    The researchers tested their thesis on several performance benchmarks. These benchmarks simulate real-world tasks that require planning and decision-making across multiple stages. WebWalkerQA tasks involve page-to-page navigation, locating specific content and issuing sequential tool calls. XBench DeepSearch includes tasks that require selecting the correct tool, combining retrieved information and assembling a final output. GAIA includes tasks that require searching, running tools and integrating several sources of information.

    The paper reported that the system achieved higher performance than both a single-agent baseline and a multi-agent baseline with fixed sub-agents, and that the multi-agent model demonstrated greater training stability and higher sample efficiency across all three benchmarks.

    Source: https://www.pymnts.com/
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Art Ryan

    Related Posts

    KKR, Kuwait, NVIDIA and Vistra Launch $10 Billion AI Infrastructure Venture Helix

    June 14, 2026

    UAE Launches First National Experts Programme AI Cohort to Strengthen Future AI Leadership

    June 14, 2026

    Xiaomi Launches MiMo Code, an Open-Source AI Coding Assistant Built to Remember Longer Projects

    June 14, 2026

    Comments are closed.

    Latest News

    Weekly AI News: Global AI Developments from June 7–13, 2026

    June 14, 2026

    KKR, Kuwait, NVIDIA and Vistra Launch $10 Billion AI Infrastructure Venture Helix

    June 14, 2026

    UAE Launches First National Experts Programme AI Cohort to Strengthen Future AI Leadership

    June 14, 2026

    Xiaomi Launches MiMo Code, an Open-Source AI Coding Assistant Built to Remember Longer Projects

    June 14, 2026
    Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram LinkedIn YouTube Spotify Reddit Snapchat Threads

    AI University

    • Global Universities
    • Universities in Africa
    • Universities in Asia
    • Universities in Europe
    • Universities in Latin America
    • Universities in Middle East
    • Universities in North America
    • Universities in Oceania

    AI Tools & Apps Directory

    • AI Productivity Tools
    • AI Coding Tools
    • AI Voice Tools
    • AI Video Tools
    • AI Image Generators
    • AI Writing Tools

    Info

    • Home
    • About Us
    • AI Organizations & Associations
    • Contact Us
    • Cookie Policy
    • Copyright Policy
    • Disclaimer
    • Editorial Policy
    • Terms and Conditions

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2026 Breaking AI News.
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.

    Sign Up

    Want to stay ahead In Artificial Intelligence?

     Sign up now and get exclusive breaking AI news and special updates—FREE!