Close Menu
    • Home
    • Events
      • Upcoming Events
      • Videos
        • Machine Can Think Summit 2026
        • Step Dubai Conference 2026
    • Technology & Innovation
    • Business & Marketing
    • Trends & Insights
    • Industry Applications
    • Tutorials & Guides
    What's Hot
    Industry Applications

    AI Drug Development Johnson & Johnson Impact on Healthcare

    By Art RyanApril 28, 20260

    Johnson & Johnson (J&J) has unveiled new information about the future of AI in healthcare,…

    Qualcomm OpenAI AI Smartphone Processors Partnership News

    April 28, 2026

    Google AI Campus South Korea and Its Development Plans

    April 28, 2026

    Accenture Copilot Rollout Enhances Employee Productivity

    April 28, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Breaking AI News
    Wednesday, April 29
    • Home
    • Events
      • Upcoming Events
      • Videos
        • Machine Can Think Summit 2026
        • Step Dubai Conference 2026
    • Technology & Innovation

      AI Drug Development Johnson & Johnson Impact on Healthcare

      April 28, 2026

      Qualcomm OpenAI AI Smartphone Processors Partnership News

      April 28, 2026

      Google AI Campus South Korea and Its Development Plans

      April 28, 2026

      New AI-Based Solution Launched by Box to Revolutionize Enterprise Workflows

      April 28, 2026

      Meta AWS Graviton AI Partnership: Revolutionizing Infrastructure

      April 28, 2026
    • Business & Marketing

      UK AI Startup Ineffable Secures $1.1B in Europe’s Largest Seed Round

      April 28, 2026

      Meta Manus AI Acquisition Blocked Over Strategic Concerns

      April 28, 2026

      Microsoft Ceases Revenue Split With OpenAI in Landmark AI Partnership Move

      April 28, 2026

      ZainTECH Named a Leader in IDC MarketScape: Gulf Countries AI Professional Services

      April 28, 2026

      AI Job Cuts Forecast: Shocking Prediction That 50% of UK Executives Expect Workforce Reduction

      April 20, 2026
    • Trends & Insights

      Google AI Campus South Korea and Its Development Plans

      April 28, 2026

      Meta Manus AI Acquisition Blocked Over Strategic Concerns

      April 28, 2026

      Emirati Inventor AI UAE: Bridging Culture and Technology

      April 28, 2026

      Cursor’s $50 Billion Ambition: Explosive AI Coding Demand Fuels Massive Growth

      April 19, 2026

      Dubai AI-powered government will change your daily life in the UAE

      April 3, 2026
    • Industry Applications

      AI Drug Development Johnson & Johnson Impact on Healthcare

      April 28, 2026

      Accenture Copilot Rollout Enhances Employee Productivity

      April 28, 2026

      HomeLight AI Real Estate Closings Transforming the Market

      April 27, 2026

      UiPath & Databricks Partner to Transform Enterprise Operations through Automation and Data Intelligence

      April 27, 2026

      Visit Oman Launches Revolutionary AI Digital Hub and Global Collaboration to Transform Tourism Industry

      April 27, 2026
    • Tutorials & Guides

      How AI Is Revolutionizing the Future of Travel 2026 with Wellness and Sustainability

      April 19, 2026

      University of Wollongong in Dubai AI initiative boosts future-ready education

      March 31, 2026

      Microsoft AI upgrades Copilot Cowork unveiled for early access users

      March 31, 2026

      Starcloud $11 billion valuation signals AI space race surge

      March 31, 2026

      Flexible AI Factories Power the Future of Energy Grids

      March 30, 2026
    Breaking AI News
    Home » Inference scaling emerges as the next frontier for AI at AISC 2025
    Technology & Innovation

    Inference scaling emerges as the next frontier for AI at AISC 2025

    Art RyanBy Art RyanMarch 13, 2025No Comments5 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    The concept of inference scaling is being hailed as a transformative approach in artificial intelligence (AI) at the AI and Semiconductor International Conference 2025 (AISC 2025) in Hà Nội.

    Held from March 12–16 in Hà Nội, with additional sessions in Đà Nẵng, AISC 2025 has attracted over 1,000 technology experts and industry leaders from around the world. Co-organised by Việt Nam’s National Innovation Centre (NIC) and US-based AI firm Aitomatic, the conference explores how AI and semiconductor advancements are reshaping the future of computing. 

    A key theme at the conference is the shift towards allocating computational power during inference, rather than primarily during training. According to a professor at Stanford University and AI researcher at Google DeepMind, Azalia Mirhoseini, inference scaling represents a new axis for AI performance enhancement.

    She likened it to an “infinite monkey” approach, where an AI model can generate multiple outputs and eventually arrive at the correct solution given enough attempts. This contrasts with traditional AI development, which prioritises pre-training and fine-tuning as the main scaling strategies.

    AI inference scaling and its impact on accuracy

    Emerging research shared at AISC 2025 highlights that allowing AI to generate multiple solutions and selecting the best one can dramatically improve accuracy. A proposed framework, “Large Language Monkeys,” demonstrated that running a large language model (LLM) multiple times on the same prompt—while an automated verifier assesses each output—can significantly enhance the likelihood of correct responses.

    Across reasoning and programming tasks, researchers observed that the probability of obtaining the correct answer, termed coverage, increases predictably with the number of inference attempts.

    According to Mirhoseini, this follows an inference-time scaling law, similar to well-established training scaling laws. She explained that in fields where automated verification is possible—such as unit testing for software or mathematical proof verification—this approach directly enhances problem-solving success.

    For instance, in software development benchmarks, an AI-based code generator solved 15.9 per cent of coding problems with a single attempt. However, when given 250 attempts, its accuracy increased to 56 per cent, surpassing the previous best-in-class model, which achieved 43 per cent in a single-shot scenario.

    Even a smaller 70-billion-parameter open-source model, when given sufficient inference runs, could match or outperform larger models like GPT-4 on specific coding and reasoning tasks. These findings suggest that computational effort during inference can compensate for smaller model sizes or limited training data, making advanced AI capabilities more accessible without requiring massive models.

    Applications 

    At AISC 2025, researchers showcased several real-world applications of inference scaling across software engineering, hardware programming and semiconductor design.

    In software development, a prototype system called ‘CodeMonkeys’ applies inference scaling to programming tasks. The AI generates multiple candidate code edits and bug fixes in parallel, each evaluated automatically using unit tests.

    According to Mirhoseini, this process enables the AI to refine its output iteratively, selecting the most optimal solution. The key insight is that allocating more computational power at the inference stage—rather than during initial training—enhances AI’s ability to write, debug and optimise code.

    In hardware programming, researchers introduced ‘KernelBench’ a tool leveraging inference scaling to automate low-level programming tasks. Writing optimised kernel code—critical for high-performance computing—traditionally requires extensive manual effort.

    KernelBench enables AI models to generate kernel code, receive compiler feedback and performance metrics, and refine their output over multiple iterations. This iterative process allows AI to automate complex programming tasks that would otherwise require significant human expertise and time.

    The conference also underscored AI’s growing impact on chip design, with Google’s AlphaChip project serving as a standout example. AlphaChip employs deep reinforcement learning to automate chip floorplanning, an essential step in semiconductor design.

    The FPT booth at the conference. — VNS Photo Tiến Đạt

    According to Google, its AI-generated chip layouts are comparable to or superior to human designs across all performance metrics, while requiring significantly less time. A floorplan that would take months for human engineers to finalise can be generated by AlphaChip’s AI in under six hours. Mirhoseini noted that inference scaling could further enhance AI-driven chip design, enabling rapid evaluation of thousands of design variations to improve efficiency and performance.

    The emergence of inference scaling marks a fundamental shift in AI development. Traditionally, AI research has focused on increasing model size and dataset volume to improve performance. However, the findings presented at AISC 2025 suggest that redistributing computational resources to inference may unlock latent AI capabilities without requiring ever-larger models.

    This shift also presents new challenges for hardware and software infrastructure. As AI inference workloads grow, developing specialised AI chips that optimise for high-throughput inference will be crucial.

    Discussions at AISC 2025 highlighted next-generation AI accelerators and parallel processing techniques aimed at reducing computational costs associated with inference scaling.

    Experts at the conference expressed optimism that inference scaling will become a cost-effective and practical approach to AI deployment. By combining advanced inference strategies, automated verification, and high-performance hardware, AI systems may soon tackle problems previously considered too complex or computationally expensive. — VNS

    Source: https://vietnamnews.vn/

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Art Ryan

    Related Posts

    AI Drug Development Johnson & Johnson Impact on Healthcare

    April 28, 2026

    Qualcomm OpenAI AI Smartphone Processors Partnership News

    April 28, 2026

    Google AI Campus South Korea and Its Development Plans

    April 28, 2026

    Comments are closed.

    Latest News

    AI Drug Development Johnson & Johnson Impact on Healthcare

    April 28, 2026

    Qualcomm OpenAI AI Smartphone Processors Partnership News

    April 28, 2026

    Google AI Campus South Korea and Its Development Plans

    April 28, 2026

    Accenture Copilot Rollout Enhances Employee Productivity

    April 28, 2026
    Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram

    AI University

    • Global Universities
    • Universities in Africa
    • Universities in Asia
    • Universities in Europe
    • Universities in Latin America
    • Universities in Middle East
    • Universities in North America
    • Universities in Oceania

    AI Tools & Apps Directory

    • AI Productivity Tools
    • AI Coding Tools
    • AI Voice Tools
    • AI Video Tools
    • AI Image Generators
    • AI Writing Tools

    Info

    • Home
    • About Us
    • AI Organizations & Associations
    • Contact Us

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2026 Breaking AI News.
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.