Close Menu
    • Home
    • Events
    • Videos
      • Machine Can Think Summit 2026
      • Step Dubai Conference 2026
    • Technology & Innovation
    • Business & Marketing
    • Trends & Insights
    • Industry Applications
    • Tutorials & Guides
    What's Hot
    Business & Marketing

    HMRC Signs £175 Million AI Transformation Deal With Quantexa

    By Art RyanMay 18, 20260

    The United Kingdom’s HM Revenue and Customs (HMRC) has announced a 10-year, £175 million ($233…

    OpenAI Acquires Weights.gg to Broaden Its Voice AI Presence

    May 18, 2026

    New EU AI Border System May Bring Travel Delays for US Tourists

    May 18, 2026

    AI-Powered Apps Drive Colombia’s Birdwatching Tourism Growth

    May 18, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Breaking AI News
    Tuesday, May 19
    • Home
    • Events
    • Videos
      • Machine Can Think Summit 2026
      • Step Dubai Conference 2026
    • Technology & Innovation

      HMRC Signs £175 Million AI Transformation Deal With Quantexa

      May 18, 2026

      OpenAI Acquires Weights.gg to Broaden Its Voice AI Presence

      May 18, 2026

      New EU AI Border System May Bring Travel Delays for US Tourists

      May 18, 2026

      AI-Powered Apps Drive Colombia’s Birdwatching Tourism Growth

      May 18, 2026

      UAE Deploys 50 AI Traffic Surveillance Stations on Federal Roads

      May 18, 2026
    • Business & Marketing

      HMRC Signs £175 Million AI Transformation Deal With Quantexa

      May 18, 2026

      OpenAI Acquires Weights.gg to Broaden Its Voice AI Presence

      May 18, 2026

      Major US Deals Boost Abu Dhabi AI Privacy Tech Acquisitions

      May 17, 2026

      Korea UAE AI Alliance: Strengthening Global Cooperation

      May 16, 2026

      UAE and India Expand AI, Energy, and Defense Investment Ties

      May 16, 2026
    • Trends & Insights

      Ghana AI Healthcare Programme for Quality Healthcare Access

      May 18, 2026

      Israel National AI Strategy Drives AI Talent and Startup Innovation

      May 18, 2026

      Malta Unveils ChatGPT Plus Initiative to Accelerate AI Growth

      May 17, 2026

      Korea UAE AI Alliance: Strengthening Global Cooperation

      May 16, 2026

      Figma Raises 2026 Forecast Despite Claude Design Pressure

      May 16, 2026
    • Industry Applications

      HMRC Signs £175 Million AI Transformation Deal With Quantexa

      May 18, 2026

      UAE Deploys 50 AI Traffic Surveillance Stations on Federal Roads

      May 18, 2026

      Ghana AI Healthcare Programme for Quality Healthcare Access

      May 18, 2026

      UAE China AI Education Partnership: Future of Learning

      May 18, 2026

      MTestHub Strengthens UAE AI Workforce Transformation Efforts

      May 16, 2026
    • Tutorials & Guides

      How AI Is Revolutionizing the Future of Travel 2026 with Wellness and Sustainability

      April 19, 2026

      University of Wollongong in Dubai AI initiative boosts future-ready education

      March 31, 2026

      Microsoft AI upgrades Copilot Cowork unveiled for early access users

      March 31, 2026

      Starcloud $11 billion valuation signals AI space race surge

      March 31, 2026

      Flexible AI Factories Power the Future of Energy Grids

      March 30, 2026
    Breaking AI News
    Home » Stanford CRFM partners with Arabic AI on HELM Arabic leaderboard
    Technology & Innovation

    Stanford CRFM partners with Arabic AI on HELM Arabic leaderboard

    Art RyanBy Art RyanJanuary 29, 2026No Comments2 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Stanford CRFM partners with Arabic AI on HELM Arabic leaderboard
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Stanford CRFM collaborates with Arabic.AI to create a new evaluation platform focused on Arabic large language models. The collaboration resulted in HELM Arabic, a public leaderboard designed to measure model performance using standardized Arabic benchmarks. The project was developed by Stanford University’s Center for Research on Foundation Models (CRFM) together with Arabic.AI. The platform extends the existing HELM evaluation framework to Arabic language tasks.

    Stanford CRFM Collaborates with Arabic.AI on HELM Arabic

    The HELM Arabic leaderboard evaluates models across seven Arabic-language tasks. These tasks include AlGhafa, ArabicMMLU, Arabic EXAMS, MadinahQA, AraTrust, ALRAGE, and ArbMMLU-HT. Each benchmark measures different language capabilities. These include multiple-choice reasoning, question answering, grammar understanding, safety evaluation, and academic knowledge. The benchmarks are drawn from established Arabic datasets.

    Evaluation Methodology Used by Stanford CRFM and Arabic.AI

    Stanford CRFM collaborates with Arabic.AI using a standardized evaluation process. The system applies zero-shot prompting for instruction-tuned models. Multiple-choice tasks use Arabic letter options rather than Latin characters. The evaluation samples 1,000 examples per task subset to balance dataset distributions. Optional reasoning features are disabled to maintain consistency across models. The leaderboard records full model prompts and outputs to support reproducibility.

    Model Rankings and Benchmark Results

    In the initial HELM Arabic results, Arabic.AI LLM-X (Pronoia) achieved the highest overall score across all seven tasks. Among open-weights models, Qwen3 235B ranked highest by mean score. Other open-weights models appearing in the top ten include Llama 4 Maverick, Qwen3-Next 80B, and DeepSeek v3.1. Several Arabic-focused models, such as AceGPT-v2, ALLaM, JAIS, and SILMA, were evaluated but did not rank above leading multilingual models.

    Purpose of the HELM Arabic Platform

    Stanford CRFM collaborates with Arabic.AI to address gaps in Arabic model evaluation infrastructure. HELM Arabic provides a transparent system for comparing both proprietary and open models. The platform allows researchers to replicate results and track progress in Arabic language modeling using consistent benchmarks.

    Source: https://www.middleeastainews.com/p/stanford-crfm-collabs-with-arabic

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Art Ryan

    Related Posts

    HMRC Signs £175 Million AI Transformation Deal With Quantexa

    May 18, 2026

    OpenAI Acquires Weights.gg to Broaden Its Voice AI Presence

    May 18, 2026

    New EU AI Border System May Bring Travel Delays for US Tourists

    May 18, 2026

    Comments are closed.

    Latest News

    HMRC Signs £175 Million AI Transformation Deal With Quantexa

    May 18, 2026

    OpenAI Acquires Weights.gg to Broaden Its Voice AI Presence

    May 18, 2026

    New EU AI Border System May Bring Travel Delays for US Tourists

    May 18, 2026

    AI-Powered Apps Drive Colombia’s Birdwatching Tourism Growth

    May 18, 2026
    Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram LinkedIn YouTube Spotify Reddit Snapchat Threads

    AI University

    • Global Universities
    • Universities in Africa
    • Universities in Asia
    • Universities in Europe
    • Universities in Latin America
    • Universities in Middle East
    • Universities in North America
    • Universities in Oceania

    AI Tools & Apps Directory

    • AI Productivity Tools
    • AI Coding Tools
    • AI Voice Tools
    • AI Video Tools
    • AI Image Generators
    • AI Writing Tools

    Info

    • Home
    • About Us
    • AI Organizations & Associations
    • Contact Us
    • Cookie Policy
    • Copyright Policy
    • Disclaimer
    • Editorial Policy
    • Terms and Conditions

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2026 Breaking AI News.
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.

    Sign Up

    Want to stay ahead In Artificial Intelligence?

     Sign up now and get exclusive breaking AI news and special updates—FREE!