Close Menu
    • Home
    • Events
      • Upcoming Events
      • Videos
        • Machine Can Think Summit 2026
        • Step Dubai Conference 2026
    • Technology & Innovation
    • Business & Marketing
    • Trends & Insights
    • Industry Applications
    • Tutorials & Guides
    What's Hot
    Business & Marketing

    ZainTECH Named a Leader in IDC MarketScape: Gulf Countries AI Professional Services

    By Art RyanApril 28, 20260

    Dubai – April 22, 2026: ZainTECH, the integrated digital solutions provider of Zain Group, has…

    HomeLight AI Real Estate Closings Transforming the Market

    April 27, 2026

    UiPath & Databricks Partner to Transform Enterprise Operations through Automation and Data Intelligence

    April 27, 2026

    Visit Oman Launches Revolutionary AI Digital Hub and Global Collaboration to Transform Tourism Industry

    April 27, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Breaking AI News
    Tuesday, April 28
    • Home
    • Events
      • Upcoming Events
      • Videos
        • Machine Can Think Summit 2026
        • Step Dubai Conference 2026
    • Technology & Innovation

      HomeLight AI Real Estate Closings Transforming the Market

      April 27, 2026

      UiPath & Databricks Partner to Transform Enterprise Operations through Automation and Data Intelligence

      April 27, 2026

      Visit Oman Launches Revolutionary AI Digital Hub and Global Collaboration to Transform Tourism Industry

      April 27, 2026

      Virgin Voyages AI Rovey and the Future of Cruising

      April 27, 2026

      KAYAK Ask AI Travel Planning for the World Cup

      April 27, 2026
    • Business & Marketing

      ZainTECH Named a Leader in IDC MarketScape: Gulf Countries AI Professional Services

      April 28, 2026

      AI Job Cuts Forecast: Shocking Prediction That 50% of UK Executives Expect Workforce Reduction

      April 20, 2026

      AI in Supply Chain: Redesigning Logistics Operations

      April 15, 2026

      Who Will Own Travel in 2046? AI, Trust, and Power Set to Reshape the Industry

      April 14, 2026

      Omio Inside ChatGPT: Revolutionizing Travel Planning

      April 14, 2026
    • Trends & Insights

      Cursor’s $50 Billion Ambition: Explosive AI Coding Demand Fuels Massive Growth

      April 19, 2026

      Dubai AI-powered government will change your daily life in the UAE

      April 3, 2026

      Alteryx Expands Regional Leadership with Sabya Sen to Lead IMEA & APAC

      April 2, 2026

      Safa Soft Showcases AI Driven Umrah Platform Yusur at Umrah and Ziyarah 2026

      April 2, 2026

      Hitek AI launches compliance solutions for Dubai building safety law

      April 2, 2026
    • Industry Applications

      HomeLight AI Real Estate Closings Transforming the Market

      April 27, 2026

      UiPath & Databricks Partner to Transform Enterprise Operations through Automation and Data Intelligence

      April 27, 2026

      Visit Oman Launches Revolutionary AI Digital Hub and Global Collaboration to Transform Tourism Industry

      April 27, 2026

      Pony.ai Launches Driverless Robotaxi Trials in Dubai

      April 20, 2026

      Grab AI strategy helps cut fuel costs and scale efficiently

      April 9, 2026
    • Tutorials & Guides

      How AI Is Revolutionizing the Future of Travel 2026 with Wellness and Sustainability

      April 19, 2026

      University of Wollongong in Dubai AI initiative boosts future-ready education

      March 31, 2026

      Microsoft AI upgrades Copilot Cowork unveiled for early access users

      March 31, 2026

      Starcloud $11 billion valuation signals AI space race surge

      March 31, 2026

      Flexible AI Factories Power the Future of Energy Grids

      March 30, 2026
    Breaking AI News
    Home » Stanford CRFM partners with Arabic AI on HELM Arabic leaderboard
    Technology & Innovation

    Stanford CRFM partners with Arabic AI on HELM Arabic leaderboard

    Art RyanBy Art RyanJanuary 29, 2026No Comments2 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Stanford CRFM partners with Arabic AI on HELM Arabic leaderboard
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Stanford CRFM collaborates with Arabic.AI to create a new evaluation platform focused on Arabic large language models. The collaboration resulted in HELM Arabic, a public leaderboard designed to measure model performance using standardized Arabic benchmarks. The project was developed by Stanford University’s Center for Research on Foundation Models (CRFM) together with Arabic.AI. The platform extends the existing HELM evaluation framework to Arabic language tasks.

    Stanford CRFM Collaborates with Arabic.AI on HELM Arabic

    The HELM Arabic leaderboard evaluates models across seven Arabic-language tasks. These tasks include AlGhafa, ArabicMMLU, Arabic EXAMS, MadinahQA, AraTrust, ALRAGE, and ArbMMLU-HT. Each benchmark measures different language capabilities. These include multiple-choice reasoning, question answering, grammar understanding, safety evaluation, and academic knowledge. The benchmarks are drawn from established Arabic datasets.

    Evaluation Methodology Used by Stanford CRFM and Arabic.AI

    Stanford CRFM collaborates with Arabic.AI using a standardized evaluation process. The system applies zero-shot prompting for instruction-tuned models. Multiple-choice tasks use Arabic letter options rather than Latin characters. The evaluation samples 1,000 examples per task subset to balance dataset distributions. Optional reasoning features are disabled to maintain consistency across models. The leaderboard records full model prompts and outputs to support reproducibility.

    Model Rankings and Benchmark Results

    In the initial HELM Arabic results, Arabic.AI LLM-X (Pronoia) achieved the highest overall score across all seven tasks. Among open-weights models, Qwen3 235B ranked highest by mean score. Other open-weights models appearing in the top ten include Llama 4 Maverick, Qwen3-Next 80B, and DeepSeek v3.1. Several Arabic-focused models, such as AceGPT-v2, ALLaM, JAIS, and SILMA, were evaluated but did not rank above leading multilingual models.

    Purpose of the HELM Arabic Platform

    Stanford CRFM collaborates with Arabic.AI to address gaps in Arabic model evaluation infrastructure. HELM Arabic provides a transparent system for comparing both proprietary and open models. The platform allows researchers to replicate results and track progress in Arabic language modeling using consistent benchmarks.

    Source: https://www.middleeastainews.com/p/stanford-crfm-collabs-with-arabic

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Art Ryan

    Related Posts

    HomeLight AI Real Estate Closings Transforming the Market

    April 27, 2026

    UiPath & Databricks Partner to Transform Enterprise Operations through Automation and Data Intelligence

    April 27, 2026

    Visit Oman Launches Revolutionary AI Digital Hub and Global Collaboration to Transform Tourism Industry

    April 27, 2026

    Comments are closed.

    Latest News

    ZainTECH Named a Leader in IDC MarketScape: Gulf Countries AI Professional Services

    April 28, 2026

    HomeLight AI Real Estate Closings Transforming the Market

    April 27, 2026

    UiPath & Databricks Partner to Transform Enterprise Operations through Automation and Data Intelligence

    April 27, 2026

    Visit Oman Launches Revolutionary AI Digital Hub and Global Collaboration to Transform Tourism Industry

    April 27, 2026
    Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram

    AI University

    • Global Universities
    • Universities in Africa
    • Universities in Asia
    • Universities in Europe
    • Universities in Latin America
    • Universities in Middle East
    • Universities in North America
    • Universities in Oceania

    AI Tools & Apps Directory

    • AI Productivity Tools
    • AI Coding Tools
    • AI Voice Tools
    • AI Video Tools
    • AI Image Generators
    • AI Writing Tools

    Info

    • Home
    • About Us
    • AI Organizations & Associations
    • Contact Us

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2026 Breaking AI News.
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.