MLCommons Releases New AI Benchmark to Test Speed of Responses to User Queries

Mar 28, 2024 10:15

Artificial intelligence benchmarking group MLCommons on Wednesday released a fresh set of tests and results that rate the speed at which top-of-the-line hardware can run AI applications and respond to users.

The two new benchmarks added by MLCommons measure the speed at which AI chips and systems can generate responses from powerful AI models packed with data. The results roughly demonstrate how quickly an AI application such as ChatGPT can deliver a response to a user query.

One of the new benchmarks adds the capability to measure the speed of a question-and-answer scenario for large language models. It is based on Llama 2, a model with 70 billion parameters that was developed by Meta Platforms.

MLCommons officials also added a second new benchmark, a text-to-image generator based on Stability AI's Stable Diffusion XL model, to the suite of benchmarking tools, called MLPerf.

Servers powered by Nvidia's H100 chips, built by the likes of Alphabet's Google, Supermicro and Nvidia itself, handily won both new benchmarks on raw performance. Several server builders submitted designs based on Nvidia's less powerful L40S chip.

Server builder Krai submitted a design for the image generation benchmark with a Qualcomm AI chip that draws significantly less power than Nvidia's cutting-edge processors.

Intel also submitted a design based on its Gaudi2 accelerator chips. The company described the results as "solid."

Raw performance is not the only measure that is critical when deploying AI applications. Advanced AI chips suck up enormous amounts of energy, and one of the most significant challenges for AI companies is deploying chips that deliver an optimal amount of performance for a minimal amount of energy.

MLCommons has a separate benchmark category for measuring power consumption.

(This story has not been edited by NDTV staff and is auto-generated from a syndicated feed.)
