Benchmarks

Delivering open, useful measures of quality, performance and safety to help guide responsible AI development.

About MLCommons Benchmarks

The foundation for MLCommons benchmark work was derived from and builds upon MLPerf which aims to deliver a representative benchmark suite for AI/ML that fairly evaluates system performance to meet five high-level goals:

01

Enable fair comparison of competing systems while still encouraging AI innovation.

02

Accelerate AI progress through fair and useful measurement.

03

Enforce reproducibility to ensure reliable results.

04

Serve both the commercial and research communities.

05

Keep benchmarking effort affordable so all can participate.

Benchmark management

Each benchmark suite is defined by a working group community of experts, who establish the fair benchmarks for AI systems. The working group defines the AI model to run, the data set against which it gets run, sets rules on what changes to the model are allowed, and measures how fast a given hardware runs the model. By working within this AI model tripod, MLCommons AI systems benchmarks measure not only the speed of hardware, but also the quality of training data, and quality metrics of an AI model itself. 

Perspective: Unlocking ML requires an ecosystem approach

Benchmark Suites

How to submit MLPerf results

If you are interested in submitting MLPerf benchmark results, please join the appropriate working group. Registration deadlines are several weeks in advance of submission dates to ensure that all submitters are aware of benchmark requirements, and to ensure proper provision of all necessary resources.

Enable fair comparison of competing systems while still encouraging AI/ML innovation.
Accelerate AI/ML progress through fair and useful measurement.
Enforce reproducibility to ensure reliable results.
Serve both the commercial and research communities.
Keep benchmarking effort affordable so all can participate.

Membership is required for most benchmark working groups (e.g., Training, Inference, Mobile). There are some public benchmark working groups that have no access requirements where non-members may submit to the benchmark by first signing a Non-member Test Agreement.

We encourage people to become MLCommons Members if they wish to contribute to MLCommons projects. However, if you are interested in contributing to one of our open source projects and do not think your organization would be a good fit as a Member, please enter your GitHub ID into our subscription form. If your organization is already a Member of MLCommons, you can also use the subscription form to request authorization to commit code in accordance with the CLA.

Submission Requirements:

Join an appropriate MLCommons Working Group
A signed Contributor License Agreement (CLA) to enable contributing code, logs, etc. to MLCommons GitHub repositories.
Membership in MLCommons OR a signed Non-member Test Agreement
A signed trademark license agreement (either the member OR non-member version, as appropriate)

MLPerf is a registered trademark of MLCommons. The use of the MLPerf results and trademark are described in our Policies.

Both member and non-member trademark agreements are available upon request by contacting support@mlcommons.org.

Benchmarks

About MLCommons Benchmarks

01

02

03

04

05

Benchmark management

Benchmark Suites

AILuminate

AlgoPerf: Training Algorithms Benchmark Results

MLPerf Client

MLPerf Inference: Datacenter

MLPerf Inference: Edge

MLPerf Inference: Mobile

MLPerf Inference: Tiny

MLPerf Storage

MLPerf Training

MLPerf Training: HPC

How to submit MLPerf results

Submission Requirements: