Distributed Machine Learning Benchmark


Machine learning services and systems have advanced rapidly, extending their application to a wide range of use cases. However, some basic questions have emerged from the perspectives of both users and providers of such services. How can we improve the efficiency, ease of use, transparency, and reproducibility of distributed machine learning methods, and provide fair performance measures as well as reference implementations? Answering these questions could increase the adoption of distributed machine learning methods in industry and academia. MLbench, a benchmarking framework for distributed machine learning, can help achieve those goals.

The main objectives of MLbench are to:

Serve as an easy-to-use and fair benchmarking suite for algorithms as well as for systems (software frameworks and hardware).
Provide re-usable and reliable reference implementations of distributed ML training algorithms.

MLbench is based on Kubernetes to ease deployment in a distributed setting, both on public clouds and on dedicated hardware. It supports several standard machine learning frameworks and algorithms, and can be set up with a single shell command. It comes with a dashboard for launching and managing experiments, including monitoring resource usage on all worker nodes. You can quickly set up the reference experiments or start your own, and get visualizations of your runs.

By offering precise specifications of the benchmark ML tasks and metrics, as well as reference implementations, MLbench provides fair baselines and improves transparency. It supports a wide range of platforms, ML frameworks, and machine learning tasks. Our goal is to benchmark all, or at least most, currently relevant distributed execution frameworks. We welcome contributions of new algorithms and systems to the benchmark suite.

MLbench consists of a public website and five GitHub repositories:

Documentation: http://github.com/mlbench/mlbench-docs
Helm Charts for Kubernetes: http://github.com/mlbench/mlbench-helm
Python Core Library: http://github.com/mlbench/mlbench-core
Benchmark Implementations: http://github.com/mlbench/mlbench-benchmarks
Dashboard: http://github.com/mlbench/mlbench-dashboard

Suggested Reading

https://mlbench.github.io/


