Dask (software)

Dask
Original author(s)Matthew Rocklin
Developer(s)Dask
Initial releaseJanuary 8, 2015 (2015-01-08)
Stable release
2024.2.1 / February 23, 2024 (2024-02-23)
RepositoryDask Repository
Written inPython
Operating systemLinux, Microsoft Windows, macOS
Available inPython
TypeData analytics
LicenseNew BSD
Websitedask.org

Dask is an open-source Python library for parallel computing. Dask scales Python code from multi-core local machines to large distributed clusters in the cloud. Dask provides a familiar user interface by mirroring the APIs of other libraries in the PyData ecosystem including: Pandas, scikit-learn and NumPy. It also exposes low-level APIs that help programmers run custom algorithms in parallel.

Dask was created by Matthew Rocklin in December 2014 and has over 9.8k stars and 500 contributors on GitHub.

Dask is used by retail, financial, governmental organizations, as well as life science and geophysical institutes. Walmart, Wayfair, JDA, GrubHub, General Motors, Nvidia, Harvard Medical School, Capital One and NASA are among the organizations that use Dask.