BU Today Featured Events
Sign Up
Biological Science Center, 2 Cummington Mall, Room 107

Dask is an open source Python library for parallel computing. This helps to scale Python code to large scale problems, including ones where the quantity of data is much greater than the amount of computer memory on hand. It provides a convenient way to adapt existing programs based around libraries such as Pandas and Numpy to run in parallel. This tutorial will cover using Dask to scale up Pandas Dataframes, numpy array processing, parallelizing custom Python code, and scalable file processing.

Event Details

See Who Is Interested

0 people are interested in this event

User Activity

No recent activity