Contribute Media
A thank you to everyone who makes this possible: Read More

Meet dask and distributed: the unsung heroes of Python scientific data ecosystem.

Description

Thanks to its world-class data tools and libraries, like Numpy, Pandas, Jupyter, Matplotlib and xarray, Python is becoming the language of choice in many scientific communities from Physics to Climate Science, from Earth Observation to Economy.

A turn-key but less-know component of the scientific ecosystem is the dask library that enable seamless parallel, distributed and GPU computing in most cases without code changes.

We will use climate science as an typical example of a discipline where simple tasks become easily big data problems and where mastering xarray, dask and dask.distributed is the key to turn them back into simple tasks, possibly on a large cluster of VMs (that you can easily provision from your preferred cloud provider).

Slides: https://gitpitch.com/alexamici/talks/master?p=PyConX-2019

Feedback form: https://python.it/feedback-1704

in __on Friday 3 May at 17:15 **See schedule**

Improve this page