Posted on 2019-08-13, by nokia241186.
Packt - Scalable Data Analysis in Python with Dask
English | Size: 1.09 GB
Category: Programming | E-learning
Understand the concept of Block algorithms and how Dask leverages it to load large data.
Implement various example using Dask Arrays, Bags, and Dask Data frames for efficient parallel computing
Combine Dask with existing Python packages such as NumPy and Pandas
See how Dask works under the hood and the various in-built algorithms it has to offer
Leverage the power of Dask in a distributed setting and explore its various schedulers
Implement an end-to-end Machine Learning pipeline in a distributed setting using Dask and scikit-learn
Use Dask Arrays, Bags, and Dask Data frames for parallel and out-of-memory computations
Data analysts, Machine Learning professionals, and data scientists often use tools such as Pandas, Scikit-Learn, and NumPy for data analysis on their personal computer. However, when they want to apply their analyses to larger datasets, these tools fail to scale beyond a single machine, and so the analyst is forced to rewrite their computation.
If you work on big data and you're using Pandas, you know you can end up waiting up to a whole minute for a simple average of a series. And that's just for a couple of million rows!
In this course, you'll learn to scale your data analysis. Firstly, you will execute distributed data science projects right from data ingestion to data manipulation and visualization using Dask. Then, you will explore the Dask framework. After, see how Dask can be used with other common Python tools such as NumPy, Pandas, matplotlib, Scikit-learn, and more.
You'll be working on large datasets and performing exploratory data analysis to investigate the dataset, then come up with the findings from the dataset. You'll learn by implementing data analysis principles using different statistical techniques in one go across different systems on the same massive datasets.
Throughout the course, we'll go over the various techniques, modules, and features that Dask has to offer. Finally, you'll learn to use its unique offering for machine learning, using the Dask-ML package. You'll also start using parallel processing in your data tasks on your own system without moving to the distributed environment.
All the code files and related files are uploaded on GitHub at this link: https://github.com/PacktPublishing/-Scalable-Data-Analysis-in-Python-with-Dask
Style and Approach
This hands-on course covers all the important components of Dask (arrays, bags, data frames, schedulers, and the Futures API) to parallelize your existing Python code and perform computations in a distributed setting. This course is designed with minimum theory and maximum practical implementation, followed by step-by-step instructions to get you up and running.
Leverage the power of parallel computing using Dask.delayed
Get complete exposure to using Dask to handle large data in a distributed setting
Learn how to do machine learning by combining scikit-learn and Dask in a distributed setting
Course Length 3 hours 31 minutes ISBN 9781789808926 Date Of Publication 30 May 2019
(Buy premium account for maximum speed and resuming ability)
- Ebooks list page : 41118
- 2019-09-28Scalable Data Analysis In Python With Dask
- 2019-06-25Scalable Data Analysis in Python with Dask
- 2019-06-24Scalable Data Analysis in Python with Dask
- 2019-05-21Packt Exploratory Data Analysis with Pandas and Python 3.x
- 2020-04-20Packt Exploratory Data Analysis with R
- 2020-04-11Python Programming: 2 Books in 1: Python for Data Analysis and Science with Big Data Analysis, Statistics and Machine Learning.
- 2020-03-05Packt Exploratory Data Analysis With R
- 2020-03-02Python for Data Science: Master Data Analysis from Scratch, with Business Analytics Tools and Step-by-Step techniques for Beginners. The Future of Machine Learning & Applied Artificial Intelligence
- 2020-01-07Packt - Exploratory Data Analysis with R
- 2019-11-30Learning Geospatial Analysis with Python: Understand GIS fundamentals and perform remote sensing data analysis using Python 3.7, 3rd Edition
- 2019-10-22Learning Geospatial Analysis with Python: Understand GIS fundamentals and perform remote sensing data analysis using Python 3.7, 3rd Edition
- 2019-03-15Packt ADVANCED DATA ANALYSIS WITH HASKELL
- 2019-01-26PACKT- ADVANCED DATA ANALYSIS WITH HASKELL JGTiSO
- 2019-01-25Pluralsight Getting Started with Data Analysis Using Python - Removed
- 2018-12-14Packt - Advanced Data Analysis with Haskell
- 2018-12-02Packt - Advanced Data Analysis with Haskell
- 2018-08-09Data Analysis From Scratch With Python Step By Step Guide
- 2018-08-07Data Analysis From Scratch With Python: Step By Step Guide
- 2018-07-15Data Analysis From Scratch With Python: Step By Step Guide by Peters Morgan
- Download links and password may be in the description section, read description carefully!
- Do a search to find mirrors if no download links or dead links.