Skip to content

tiagoantao/python-performance

Repository files navigation

Fast Python for Data Science

Welcome to the code repository for the book Fast Python

Book cover

Fast Python is your guide to optimizing every part of your Python-based data analysis process, from the pure Python code you write to managing the resources of modern hardware and GPUs. You'll learn to rewrite inefficient data structures, improve underperforming code with multithreading, and simplify your datasets without sacrificing accuracy.

Here you can find the code for the book. Here is a chapter-oriented roadmap. Given that the book is in early access, the repo is also under construction

Introduction

Extracting maximum performance from built-in features

Concurrency, parallelism, and asynchronous processing

Using NumPy more efficiently

Extracting maximum efficiency of hardware and networks

Re-implementing critical code with Cython

Memory hierarchy, storage and networking

Optimizing modern data processing libraries

High performance Pandas and Apache Arrow

Storing big data

Advanced topics

Data analysis using GPU computing

Analyzing big data with Dask

Appendixes

Setting up the environment