Programming techniques that work well on laptop-sized data can slow to a crawl, or fail altogether, when applied to massive files or distributed datasets. By mastering the powerful map and reduce paradigm, along with the Python-based tools that support it, you can write data-centric applications that scale efficiently without requiring codebase rewrites as your requirements change. "Mastering Large Datasets with Python" teaches you to write code that can handle datasets of any size. You'll start with laptop-sized datasets that teach you to parallelize data analysis by breaking large tasks into smaller ones that can run simultaneously. You'll then scale those same programs to industrial-sized datasets on a cluster of cloud servers. With the map and reduce paradigm firmly in place, you'll explore tools like Hadoop and PySpark to efficiently process massive distributed datasets, speed up decision-making with machine learning, and simplify your data storage with AWS S3.
Add this copy of Mastering Large Datasets With Python: Parallelize and to cart. $17.86, very good condition, Sold by BooksRun rated 5.0 out of 5 stars, ships from Philadelphia, PA, UNITED STATES, published 2020 by Manning.
Add this copy of Mastering Large Datasets With Python: Parallelize and to cart. $17.86, good condition, Sold by BooksRun rated 5.0 out of 5 stars, ships from Philadelphia, PA, UNITED STATES, published 2020 by Manning.
Add this copy of Mastering Large Datasets With Python: Parallelize and to cart. $18.78, good condition, Sold by HPB-Red rated 5.0 out of 5 stars, ships from Dallas, TX, UNITED STATES, published 2020 by Manning Publications.
Choose your shipping method in Checkout. Costs may vary based on destination.
Seller's Description:
Good. Connecting readers with great books since 1972! Used textbooks may not include companion materials such as access codes, etc. May have some wear or limited writing/highlighting. We ship orders daily and Customer Service is our top priority!
Add this copy of Mastering Large Datasets: Parallelize and Distribute to cart. $55.57, new condition, Sold by Media Smart rated 3.0 out of 5 stars, ships from Hawthorne, CA, UNITED STATES, published 2020 by Manning Publications.
Add this copy of Mastering Large Datasets With Python: Parallelize and to cart. $55.83, good condition, Sold by Bonita rated 4.0 out of 5 stars, ships from Newport Coast, CA, UNITED STATES, published 2020 by Manning.
Add this copy of Mastering Large Datasets With Python to cart. $59.47, new condition, Sold by Blackwell's rated 3.0 out of 5 stars, ships from Gloucester, GLOUCESTERSHIRE, UNITED KINGDOM, published 2020 by Manning Publications.
Add this copy of Mastering Large Datasets (Book) to cart. $65.93, new condition, Sold by Basi6 International rated 5.0 out of 5 stars, ships from Irving, TX, UNITED STATES, published 2020 by Manning Publications.
Add this copy of Mastering Large Datasets (Book) to cart. $65.93, new condition, Sold by discount_scientific_books rated 5.0 out of 5 stars, ships from Sterling Heights, MI, UNITED STATES, published 2020 by Manning Publications.