Learning Spark

Learning Spark
Author :
Publisher : O'Reilly Media
Total Pages : 400
Release :
ISBN-10 : 9781492050018
ISBN-13 : 1492050016
Rating : 4/5 (016 Downloads)

Book Synopsis Learning Spark by : Jules S. Damji

Download or read book Learning Spark written by Jules S. Damji and published by O'Reilly Media. This book was released on 2020-07-16 with total page 400 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data is bigger, arrives faster, and comes in a variety of formatsâ??and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark. Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, youâ??ll be able to: Learn Python, SQL, Scala, or Java high-level Structured APIs Understand Spark operations and SQL Engine Inspect, tune, and debug Spark operations with Spark configurations and Spark UI Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka Perform analytics on batch and streaming data using Structured Streaming Build reliable data pipelines with open source Delta Lake and Spark Develop machine learning pipelines with MLlib and productionize models using MLflow


Learning Spark Related Books

Learning Spark
Language: en
Pages: 400
Authors: Jules S. Damji
Categories: Computers
Type: BOOK - Published: 2020-07-16 - Publisher: O'Reilly Media

GET EBOOK

Data is bigger, arrives faster, and comes in a variety of formatsâ??and it all needs to be processed at scale for analytics or machine learning. But how can yo
Advanced Analytics with Spark
Language: en
Pages: 276
Authors: Sandy Ryza
Categories: Computers
Type: BOOK - Published: 2015-04-02 - Publisher: "O'Reilly Media, Inc."

GET EBOOK

In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors
Learning Spark
Language: en
Pages: 289
Authors: Holden Karau
Categories: Computers
Type: BOOK - Published: 2015-01-28 - Publisher: "O'Reilly Media, Inc."

GET EBOOK

Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source
Machine Learning in Python
Language: en
Pages: 361
Authors: Michael Bowles
Categories: Computers
Type: BOOK - Published: 2015-03-31 - Publisher: John Wiley & Sons

GET EBOOK

Learn a simpler and more effective way to analyze data and predict outcomes with Python Machine Learning in Python shows you how to successfully analyze data us
Spark in Action
Language: en
Pages: 574
Authors: Jean-Georges Perrin
Categories: Computers
Type: BOOK - Published: 2020-05-12 - Publisher: Simon and Schuster

GET EBOOK

Summary The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spar