10% off all books and free delivery over £50
Buy from our bookstore and 25% of the cover price will be given to a school of your choice to buy more books. *15% of eBooks.

Spark Operations Cookbook

View All Editions (1)

The selected edition of this book is not available to buy right now.
Add To Wishlist
Write A Review

About

Spark Operations Cookbook Synopsis

The Apache Spark cluster computing system aims to make data analytics fast-both fast to run and fast to write. But as powerful and useful as Spark is for distributed systems, there are many issues that may occur during implementation. This practical cookbook contains recipes solving the most common problems that Spark users face. Author Neelesh Srinivas Salian, a customer operations engineer at Cloudera, has seen all things that can go wrong in the code for Spark applications.

Data engineers, system administrators, architects will learn recipes for debugging common and unexpected problems that occur during key phases of Spark implementation on large distributed system environments. From setting up your cluster to running your first application, submitting to a cluster, understanding storage needs, and handling security and monitoring metrics, this book is your guide to facing any Spark operations issue.

  • Learn an approach to debugging Spark from the perspective of improving business logic implementation
  • Understand the nuances of Spark's components, including Spark Core, Spark Streaming, SparkSQL, and MLLib
  • Get an entire chapter devoted to Spark security-an emerging and vital topic

About This Edition

ISBN: 9781491971581
Publication date:
Author: Neelesh Srinivas Salian
Publisher: O'Reilly an imprint of O'Reilly Media
Format: Paperback
Pagination: 200 pages
Genres: Data mining