eBook

Spark in Action

Marko Bonaći, Software Engineer, data Artisans

Petar Zečević, Software Engineer, data Artisans

Audience: Architects

Technical level: Introductory

Working with big data can be complex and challenging, in part because of the multiple analysis frameworks and tools required. Apache Spark is a big data processing framework perfect for analyzing near-real-time streams and discovering historical patterns in batched data sets. But Spark goes much further than other frameworks. By including machine learning and graph processing capabilities, it makes many specialized data processing platforms obsolete. Spark's unified framework and programming model significantly lowers the initial infrastructure investment, and Spark's core abstractions are intuitive for most Scala, Java, and Python developers.

Spark in Action teaches you to use Spark for stream and batch data processing. It starts with an introduction to the Spark architecture and ecosystem followed by a taste of Spark's command line interface. You then discover the most fundamental concepts and abstractions of Spark, particularly Resilient Distributed Datasets (RDDs) and the basic data transformations that RDDs provide. The first part of the book also introduces you to writing Spark applications using the the core APIs. Next, you learn about different Spark components: how to work with structured data using Spark SQL, how to process near-real time data with Spark Streaming, how to apply machine learning algorithms with Spark MLlib, how to apply graph algorithms on graph-shaped data using Spark GraphX, and a clear introduction to Spark clustering.

The free sample PDF available here includes chapter 1. All registrants qualify for a Lightbend discount and will save 40% off the price of Spark in Action (all formats) as it becomes available by referencing promotional code 15tsspark. If you are interested in purchasing the full book get it here. Offer only valid at manning.com.

Grab your copy

Please enter your information to receive your E-book chapter(s) of Spark in Action and be signed up for the Lightbend Newsletter. Once you've entered your information and submitted the form, the PDF will be emailed to your address.

*Required: The information you provide will be used in accordance with the terms of our privacy policy. **Required Opt-In: I would like to receive the monthly newsletter, educational resources (white papers, ebooks, webinars, reports), and event information via email. You can unsubscribe at anytime or manage your email preferences here.

ABOUT AUTHOR(S)

Marko Bonaći, Software Engineer, data Artisans

Petar Zečević, Software Engineer, data Artisans

About Lightbend

Lightbend (Twitter: @Lightbend) provides the leading Reactive application development platform for building distributed systems. Based on a message-driven runtime, these distributed systems, which include microservices and streaming fast data applications, can effortlessly scale on multi-core and cloud architectures. Many of the most admired brands around the globe are transforming their businesses with our platform, engaging billions of users every day through software that is changing the world. For more information on Lightbend, visit: lightbend.com