7 Steps for a Developer to Learn Apache Spark

Released in July 2016, Apache Spark 2.0 was more than a version bump from 1.x to 2.0: it was a major shift toward ease of use, higher performance, and a smarter unification of APIs across Spark components, and it laid the foundation for a unified API for Structured Streaming. It also set the course for how these unified APIs are developed in subsequent releases, giving developers expressive ways to write their computations on structured data sets.

In this ebook, we expand on and curate concepts initially published on KDnuggets. We also augment the ebook with technical blogs and related assets specific to Apache Spark 2.x, written and presented by leading Spark contributors and members of the Spark PMC, including Matei Zaharia, the creator of Spark; Reynold Xin, chief architect; Michael Armbrust, lead architect behind Spark SQL and Structured Streaming; Joseph Bradley, one of the drivers behind Spark MLlib and SparkR; and Tathagata Das, lead developer for Structured Streaming.

Collectively, the ebook introduces steps for a developer to understand Spark at a deeper level and speaks to Spark 2.x’s three themes: easier, faster, and smarter. Whether you’re getting started with Spark or are already an accomplished developer, this ebook will arm you with the knowledge to use all of Spark 2.x’s benefits. To learn more, download: 7 Steps for a Developer to Learn Apache Spark
