Announcing Free Spark 101 Training Day

We are proud to be offering a free training day in Ottawa on Tuesday April 25, 2017 in  Ottawa. Please email inquiry@wisewithdata.com if you would like to attend. Spaces are limited. Below is a syllabus of the topics that will be covered, which includes live coding exercises.

Spark 101 Training Day Syllabus

  • Distributed Computing Basics
    • Grid and Cluster Computing
    • Partitioning
    • Map Operations
    • Reduce Operations
    • Data Skew
  • Spark Architecture
    • Drivers, Workers and Executors
    • RDD, Dataframes and Datasets
    • Lazy Execution
    • Memory and Caching
    • The DAG and the Optimizer
    • Shuffle and Broadcast
    • Spark ML and GraphX
    • API’s
  • The Spark Ecosystem
    • Cluster Managers
    • Data and Job Orchestration
    • Workbooks
    • 3rd Party Packages
    • The Thrift Server
  • Apache Zeppelin Workbook
    • Interpreters
    • Graphs
    • Exporting results
  • Python Fundamentals
    • Python philosophy
    • Syntax
    • Data structures
    • Data Science Ecosystem
  • The PySpark API overview
    • The Spark Context
    • Data Structures
    • Libraries
    • SQL
    • ML Pipelines
    • Streaming
    • Dataframe Deep Dive
  • Spark Programming Exercises