
- This event has passed.
Introduction to Apache Spark
November 18, 2019 @ 8:30 am - 12:30 pm CST

Are you curious about Apache Spark? Come learn about it in this introductory class. We will cover:
Apache Spark Basics:
* What is Apache Spark?
* Starting the Spark Shell
* Using the Spark Shell
* Getting Started with Datasets and DataFrames
* DataFrame Operations
Working with DataFrames and Schemas:
* Creating DataFrames from Data Sources
* Saving DataFrames to Data Sources
* DataFrame Schemas
* Eager and Lazy Execution
This training assumes some familiarity with HDFS and YARN (though the discussion is not specific to Hadoop, this training assumes Hadoop usage). The instructor is happy to cover these topics if they are of general interest.