Course Overview
This course is designed as an entry point for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Spark.
Topics include:
- An overview of the Hortonworks Data Platform (HDP), including HDFS and YARN
- Using Spark Core APIs for interactive data exploration
- Spark SQL and DataFrame operations
- Spark Streaming and DStream operations
- Data visualization, reporting, and collaboration
- Performance monitoring and tuning
- Building and deploying Spark applications
- Introduction to the Spark Machine Learning Library