Udacity

Spark and Data Lakes

Engineering and Technology

Short Description

Learn about the big data ecosystem and the power of Apache Spark for data wrangling and transformation.

Long Description

This course on Udacity, Spark and Data Lakes, provides a comprehensive understanding of the big data ecosystem, data lakes, and the Spark framework. It covers topics such as the purpose and evolution of data lakes, a comparison between Spark and Hadoop, and the features of lakehouse architecture. Additionally, the course delves into the essentials of Spark, including data wrangling with functional programming, processing data with Spark DataFrames and Spark SQL, and working with common formats like CSV and JSON. Furthermore, it explores the usage of Spark and data lakes in the AWS Cloud, utilizing distributed data storage with Amazon S3 and configuring AWS Glue for running Spark Jobs. The course also covers ingesting and organizing data in lakehouse architecture on AWS, using Spark and AWS Glue for ELT processes, creating a Glue Data Catalog and Tables, and leveraging AWS Athena for ad-hoc queries. Finally, the course concludes with a hands-on project where learners act as data engineers for the STEDI team, building a data lakehouse solution for sensor data that involves building an ELT pipeline, processing data with Spark and AWS Glue, and loading the analytics tables back into the lakehouse architecture.

Course Details

Duration
4 weeks
Difficulty
Intermediate
Format
Short Course
Price
USD399.00
Course Link
More Information
Udacity
Description
Udacity is an online learning platform that offers a wide range of courses and programs in various fields such as technology, business, data science, and artificial intelligence. It was founded in 2012 by Sebastian Thrun, David Stavens, and Mike Sokolsky with the aim of providing accessible and affordable education to individuals worldwide. Udacity's courses are designed in collaboration with industry experts and leading companies, ensuring that the content is relevant and up-to-date. The platform offers both self-paced courses and guided programs, allowing learners to choose the learning style that suits them best. Udacity also provides career services and support, including resume reviews, interview preparation, and job placement assistance, to help learners transition into their desired careers.