DataCamp

Joining Data with dplyr

Engineering and Technology

Short Description

Learn to combine data across multiple tables to answer more complex questions with dplyr.

Long Description

This course focuses on the essential skills required in data science to effectively analyze complex datasets that are distributed across multiple tables. By joining these tables together, you will gain the ability to extract valuable insights from the combined data. Throughout the course, you will have the opportunity to apply these skills using a captivating dataset from the Rebrickable website, which encompasses comprehensive information about LEGO sets, parts, themes, and colors. Despite the dataset being fragmented across numerous tables, you will progressively work with it while mastering six distinct types of joins. These include four mutating joins: inner join, left join, right join, and full join, as well as two filtering joins: semi join and anti join. In the final chapter, you will further enhance your expertise by applying these newly acquired skills to Stack Overflow data. This dataset comprises nearly 300,000 Stack Overflow questions that are specifically tagged with R, encompassing details about their answers, the date they were asked, and their score. Prepare yourself to elevate your dplyr skills to the next level with this comprehensive course.

Course Details

Duration
4 hours
Format
Short Course
Price
USD39.00
Course Link
More Information
DataCamp
Description
DataCamp is an online learning platform that offers interactive courses and tutorials for data science and analytics. It provides a wide range of courses covering topics such as Python, R, SQL, machine learning, data visualization, and more. The platform offers a hands-on learning experience through coding exercises and projects, allowing users to practice and apply their skills in real-world scenarios. DataCamp also offers a personalized learning experience with adaptive learning technology that adjusts the course content based on the user's skill level and progress. It is widely used by individuals, professionals, and organizations to enhance their data science skills and stay up-to-date with the latest trends and technologies in the field.