Engineering and Technology
Learn to acquire data from common file formats and systems such as CSV files, spreadsheets, JSON, SQL databases, and APIs.
This course focuses on the initial step of data analysis, which is acquiring the necessary data. It provides instruction on constructing pipelines to import data stored in commonly used formats. The course utilizes pandas, a prominent Python library for analytics, to retrieve data from diverse sources such as survey response spreadsheets, public service request databases, and popular review site APIs. Throughout the course, participants will gain knowledge on optimizing imports to extract only relevant information and resolving issues related to incorrect data types. Additionally, participants will have the opportunity to create a customized dataset by combining data from multiple sources.
by DataCamp
Learn to acquire data from common file formats and systems such as CSV files, spreadsheets, JSON, SQ...
by DataCamp
Learn how to build and test data engineering pipelines in Python using PySpark and Apache Airflow.
by DataCamp
This course provides an intro to clustering and dimensionality reduction in R from a machine learnin...
by DataCamp
Learn how to implement and schedule data engineering workflows.
by DataCamp
Master core concepts about data manipulation such as filtering, selecting and calculating groupwise...
by DataCamp
Learn powerful command-line skills to download, process, and transform data, including machine learn...
by DataCamp
Build real-world Excel skills in just 4 hours. This course will show you time-saving shortcuts and e...
by DataCamp
Learn to use spreadsheets to clean, analyze, and draw insights from data. Discover how to sort, filt...
by DataCamp
Learn to streamline your machine learning workflows with tidymodels.
by DataCamp
Leverage your Python and SQL knowledge to create an ETL pipeline to ingest, transform, and load data...