Engineering and Technology
Manipulate text data, analyze it and more by mastering regular expressions and string distances in R.
In this course, we will explore the process of analyzing data that is presented in plain text rather than neatly organized tables. While analyzing data in tables can be enjoyable, it is important to be equipped with the skills to extract valuable information from unstructured text. This course will provide you with a comprehensive understanding of creating powerful regular expressions, enabling you to efficiently locate the necessary information for your analyses within a text blob. Furthermore, we will delve into the concept of string distances, which allows us to work with text that may contain typos or scanning errors. By mastering this technique, you will be able to accurately match these imperfect texts with their correct counterparts from other data sources, a process known as record linkage. To facilitate your learning experience, we will utilize real documents pertaining to box office figures in Swiss cinemas. Through the analysis of these authentic materials, you will gain practical insights and develop the necessary skills to effectively extract and analyze data from unstructured text.
by DataCamp
Manipulate text data, analyze it and more by mastering regular expressions and string distances in R...
by DataCamp
Continue your journey to becoming an R ninja by learning about conditional statements, loops, and ve...
by DataCamp
Build multiple-input and multiple-output deep learning models using Keras.
by DataCamp
Learn how to use spaCy to build advanced natural language understanding systems, using both rule-bas...
by DataCamp
Learn all about the advantages of Bayesian data analysis, and apply it to a variety of real-world us...
by DataCamp
Learn how to clean data with Apache Spark in Python.
by DataCamp
Master the complex SQL queries necessary to answer a wide variety of data science questions and prep...
by DataCamp
Consolidate and extend your knowledge of Python data types such as lists, dictionaries, and tuples,...
by DataCamp
Learn to create your own Python packages to make your code easier to use and share with others.
by DataCamp
Learn how to use graphical and numerical techniques to begin uncovering the structure of your data.