Course Outline

Introduction

Overview of Data Cleaning

  • Why is Data Cleaning Important?

Case Study: When Big Data Is Dirty

Developing A Thorough Data Cleaning Strategy

Common Data Cleaning Tools

  • Drake
  • OpenRefine
  • Pandas (for Python)
  • Dplyr (for R)

Achieving High Data Integrity

  • Complete
  • Correct
  • Accurate
  • Relevant
  • Consistent

Automating the Data Cleaning Process

Monitoring Your Data Cleaning System

Summary and Conclusion

Requirements

  • An understanding of data analytics concepts.

Audience

  • Data Scientists
  • Data Analysts
  • Business Analysts
 7 Hours

Number of participants



Price per participant

Testimonials (9)

Related Courses

Analytic Functions Fundamentals

21 Hours

Apache Arrow for Data Analysis across Disparate Data Sources

14 Hours

AWS Glue Fundamentals

14 Hours

Azure for Data Engineer

35 Hours

A Practical Introduction to Data Analysis and Big Data

35 Hours

Data and Analytics - from the ground up

42 Hours

Scaling Data Analysis with Python and Dask

14 Hours

Data Analysis for Marketers

14 Hours

Data Analytics With R

21 Hours

Datameer for Data Analysts

14 Hours

Data Analysis with Python, Pandas and Numpy

14 Hours

A Practical Introduction to Data Science

35 Hours

Introduction to dbt Cloud

21 Hours

Dremio for Self-Service Data Analysis

21 Hours

Elasticsearch for Developers

14 Hours

Related Categories

1