CG数据库 >> Cleaning Bad Data in R

Cleaning Bad Data in R的图片1MP4 | Video: h264, 1280x720 | Audio: AAC, 48 KHz, 2 Ch

Genre: eLearning | Language: English + .SRT | + Exercise Files

Level: Beginner | Duration: 1h 54m | 265 MB

Data integrity is the new focal point of the data science revolution. Now that everybody is onboard with the role of data in people's lives and business, it's not an unfair question to ask, "Can you prove that your data is accurate?" In this course, you can learn how to identify and address many of the data integrity issues facing modern data scientists, using R and the tidyverse. Discover how to handle missing values and duplicated data. Find out how to convert data between different units and tackle poorly formatted text. Plus, learn how to detect outliers, address structural issues, and identify red flags that indicate potential data quality issues.

Where possible, instructor Mike Chapple shows how to correct the issues using R, but the same principles can be applied to any statistical programing language.

Topics include:

Missing data

Duplicate rows and values

Converting data

Formatting data

Working with tidy data

Tidying data sets

Dealing with suspicious data

Cleaning Bad Data in R的图片2

Password/解压密码

-0daydown

发布日期: 2018-08-23