Coursera - Data Analysis
WEBRip | English | MP4 + PDF Slides | 960 x 540 | AVC ~33.8 kbps | 30 fps
AAC | 128 Kbps | 44.1 KHz | 2 channels | Subs: English (.srt) | 12h 36mn | 949 MB
Genre: eLearning Video / Programming
This course is an applied statistics course focusing on data analysis. The course will begin with an overview of how to organize, perform, and write-up data analyses. Then we will cover some of the most popular and widely used statistical methods like linear regression, principal components analysis, cross-validation, and p-values. Instead of focusing on mathematical details, the lectures will be designed to help you apply these techniques to real data using the R statistical programming language, interpret the results, and diagnose potential problems in your analysis. You will also have the opportunity to critique and assist your fellow classmates with their data analyses.
Course Content:
The structure of a data analysis (steps in the process, knowing when to quit, etc.)
Types of data (census, designed studies, randomized trials)
Types of data analysis questions (exploratory, inferential, predictive, etc.)
How to write up a data analysis (compositional style, reproducibility, etc.)
Obtaining data from the web (through downloads mostly)
Loading data into R from different file types
Plotting data for exploratory purposes (boxplots, scatterplots, etc.)
Exploratory statistical models (clustering)
Statistical models for inference (linear models, basic confidence intervals/hypothesis testing)
Basic model checking (primarily visually)
The prediction process
Study design for prediction
Cross-validation
A couple of simple prediction models
Basics of simulation for evaluating models
Ways you can fool yourself and how to avoid them (confounding, multiple testing, etc.)
发布日期: 2016-02-18