Welcome to ANLY 500 Analytics I
This course provides comprehensive R programming instruction for data analytics, covering foundational concepts through advanced statistical modeling and visualization. Each week builds upon previous knowledge with hands-on, code-first workflows grounded in statistical theory and reproducible research practices.
Primary Textbook: Field, A., Miles, J., & Field, Z. (2012). Discovering Statistics Using R. SAGE Publications.
Author: Ziyuan Huang | Course: Analytics I | Institution: Harrisburg University
📚 Weekly Tutorials
Week 01: Introduction to R
Complete beginner's guide to R programming! Learn R basics, data types, structures (vectors, matrices, data frames), functions, data manipulation, visualization, and practical analysis with Palmer Penguins dataset. 40+ visualizations with plain English explanations.
Week 02: R for Data Analytics
Introduction to R programming, data types, descriptive statistics, and basic hypothesis testing. Learn central tendency, dispersion measures, and visualization fundamentals.
Week 03: Data Wrangling & EDA
Master data import, tidy data principles, dplyr verbs, handling missing values, and exploratory data analysis with ggplot2. Includes correlation and basic inference.
Week 04: Statistical Inference
Deep dive into hypothesis testing, confidence intervals, effect sizes (Cohen's d), statistical power, and Type I/II errors. Learn the fundamental equation: Outcome = Model + Error.
Week 05: Data Visualization
Comprehensive ggplot2 tutorial covering histograms, scatterplots, bar graphs, line graphs, data reshaping, and professional themes. Master the grammar of graphics!
Week 06: Data Screening Part 1
Essential data cleaning techniques: accuracy checking, handling missing data (MCAR, MAR, MNAR), multiple imputation with MICE, and multivariate outlier detection using Mahalanobis distance.
Week 07: Data Screening Part 2
Comprehensive assumption checking with 60+ visualizations! Master independence, multicollinearity, linearity, normality, and homoscedasticity. Includes influential cases, Cook's Distance, and transformation guide.
Week 08: Correlation Analysis
Complete correlation guide: Pearson, Spearman, Kendall, point-biserial, partial & semi-partial correlations. Modern visualizations with corrplot & ggpairs. Includes comparing correlations, effect sizes, and APA reporting.
Week 09: Linear Regression
Introduction to Linear Regression. Simple and multiple regression, hierarchical models, dummy coding, and regression diagnostics. Learn to predict outcomes and test specific hypotheses.
Week 10: Mediation & Moderation
Advanced regression topics: Mediation (mechanisms) and Moderation (interactions). Learn modern Bootstrapping methods, Simple Slopes analysis, and how to interpret "It Depends" effects.
Week 11: Comparing Two Means
Comparing two means using the t-test (Independent & Paired). Learn about Signal-to-Noise Ratio, Assumptions (Normality, Homogeneity), and Effect Sizes (Cohen's d).
Week 12: Comparing Several Means (ANOVA)
Introduction to One-Way ANOVA. Understand the F-Ratio (Signal-to-Noise), Post Hoc Tests (Bonferroni), and Effect Sizes (Omega Squared). Visualizing SST, SSM, and SSR.
🎯 Key Features
- Code-First Approach: All examples are executable and reproducible
- Theory-Grounded: IEEE-style citations from Field et al. (2012)
- Beginner-Friendly: Extensive explanations and interpretations
- Professional Quality: Publication-ready figures and tables
- Reproducible: Seedhash integration for consistent results
📖 References
- Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences (3rd ed.). Lawrence Erlbaum Associates.
- Field, A., Miles, J. and Field, Z. (2012). Discovering Statistics Using R. Sage Publications Ltd., London.
- Hays, W. L. (1994). Statistics (5th ed.). Harcourt Brace.
- Howell, D. C. (2012). Statistical Methods for Psychology (8th ed.). Cengage Learning.
- Little, R. J. A., & Rubin, D. B. (2020). Statistical Analysis with Missing Data (3rd ed.). Wiley.
- Tabachnick, B. G., & Fidell, L. S. (2019). Using Multivariate Statistics (7th ed.). Pearson.
- Tufte, E. R. (2001). The Visual Display of Quantitative Information (2nd ed.). Graphics Press.
- Van Buuren, S. (2018). Flexible Imputation of Missing Data (2nd ed.). CRC Press.
- Wickham, H. (2016). ggplot2: Elegant Graphics for Data Analysis (2nd ed.). Springer.