Data Science with Yan
Welcome to my channel! And welcome to the world of Statistics and Data Science!
I am a college professor, and some of these videos were originally made to help college students do better in their Statistics and Data Science courses and research projects. I create and share videos whenever I find the time.
Flights dataset | data manipulation | deal with date/day of the week | calendar heat map | R tidyverse
Split strings in R | change variables from characters to numeric | strsplit( )
Using R to draw US maps | regions | Selected States in the United States PART 1
One step boxplot considering two factors | customize your plot color palette
check for normal conditions | normality test | histogram | qq plot
Logarithmic regression| non-linear regression| lm in R| visualization of models
polynomial regression using R | non-linear regression | curved regression
The full process of one-way ANOVA | EDA | aggregate| model process| using R| RStudio
outliers and influential points| how to identify| understand them using data in R
working directory in R | check and change | export an R dataset to a csv document
Leave one out and k-fold cross validation| using R| cv.glm | train and test data | prediction error
smoothing splines in R | degrees of freedom in smooth.spline | data predictions| data matches
data visualization| ggplot2| dplyr| data manipulation| Bar plot with error bars using R
creating dummy variables automatically using R | dummy_cols function|
How to plot any function curves in R | draw function curves using R | plot( ) | curve ( ) R function
What is cross validation? Why do we need it? Leave one out and k-fold cross validation
what is poisson regression | what really are GLMs? | using R | fit the model | real data examples
logistic regression using R | when to use | fit | interpret coefficients| odds | chi-square test
Hypothesis tests on Multiple linear regression using R | T-test| partial F-test| model comparison
Multiple linear regression model using R | lm( ) | variations of MLR | visualize results coefplot( )
boxplot for comparison | before and after| group cross group comparison| ggplot2| R
Modeling using R | simple linear regression| correlations, visualizations, fit a model lm() function
Create dates and times in R lubridate package| make_datetime( ) function | ymd( ) in RStudio
combine different datasets into one | relational data | R for data science | left_join function in R
Tidy messy data | R for data science | tidyr tidyverse package
tibbles and data frames in R | R for data science| book club| How to create tibbles and subset it
EDA part 2| ultimate guide to visualize covariations on two variables | R for data science book club
EDA exploratory data analysis part 1 distributions of one variable | R for data science book club
A guide to help you organize your R scripts | how to find your old code file quickly
Atomic Habits: Get better each day | R for loops, data manipulation, data visualization all in one