Organize your data manipulation tasks in a standard way, write clean and efficient code, and build reproducible data management processes, using the most modern R tools: tidyr, dplyr and lubridate.
Info, dates and topics of our five Autumn R Courses: from programming to data manipulation, from statistics to data mining, everything with R starting with R for Beginners course.
Quantide in partnership with DataCamp , offers an introductory online course to R completely in Italian. The course is free and open to anyone who wishes to participate. There is no requirement, you need only a Pc or a Mac with an Internet access.
Principal components regression (PCR) is a regression method based on Principal Component Analysis: discover how to perform this Data Mining technique in R
If you want to compute arbitrary operations on a data frame returning more than one number back, use dplyr do()! Tips and suggestions, in SE and NSE version.
Let's go deeper into methods overloading, and play around with classes, environment, operators and other R "oddities". How? Trying to emulate some C function in R.
Reshape your data from long to wide, split a column, aggregate: a comparison between tidyr and reshape2 R packages to tidy data
I would like to propose, as a kind of tribute and language game, a method for inventing words —actually, more an algorithm than a method.
Detect sentinel values, recode factor variables, replace missing values: a tutorial on various steps in data preparation using R.
Cross-validation is a widely used model selection method. We show how to implement it in R using both raw code and the functions in the caret package.