Info, dates and topics of our five Autumn R Courses: from programming to data manipulation, from statistics to data mining, everything with R.
Cross-validation is a widely used model selection method. We show how to implement it in R using both raw code and the functions in the caret package.
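As a rough illustration of the "raw code" route, here is a minimal sketch of k-fold cross-validation in base R. The dataset (`mtcars`), the model formula, the number of folds, and the seed are all assumptions chosen for the example, not taken from the original post.

```r
# k-fold cross-validation of a linear model, written by hand in base R.
set.seed(1)          # arbitrary seed, only for reproducibility
k <- 5               # number of folds (an assumption for this sketch)
data <- mtcars       # built-in dataset, used purely for illustration

# Randomly assign each row to one of k folds of near-equal size.
folds <- sample(rep(seq_len(k), length.out = nrow(data)))

# For each fold: fit on the other k - 1 folds, then score the held-out rows.
fold_mse <- sapply(seq_len(k), function(i) {
  train <- data[folds != i, ]
  test  <- data[folds == i, ]
  fit   <- lm(mpg ~ wt + hp, data = train)
  mean((test$mpg - predict(fit, newdata = test))^2)
})

# Root of the average held-out mean squared error across folds.
cv_rmse <- sqrt(mean(fold_mse))
print(cv_rmse)
```

With the caret package, a comparable resampling scheme can be requested through `trainControl(method = "cv", number = 5)` passed as the `trControl` argument of `train()`. The folds caret draws will generally differ from the ones above, so the two error estimates need not match exactly.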
A very interesting paradigm in data analysis arises when the data are hard to represent adequately with a single global function.
We can think of a spectrum of models, ranging from the global statistical model, with a single function and an associated probability distribution, to the decision tree, which fits a separate constant at each leaf.
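To make the two ends of that spectrum concrete, the sketch below compares a single global linear fit with a one-split regression tree (a "stump") that fits one constant per leaf. The simulated step-shaped data, the seed, and the hand-rolled split search are assumptions made for illustration. They are not the method of the original post, and a real analysis would more likely use a tree package.

```r
# Simulated data with a step-shaped signal (purely illustrative).
set.seed(42)
x <- runif(200, 0, 10)
y <- ifelse(x < 5, 2, 8) + rnorm(200, sd = 0.5)

# Global model: one linear function for the whole range of x.
global_fit <- lm(y ~ x)
global_sse <- sum(resid(global_fit)^2)

# Local model: a one-split regression tree, one constant (the mean) per leaf.
# Candidate splits are midpoints between consecutive sorted x values.
xs   <- sort(unique(x))
cuts <- (head(xs, -1) + tail(xs, -1)) / 2
split_sse <- sapply(cuts, function(s) {
  left  <- y[x < s]
  right <- y[x >= s]
  sum((left - mean(left))^2) + sum((right - mean(right))^2)
})

best_cut   <- cuts[which.min(split_sse)]
leaf_means <- c(left  = mean(y[x < best_cut]),
                right = mean(y[x >= best_cut]))
stump_sse  <- min(split_sse)

print(c(global = global_sse, stump = stump_sse))
print(best_cut)
print(leaf_means)
```

On data like these, where no single straight line describes the signal well, the two leaf constants leave a much smaller residual sum of squares than the global fit. On data that really are linear, the comparison would be expected to go the other way.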