R has a vast array of useful and versatile functions that excel at statistical data analysis. These functions also come in handy for data wrangling, contrary to the common notion that R is unsuitable for manipulating data. This article demonstrates the use of the merge function through a case example in R.

The implication of R-Squared

The R squared value, also called the coefficient of determination, measures how well the data points fit a regression equation. More specifically, it is the proportion of the variation in the dependent variable that is explained by the independent variables in the regression equation. The value of R squared can change when variables are added to or removed from the regression model. R squared values are typically used as a measure of the effectiveness of a model. Hence, a high R squared value (as a rough rule of thumb here, anything above 55%) can be an indicator of a capable model, although what counts as high depends on the field and the data.
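
To make the definition concrete, the following is a minimal sketch of how R squared can be read from a fitted model and recomputed by hand. It assumes R's built-in mtcars data set and the predictors wt and hp, which were picked only for illustration and are not part of the article's case example.

```r
# Illustrative only: mtcars and the chosen predictors are assumptions,
# not the data used elsewhere in this article.

# Fit a simple linear regression of fuel economy on car weight
fit_one <- lm(mpg ~ wt, data = mtcars)

# R squared as reported by summary()
r2_reported <- summary(fit_one)$r.squared

# R squared computed by hand: 1 - (residual sum of squares / total sum of squares)
ss_res <- sum(residuals(fit_one)^2)
ss_tot <- sum((mtcars$mpg - mean(mtcars$mpg))^2)
r2_manual <- 1 - ss_res / ss_tot

# Adding a second predictor changes R squared
fit_two <- lm(mpg ~ wt + hp, data = mtcars)
r2_two <- summary(fit_two)$r.squared

# Adjusted R squared penalizes the extra predictor
adj_r2_two <- summary(fit_two)$adj.r.squared

print(c(reported = r2_reported, manual = r2_manual,
        two_predictors = r2_two, adjusted = adj_r2_two))
```

In an ordinary least squares model with an intercept, adding a predictor never lowers R squared, so a rising value alone does not show that the new variable is useful. Adjusted R squared is one common way to account for this when comparing models with different numbers of variables.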