Root-Mean-Square Error in R Programming

Last Updated : 29 Apr, 2026

Root Mean Squared Error (RMSE) is the square root of the mean of the squared errors. Because it is scale-dependent, it is mainly used to compare the prediction errors of different models or configurations for the same variable, not across variables measured in different units. In regression, RMSE summarizes how closely the fitted line matches the observed data.

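Formula for RMSE

$$\text{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}$$

Where:

$y_i$ = actual value
$\hat{y}_i$ = predicted value
$n$ = number of observations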

Note: The differences between the actual values and the predicted values are known as residuals.

Significance of RMSE

RMSE is significant for the following reasons.

Scale-Dependent: RMSE has the same units as the target variable. A lower RMSE indicates better model performance, but the value must be compared with the scale of the target variable to make sense.
Sensitive to Outliers: Since RMSE squares the error terms, larger errors have a disproportionately large effect, making RMSE sensitive to outliers.
Comparing Models: RMSE can be used to compare models. A model with a lower RMSE value is generally considered better at predicting the target variable.
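
The outlier sensitivity described above can be illustrated with a short sketch. The vectors and the helper function `rmse_manual()` are hypothetical, introduced only for this illustration: a single badly wrong prediction is enough to inflate the RMSE several times over.

```r
# Hypothetical helper implementing the RMSE formula directly
rmse_manual <- function(actual, predicted) {
  sqrt(mean((actual - predicted)^2))
}

# Illustrative data (assumed values, not from a real dataset)
actual            <- c(1.5, 1.0, 2.0, 7.4, 5.8, 6.6)
predicted         <- c(1.0, 1.1, 2.5, 7.3, 6.0, 6.2)

# Same predictions, except the last one is now off by 5 units
predicted_outlier <- c(1.0, 1.1, 2.5, 7.3, 6.0, 1.6)

rmse_clean   <- rmse_manual(actual, predicted)
rmse_outlier <- rmse_manual(actual, predicted_outlier)

print(rmse_clean)
print(rmse_outlier)
```

Only one of the six errors changed, yet it dominates the result because each error is squared before averaging.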

Computing RMSE in R

Now we will discuss different methods to compute RMSE in the R programming language.

1. Simple RMSE Calculation

Let’s first compute the RMSE between two vectors (actual and predicted values) manually.
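
A minimal sketch of the manual calculation follows. The vectors `actual` and `predicted` are illustrative values (an assumption, not necessarily the article's original data), chosen so that the result agrees with the output shown for this section.

```r
# Illustrative actual and predicted values (assumed for this sketch)
actual    <- c(1.5, 1.0, 2.0, 7.4, 5.8, 6.6)
predicted <- c(1.0, 1.1, 2.5, 7.3, 6.0, 6.2)

# Step 1: residuals (actual minus predicted)
residuals <- actual - predicted

# Step 2: square them, take the mean, then the square root
rmse_value <- sqrt(mean(residuals^2))

print(rmse_value)
```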

Output:

[1] 0.3464102

The above code calculates the RMSE between the actual and predicted values manually by following the RMSE formula.

2. Calculating RMSE Using the Metrics Package

The Metrics package offers a convenient rmse() function. First, install and load the package.
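
A minimal sketch, assuming the same illustrative vectors as in the manual calculation. `Metrics::rmse()` takes the actual values first and the predicted values second. The `requireNamespace()` guard is an addition for this sketch so that the code still runs when the package is not yet installed.

```r
# Illustrative actual and predicted values (assumed for this sketch)
actual    <- c(1.5, 1.0, 2.0, 7.4, 5.8, 6.6)
predicted <- c(1.0, 1.1, 2.5, 7.3, 6.0, 6.2)

if (requireNamespace("Metrics", quietly = TRUE)) {
  # rmse(actual, predicted) from the Metrics package
  rmse_metrics <- Metrics::rmse(actual, predicted)
  print(rmse_metrics)
} else {
  # One-time setup: install.packages("Metrics")
  message("Package 'Metrics' is not installed; run install.packages('Metrics') first.")
}
```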

Output:

[1] 0.3464102

3. Calculating RMSE Using the caret Package

The caret package is a popular package for machine learning and model evaluation. It provides a similar RMSE() function.
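
A minimal sketch, again assuming the illustrative vectors used earlier. Note the argument order of `caret::RMSE()`: predictions come first (`pred`), observed values second (`obs`). Naming the arguments avoids mixing them up, although RMSE itself is symmetric in the two. The `requireNamespace()` guard is an addition for this sketch.

```r
# Illustrative actual and predicted values (assumed for this sketch)
actual    <- c(1.5, 1.0, 2.0, 7.4, 5.8, 6.6)
predicted <- c(1.0, 1.1, 2.5, 7.3, 6.0, 6.2)

if (requireNamespace("caret", quietly = TRUE)) {
  # RMSE(pred, obs) from the caret package
  rmse_caret <- caret::RMSE(pred = predicted, obs = actual)
  print(rmse_caret)
} else {
  # One-time setup: install.packages("caret")
  message("Package 'caret' is not installed; run install.packages('caret') first.")
}
```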

Output:

[1] 0.3464102

4. Calculating RMSE for Regression Models

In regression models, RMSE is used to evaluate the performance of the model. Let’s fit a linear regression model in R and compute the RMSE for the predicted values.
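
A minimal sketch, assuming the built-in `mtcars` dataset and in-sample predictions (the model is evaluated on the same rows it was fitted on, with no train/test split). These assumptions are consistent with the mpg-on-hp model and the output reported for this section.

```r
# Fit a simple linear regression: mpg explained by horsepower
# (mtcars ships with base R; its use here is an assumption)
model <- lm(mpg ~ hp, data = mtcars)

# In-sample predictions for the same 32 cars
predictions <- predict(model, newdata = mtcars)

# RMSE between observed and predicted mpg
rmse_model <- sqrt(mean((mtcars$mpg - predictions)^2))

print(rmse_model)
```

Because the RMSE here is computed on the training data, it tends to be optimistic; evaluating on held-out data gives a fairer estimate of predictive accuracy.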

Output:

[1] 3.740297

This example fits a linear regression model predicting the miles per gallon (mpg) of cars based on horsepower (hp) and computes the RMSE to evaluate the model's prediction accuracy.

Interpreting RMSE involves understanding its relationship with the data.

Low RMSE: Indicates that the model's predictions are close to the actual values.
High RMSE: Indicates large errors in prediction.

However, the RMSE value should always be interpreted in the context of the data. For example, an RMSE of 10 might be considered good for a dataset where the target variable ranges between 100 and 500, but it could indicate poor performance if the target variable ranges between 0 and 20.
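
One way to make this comparison concrete is to divide the RMSE by the range of the target variable. This range-normalized RMSE is one common convention, used here as an assumption for illustration; the variable names are hypothetical.

```r
rmse_value <- 10

# Target ranging from 100 to 500: error is a small share of the range
nrmse_wide <- rmse_value / (500 - 100)

# Target ranging from 0 to 20: error is half of the range
nrmse_narrow <- rmse_value / (20 - 0)

print(nrmse_wide)    # 0.025, i.e. 2.5% of the range
print(nrmse_narrow)  # 0.5, i.e. 50% of the range
```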

5. Visualizing RMSE

Visualizing the performance of our model can help in understanding where the model is underperforming. A scatter plot of actual vs. predicted values can provide insights into how well the model fits the data.
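
A minimal sketch using base R graphics, assuming the same `mtcars` model of mpg on hp as in the regression example. The plotting choices (colors, labels, title) are illustrative. The red line drawn by `abline(a = 0, b = 1)` marks where actual equals predicted.

```r
# Refit the model so this block is self-contained (mtcars is assumed)
model <- lm(mpg ~ hp, data = mtcars)
predicted <- predict(model, newdata = mtcars)

# Scatter plot of actual vs. predicted values
plot(mtcars$mpg, predicted,
     xlab = "Actual mpg",
     ylab = "Predicted mpg",
     main = "Actual vs. Predicted Values",
     pch = 19, col = "steelblue")

# Reference line with intercept 0 and slope 1 (actual = predicted)
abline(a = 0, b = 1, col = "red", lwd = 2)
```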

Output:

[Figure: scatter plot of actual vs. predicted values]

The closer the points are to the red line (where actual = predicted), the better the model's predictions.


URL: https://www.geeksforgeeks.org/r-language/root-mean-square-error-in-r-programming/