Implementation of Movie Recommender System - Python

Last Updated : 11 Jul, 2025

Recommender Systems provide personalized suggestions for items that are most relevant to each user by predicting preferences according to user's past choices. They are used in various areas like movies, music, news, search queries, etc. These recommendations are made in two ways:

Collaborative filtering: Collaborative filtering makes recommendations based on user's past behavior and similar decisions made by other users to predict items user may be interested in. For example, if User A likes movies X and Y and User B likes Y and Z then system might recommend movie Z to User A.
Content-based filtering: Content-based filtering recommends items based on the features of the items and the user's past preferences. For example, if a user likes action movies the system will recommend other action movies based on genres, actors or directors.

In this article we’ll build a basic recommender system using Python that recommends movies based on user past preferences.

Step 1: Import Libraries and Load the Data

In this step we will load pandas ,matplotlib and seaborn library. After that we will load user ratings for movies file. Each row in this file shows a user’s rating for a specific movie. To download file click here: .tsv file.

Output:

👁 User_Rating

User Rating

Step 2: Loading Movie Titles

Now we load another file that matches movie IDs to their titles. This helps us show the actual movie names instead of just numbers. You can download file from here: Movie_Id_Titles.csv.

Output:

👁 Movie_IDs

Movie IDs with their IDs

Step 3: Merging Both Datasets

We now combine the data about user ratings and movie titles and we can see both the movie names and the ratings for each movie.

Output:

👁 Merged_dataset

Merged Dataset

Step 4: Analyzing Movie Ratings

This gives us the average rating for each movie and use to find top-rated movies.

Output:

👁 Analyze_Movie_Rating

Analyzed Movie Rating

we can also see how many ratings each movie has. Movies with more ratings are more important when we make recommendations.

Output:

👁 rating_of_movies

rating of movies

Step 5: Creating a Ratings DataFrame

In this step the DataFrame stores the average rating and number of ratings for each movie. We’ll use this later to filter out movies with few ratings.

Output:

👁 Rating_Dataframe

Rating Dataframe

Step 6: Visualizing Ratings

We use matplotlib and seaborn to create visual charts that make the data easier to understand.

To Shows how many movies got how many ratings and helps identify popular movies.

Output:

👁 Visualize_ratings

Visualize ratings

We now show the distribution of average ratings across movies

Output:

👁 average-rating-distribution

Distribution of average ratings

Step 7: Creating User-Movie Matrix

We now create a matrix where each row represents a user and each column represents a movie. The values in this matrix are the ratings that users have given to each movie.

Output:

👁 user-movie-matrix

User Movie Matrix