Neural Style Transfer with TensorFlow

Last Updated : 15 Jul, 2025

Neural Style Transfer (NST) is a method in deep learning where the details of one picture are combined with the artistic style of another to create a new image. This process allows you to generate an image that has the subject of the first picture but the art style, colors and textures of the second. In this guide we will explain how NST works and how to apply it using TensorFlow.

Understanding Neural Style Transfer

Neural Style Transfer or NST involves three different images:

Content Image: This is the picture with the main subject or scene you want to change. It holds the basic structure and layout you want to keep
Style Image: the picture with the artistic feel, like colors, textures and patterns. You want to use this style to change how your content image looks.
Generated Image: The output image that combines the content of the first image with the style of the second.

👁 Image

An example of style transfer A is a content image, B is output with style image in the bottom left corner

The process of NST uses a type of computer model called a convolutional neural network (CNN). A well-known CNN often used in NST is called VGG19. This computer model analyzes details and features in both the content and style images. To make the new image the computer model works with a formula known as a loss function. This formula keeps the computer to find the best mix of subject from the content image and the artistic elements from the style image.

To understand the Neural Style Transfer more refer to: Overview of Style Transfer

Implementing Neural Style Transfer with TensorFlow

In implementation of NST we'll use the VGG19 model, pre-trained on the ImageNet dataset to extract features from our images.

Step 1: Import Necessary Libraries

First we import the necessary module like TensorFlow v2 with Keras, NumPy for numerical operations, Matplotlib for data visualisation and Keras-specific components for working with pre-trained models and image processing.

Step 2: Image processing

Now, we load and process the image using Keras preprocess input in VGG 19. The expand_dims function adds a dimension to represent a number of images in the input. This preprocess_input function used in VGG 19 converts the input RGB to BGR images and centre these values around 0 according to ImageNet data.

Now, we define the deprocess function that takes the input image and perform the inverse of preprocess_input function that we imported above. To display the unprocessed image we also define a display function.

Now we use the above function to display the style and content images

Output:

👁 Content Image

Content Image

👁 Style Image

Style Image

Step 3: Model Initialization

Now we initialize the VGG model with ImageNet weights we will also remove the top layers and make it non-trainable. We use VGG19 to extract image features.

include_top=False removes the classification head.
We freeze the model to use it as a fixed feature extractor.

Output:

👁 Screenshot-2025-04-09-153459

Step 4: Content Model defining

Now we define the content and style model using Keras.Model API. The content model takes the image as input and output the feature map from "block5_conv1" from the above VGG model.

Output:

👁 Screenshot-2025-04-09-153923

Step 5: Style Model defining:

Now, we define the content and style model using Keras.Model API. The style model takes an image as input and output the feature map from "block1_conv1, block3_conv1 and block5_conv2" from the above VGG model. We use multiple layers for style to capture texture patterns at different scales. Each layer captures style at different levels of abstraction like edges, patterns and textures.

Content Loss: Now we define the content loss function it will take the feature map of generated and real images and calculate the mean square difference between them.

Gram Matrix: Now we define the gram matrix function. This function also takes the real and generated images as the input of the model and calculates gram matrices of them before calculate the style loss weighted to different layers.

Style Loss: The function style_cost defined by this code determines the style loss between a generated image and a style image that is supplied. In neural style transfer algorithms style loss is frequently employed to create an image that blends the content of two different images with their styles.

Content Loss: The content loss between a style image and a generated image is determined by the function content_cost, which is defined in this code. To make sure that the generated image preserves the original image's content, neural style transfer algorithms frequently employ content loss.

Step 6: Training Function

In this step we define the training_loop function that optimizes the generated image using both content and style losses. It begins by loading and preprocessing the content and style images then initializes the generated image as a trainable variable.

GradientTape watches changes in the generated image.
J_content: difference in content features.
J_style: difference in style (Gram matrices).
J_total: weighted combination using a and b.

Step 7: Model Training

Now, we train our model using the training function we defined above.

Output:

👁 Screenshot-2025-04-09-154622

Step 8: Model Prediction

In the final step, we plot the final and intermediate results.

Output:

👁 Last 10 Generated images

Last 10 Generated images

👁 Best Generated image

Best Generated image

This output shows the last 10 generated images during Neural Style Transfer where the content (dog) is preserved and style patterns are gradually applied. Each image reflects slight improvements in blending style with content across iterations.

Complete Code : click here

Comment

Article Tags:

Deep Learning

AI-ML-DS With Python

Explore

Basics

Neural Networks

Deep Learning Models

Model Evaluation

Deep Learning Frameworks

Projects

Courses

URL: https://www.geeksforgeeks.org/deep-learning/neural-style-transfer-with-tensorflow/