Understanding the Capabilities of DeepSeek R1 Large Language Models

Published on February 3, 2025

AI/ML Technical Content Strategist

👁 Understanding the Capabilities of DeepSeek R1 Large Language Models

DeepSeek R1 has, for good reason, taken the AI/ML community by storm these past weeks, and has even in fact spread beyond to the wider world with major effects on both the economy and politics. This is largely because of the model suite’s open-source nature & incredibly low training price, which has shown the greater community that training SOTA AI models my not require nearly as much capital or proprietary research as previously thought.

In the first part of this series, we introduced DeepSeek R1 and showed how to run the model using Ollama. In this follow up, we will begin with a deeper dive into what actually makes R1 so special. We will focus on analyzing model’s unique Reinforcement Learning (RL) paradigm to see how reasoning capabilities of LLMs can be incentivized purely through RL, and, afterwards, discuss how the distillation of these techniques to other models allows us to share these capabilites with existing releases. We will conclude with a short demonstration on how to setup and run DeepSeek R1 models with GPU Droplets using 1-Click Model GPU Droplets.

Thanks for learning with the DigitalOcean Community. Check out our offerings for compute, storage, networking, and managed databases.

Learn more about our products

About the author

👁 James Skelton

James Skelton

Author

AI/ML Technical Content Strategist

Category:

Tags:

Still looking for an answer?

Ask a question Search for more help

Was this helpful?

This textbox defaults to using Markdown to format your answer.

You can type !ref in this text area to quickly search our full set of tutorials, documentation & marketplace offerings and insert the link!

👁 Creative Commons
This work is licensed under a Creative Commons Attribution-NonCommercial- ShareAlike 4.0 International License.

Table of contents

Deploy on DigitalOcean
Click below to sign up for DigitalOcean's virtual machines, Databases, and AIML products.
Sign up

👁 Image

Become a contributor for community

Get paid to write technical tutorials and select a tech-focused charity to receive a matching donation.

👁 Image

DigitalOcean Documentation

Full documentation for every DigitalOcean product.

Learn more

👁 Image

Resources for startups and AI-native businesses

The Wave has everything you need to know about building a business, from raising funding to marketing your product.

Learn more

Get our newsletter

Stay up to date by signing up for DigitalOcean’s Infrastructure as a Newsletter.

New accounts only. By submitting your email you agree to our Privacy Policy

The developer cloud

Scale up as you grow — whether you're running one virtual machine or ten thousand.

View all products

Start building today

From GPU-powered inference and Kubernetes to managed databases and storage, get everything you need to build, scale, and deploy intelligent applications.

Dark mode is coming soon.

URL: https://www.digitalocean.com/community/tutorials/deepseek-r1-large-language-model-capabilities