VOOZH about

URL: https://www.digitalocean.com/community/tutorials/deepseek-r1-gpu-droplets?comment=209772

⇱ How to run Deepseek R1 LLMs on GPU Droplets | DigitalOcean


How to run Deepseek R1 LLMs on GPU Droplets

Updated on August 4, 2025

By James Skelton

AI/ML Technical Content Strategist

👁 How to run Deepseek R1 LLMs on GPU Droplets

DigitalOcean is closely observing the increasing parity between open-source Large Language Models (LLMs) and proprietary alternatives. A crucial capability of these models is reasoning, which involves logical and sensible thinking.

For a long time, LLMs were very linear. When given a prompt, they provided an answer. There is no meta-logic involved, or any stage where the model might be able to self-correct if it is mistaken. This effectively hinders their ability to reason, question, or adjust to problems that may be inherent to the instruction they are responding to. For example, with low-reasoning models, complex language based mathematics problems may be too complicated to solve without explicit instructions and work on the user’s part.

Enter the latest generation of reasoning LLMs. Ushered in by OpenAI’s O1 model series, reasoning models have taken the community by storm as they have effectively closed the gap between human and machine learning capabilities on a variety of logic tasks. These include coding, mathematics, and even scientific reasoning.

Like with all previous steps forward in development, the open source community has been working hard to match the closed-source models capabilities. Recently, the first open-source models to achieve this level of abstract reasoning, the Deepseek R1 series of LLMs, was released to the public.

In the first part of this 2 part article series, we will show how to run these models on DigitalOcean’s GPU Droplets using Ollama. Readers can expect to learn how to set up the GPU Droplet, install Ollama, and begin reasoning with Deepseek R1.

Key takeaways:

  • DeepSeek R1 is an open large language model known for being a cost-effective alternative to proprietary AI systems, and running it on a DigitalOcean GPU Droplet allows you to harness its capabilities (for tasks like coding assistance or Q&A) without needing to invest in high-end hardware on premises.
  • This guide shows how to set up a DigitalOcean GPU Droplet and use Ollama (a local LLM runtime) to download and serve the DeepSeek R1 model, so you can interact with it via a chat interface or API on your own cloud instance.
  • By deploying DeepSeek R1 on a GPU (instead of a CPU), you ensure the model runs efficiently and can handle complex queries, taking advantage of the droplet’s GPU acceleration to achieve faster responses and support larger model variants than would be feasible on a typical laptop or small server.
  • Hosting an LLM like DeepSeek R1 on your own Droplet gives you full control over the model and data (since queries aren’t sent to an external API), and DigitalOcean’s platform even offers convenient deployment options like a one-click app to get the model running quickly for experimentation or production use.

Thanks for learning with the DigitalOcean Community. Check out our offerings for compute, storage, networking, and managed databases.

Learn more about our products

About the author

👁 James Skelton
James Skelton
Author
AI/ML Technical Content Strategist
See author profile
Category:

Still looking for an answer?

Was this helpful?

This textbox defaults to using Markdown to format your answer.

You can type !ref in this text area to quickly search our full set of tutorials, documentation & marketplace offerings and insert the link!

what droplet configuration would you recommend for the 671b model?

what droplet configuration would you recommend for the 671b model?

Great article! I have a question about pricing. After you create the droplet, you are only charged for the time the GPU is being used. Is this correct? So the droplet can exist, but if you are not actively using it, you are not charged? Thank you.

Running DeepSeek R1 LLMs on GPU droplets (such as those provided by cloud services like Paperspace, AWS, or DigitalOcean) allows for efficient inference and fine-tuning of large language models at scale. Here’s how to get started:

  1. Choose a Suitable GPU Droplet Ensure your droplet has enough VRAM—ideally 16GB or more (e.g., NVIDIA A100, V100, or L4 GPUs). LLMs like DeepSeek R1 require significant memory, especially for larger variants.

  2. Set Up the Environment Install the necessary deep learning framework. DeepSeek R1 models are compatible with PyTorch and transformers libraries. Run:

    nginx CopyEdit

    pip install torch transformers accelerate

  3. Download the Model Visit Hugging Face or DeepSeek’s official release page. Use transformers to load the model:

    python CopyEdit

    from transformers import AutoModelForCausalLM, AutoTokenizer model = AutoModelForCausalLM.from_pretrained("deepseek-ai/deepseek-llm-7b", device_map="auto") tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/deepseek-llm-7b")

  4. Optimize for GPU Use device_map="auto" or manually assign the model to the GPU using .to("cuda"). Install accelerate for seamless device handling and memory-efficient inference.

  5. Run Inference Once the model and tokenizer are loaded:

    python CopyEdit

    inputs = tokenizer("Hello, DeepSeek!", return_tensors="pt").to("cuda") outputs = model.generate(**inputs) print(tokenizer.decode(outputs[0]))

  6. Monitor GPU Usage Use nvidia-smi to monitor resource usage and avoid out-of-memory errors.

Great article! I really appreciates your efforts.

👁 Creative Commons
This work is licensed under a Creative Commons Attribution-NonCommercial- ShareAlike 4.0 International License.
  • Deploy on DigitalOcean

    Click below to sign up for DigitalOcean's virtual machines, Databases, and AIML products.

Become a contributor for community

Get paid to write technical tutorials and select a tech-focused charity to receive a matching donation.

DigitalOcean Documentation

Full documentation for every DigitalOcean product.

Resources for startups and AI-native businesses

The Wave has everything you need to know about building a business, from raising funding to marketing your product.

Get our newsletter

Stay up to date by signing up for DigitalOcean’s Infrastructure as a Newsletter.

New accounts only. By submitting your email you agree to our Privacy Policy

The developer cloud

Scale up as you grow — whether you're running one virtual machine or ten thousand.

Start building today

From GPU-powered inference and Kubernetes to managed databases and storage, get everything you need to build, scale, and deploy intelligent applications.

© 2026 DigitalOcean, LLC.Sitemap.
Dark mode is coming soon.