How to Perform Batch Inferencing with DigitalOcean’s 1-Click Models

Published on November 27, 2024

AI Technical Writer

👁 How to Perform Batch Inferencing with DigitalOcean’s 1-Click Models

Introduction

DigitalOcean’s 1-Click Models, powered by Hugging Face, makes it easy to deploy and interact with popular large language models such as Mistral, Llama, Gemma, Qwen, and more, all on the most powerful GPUs available in the cloud. Utilizing NVIDIA H100 GPU Droplets, this solution provides accelerated computing performance for deep learning tasks. It eliminates overwhelming infrastructure complexities, allowing developers of all skill levels—whether beginners or advanced—to concentrate on building applications without the hassle of complicated software configurations.
In this article, we will demonstrate batch processing using the 1-Click Model. Our tutorial will utilize the Llama 3.1 8B Instruct model on a single GPU. Although we will use a smaller batch for this example, it can easily be scaled to accommodate larger batches, depending on your workload and the computational resources available. The flexibility of DigitalOcean’s 1-Click Model deployment allows users to easily manage varying data sizes, making it suitable for scenarios ranging from small-scale tasks to large-scale enterprise applications.

Thanks for learning with the DigitalOcean Community. Check out our offerings for compute, storage, networking, and managed databases.

Learn more about our products

About the author

👁 Shaoni Mukherjee

Shaoni Mukherjee

Author

AI Technical Writer

See author profile

With a strong background in data science and over six years of experience, I am passionate about creating in-depth content on technologies. Currently focused on AI, machine learning, and GPU computing, working on topics ranging from deep learning frameworks to optimizing GPU-based workloads.

Category:

Tags:

Still looking for an answer?

Ask a question Search for more help

Was this helpful?

This textbox defaults to using Markdown to format your answer.

You can type !ref in this text area to quickly search our full set of tutorials, documentation & marketplace offerings and insert the link!

👁 Creative Commons
This work is licensed under a Creative Commons Attribution-NonCommercial- ShareAlike 4.0 International License.

Table of contents

Deploy on DigitalOcean
Click below to sign up for DigitalOcean's virtual machines, Databases, and AIML products.
Sign up

👁 Image

Become a contributor for community

Get paid to write technical tutorials and select a tech-focused charity to receive a matching donation.

👁 Image

DigitalOcean Documentation

Full documentation for every DigitalOcean product.

Learn more

👁 Image

Resources for startups and AI-native businesses

The Wave has everything you need to know about building a business, from raising funding to marketing your product.

Learn more

Get our newsletter

Stay up to date by signing up for DigitalOcean’s Infrastructure as a Newsletter.

New accounts only. By submitting your email you agree to our Privacy Policy

The developer cloud

Scale up as you grow — whether you're running one virtual machine or ten thousand.

View all products

Start building today

From GPU-powered inference and Kubernetes to managed databases and storage, get everything you need to build, scale, and deploy intelligent applications.

Dark mode is coming soon.

URL: https://www.digitalocean.com/community/tutorials/how-to-perform-batch-inferencing-using-1-click-model