VOOZH about

URL: https://thenewstack.io/is-fine-tuning-or-prompt-engineering-the-right-approach-for-ai/

⇱ Is Fine-Tuning or Prompt Engineering the Right Approach for AI? - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-02-10 09:00:50
Is Fine-Tuning or Prompt Engineering the Right Approach for AI?
sponsor-rafay,sponsored-post-contributed,
AI / AI Engineering

Is Fine-Tuning or Prompt Engineering the Right Approach for AI?

While prompt engineering is a quick and cost-effective solution for general tasks, fine-tuning enables superior AI performance on proprietary data.
Feb 10th, 2025 9:00am by Rajat Tiwari
👁 Featued image for: Is Fine-Tuning or Prompt Engineering the Right Approach for AI?
Image from Treecha on Shutterstock.
Rafay sponsored this post.

We previously discussed how building a RAG-based chatbot for enterprise data paved the way for creating a comprehensive GenAI platform. That article highlighted the growing need for enterprises to develop AI solutions tailored to their specific needs.

As AI adoption accelerates, organizations face a critical decision: Should they rely on prompt engineering for quick solutions or invest in fine-tuning models for deeper customization?

Let’s explore the differences between these two approaches, learn from early adopters and outline the infrastructure requirements for fine-tuning at scale.

Prompt Engineering and RAG: Quick Start Into the AI World

Prompt engineering involves crafting precise input prompts to guide large language models (LLMs) like OpenAI’s GPT or Anthropic’s Claude without modifying their architecture.

👁 Image
Prompt engineering

When combined with retrieval-augmented generation (RAG), which integrates external knowledge bases, this approach dynamically enriches model outputs, making it a cost-effective and adaptable solution.

👁 Image
Prompt engineering + RAG

Advantages of Prompt Engineering and RAG

  • Speed and simplicity: Faster to implement with minimal technical overhead.
  • Adaptability: Effective for general use cases and common tasks.
  • Lower costs: No need for complex infrastructure or model training.

Challenges and Limitations

  • Knowledge cutoffs: Pretrained models may lack up-to-date information.
  • Limited customization: Models struggle with niche or proprietary tasks.
  • Data privacy risks: Using sensitive data in prompts may expose intellectual property.

While prompt engineering is ideal for general applications, specialized AI workflows often require more robust solutions. This is where fine-tuning shines.

Fine-Tuning: Unlocking Model Customization

Fine-tuning involves retraining a base model using domain-specific data sets and adjusting the model’s weights to better suit unique workflows. This process enables organizations to enhance model performance for specialized tasks, offering unparalleled control and customization.

👁 Image
Fine-Tuning

Key Benefits of Fine-Tuning

  • Improved accuracy: Tailored models perform better on proprietary data sets.
  • Full control: Fine-tuning allows deeper control over model behavior and outputs.
  • Specialization: Perfect for industry-specific or proprietary use cases.

Common Fine-Tuning Techniques

  • Continued pretraining (CPT): Extends base model training with custom data sets.
  • Supervised fine-tuning (SFT): Uses labeled prompt-response pairs for task-specific optimization.
  • Reinforcement learning from human feedback (RLHF): Aligns outputs with human preferences for nuanced results.

Why Fine-Tuning Is Becoming Popular

Fine-tuning is becoming more popular as enterprises realize its potential to deliver better results by customizing AI models for their specific needs. It’s not just about having access to GPUs — it’s about getting the most out of proprietary data with new tools that make fine-tuning easier.

Here’s why fine-tuning is gaining traction:

  1. Better results with proprietary data: Fine-tuning allows businesses to train models on their own data, making the AI much more accurate and relevant to their specific tasks. This leads to better outcomes and real business value.
  2. Easier than ever before: Tools like Hugging Face’s Open Source libraries, PyTorch and TensorFlow, along with cloud services, have made fine-tuning more accessible. These frameworks simplify the process, even for teams without deep AI expertise.
  3. Improved infrastructure: The rising availability of powerful GPUs and cloud-based solutions has made it much easier to set up and run fine-tuning at scale.

While fine-tuning opens the door to more customized AI, it does require careful planning and the right infrastructure to succeed.

AI Development Journey: From GPUs to Fine-Tuned Models

Developing fine-tuned AI models is a multistep process that begins with securing the right infrastructure. Below is a step-by-step roadmap.

Step 1: Procuring GPUs

Securing GPUs is the foundation of AI development. Organizations often use NVIDIA’s Cloud Partner (NCP) program, cloud GPU providers or platforms like AWS.

Example: A technology company, ABC Corp, decided to procure GPUs to support its growing AI initiatives, including running complex simulations, accelerating machine learning experiments and enabling fine-tuning of models for proprietary use cases. By building an in-house AI data center, it ensured it has the flexibility and resources needed for diverse AI projects while maintaining control over sensitive information.

Step 2: Setting Up GPU Infrastructure

After securing GPUs, the next step is setting up the infrastructure. Automation tools and platforms simplify tasks like cluster management, server setup and deployment, making it easier to consume and scale GPU resources efficiently.

Example: IT administrators at ABC Corp used automation tools to deploy and manage their GPU clusters efficiently. This streamlined process allowed their teams to begin experimenting with models much sooner.

Step 3: Building the Orchestration Layer

Managing GPU resources efficiently requires an orchestration layer. This layer allocates GPU capacity based on developer needs. Rafay’s GPU PaaS solution, for example, allows IT administrators to create GPU profiles for teams, enabling seamless self-service access.

Example: The IT team at ABC Corp configured GPU profiles using an orchestration platform. When a lead data scientist needed a 4-GPU or 2-GPU instance for a project, it was provisioned instantly, allowing the team to proceed without delays.

Step 4: Fine-Tuning and Model Development

Once the infrastructure is set up, AI teams can focus on the real work: fine-tuning and building models. Public cloud platforms like AWS Bedrock and Azure AI, and private cloud solutions like Rafay, provide user-friendly environments that make it easier for developers to experiment, train and deploy models efficiently.

These platforms allow end users such as data engineers and machine learning engineers to use fine-tuned models for their daily tasks, driving innovation and productivity.

Example: A data scientist at ABC Corp uploaded a domain-specific data set to a fine-tuning platform, tailoring a large language model to the company’s unique requirements. This resulted in a model that delivered superior accuracy and improved outcomes for their business applications.

Conclusion

As enterprises accelerate their AI adoption, choosing between prompt engineering and fine-tuning will have a significant impact on their success. While prompt engineering provides a quick, cost-effective solution for general tasks, fine-tuning unlocks the full potential of AI, enabling superior performance on proprietary data.

From securing GPUs to fine-tuning models, the journey is complex, but organizations can simplify it with the right infrastructure and tools.

In future articles, we’ll explore fine-tuning techniques in detail, providing actionable insights for enterprises at every stage of their AI journey.

If you’re looking for solutions to support your AI initiatives, Rafay supports enterprises across key steps of the AI journey — from setting up infrastructure with Rafay’s Bare Metal Solution, managing GPU resources with Rafay’s GPU PaaS, to fine-tuning and model deployment using Rafay’s GenAI platform.

Rafay’s Cloud Automation Platform provides a solution for platform teams that wish to build automated self-service cloud infrastructure workflows, guardrails included, allowing platform teams to enable anyone who depends on rapid access to cloud infrastructure to move faster safely with golden paths.
Learn More
The latest from Rafay
TRENDING STORIES
Rajat Tiwari is a Senior Software Engineer at Rafay Systems, with a strong interest in the GenAI/ML space and cloud infrastructure. Prior to joining Rafay Systems, he was a Software Engineer at Citrix Systems, where he contributed to the development...
Read more from Rajat Tiwari
Rafay sponsored this post.
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: OpenAI, Anthropic.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.