VOOZH about

URL: https://www.cdata.com/kb/tech/databricks-cloud-chatgpt.rst

⇱ Use ChatGPT to Talk to Your Databricks Data via CData Connect AI


Use ChatGPT to Talk to Your Databricks Data via CData Connect AI

πŸ‘ Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Leverage the CData Connect AI Remote MCP Server to enable ChatGPT to securely answer questions and take actions on your Databricks data for you.

ChatGPT is an AI chatbot developed by OpenAI, launched in November 2022. Based on large language models (LLMs), it enables users to refine and steer conversations through natural language processing. ChatGPT's developer mode, available to Plus and Pro subscribers, provides full Model Context Protocol (MCP) support for connecting to external data sources and tools.

CData Connect AI offers a dedicated cloud-to-cloud interface for connecting to Databricks data. The CData Connect AI Remote MCP Server enables secure communication between ChatGPT and Databricks. This allows you to ask questions and take actions on your Databricks data using ChatGPT, all without the need for data replication to a natively supported database. With its inherent optimized data processing capabilities, CData Connect AI efficiently channels all supported SQL operations, including filters and JOINs, directly to Databricks. This leverages server-side processing to swiftly deliver the requested Databricks data.

About Databricks Data Integration

Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:

  • Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
  • Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
  • Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
  • Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.

While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.

Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.


Getting Started


Step 1: Configure Databricks Connectivity for ChatGPT

Connectivity to Databricks from ChatGPT is made possible through CData Connect AI Remote MCP. To interact with Databricks data from ChatGPT, we start by creating and configuring a Databricks connection in CData Connect AI.

  1. Log into your Connect AI account, click Sources, and then click Add Connection.
  2. πŸ‘ Adding a Connection
  3. Select "Databricks" from the Add Connection panel.
  4. πŸ‘ Selecting a data source
  5. Enter the necessary authentication properties to connect to Databricks.

    To connect to a Databricks cluster, set the properties as described below.

    Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.

    • Server: Set to the Server Hostname of your Databricks cluster.
    • HTTPPath: Set to the HTTP Path of your Databricks cluster.
    • Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
    πŸ‘ Configuring a connection (Salesforce is shown)
  6. Click Save & Test

With the connection configured, we are ready to connect to Databricks data from ChatGPT.

Step 2: Connect ChatGPT to CData Connect AI

Follow these steps to add a CData Connect AI connection in ChatGPT:

  1. Sign in to ChatGPT with a Plus or Pro subscription.
  2. Navigate to Apps from the left panel.
  3. Select the CData Connect AI app from the list.
  4. . πŸ‘ Search for CData Connect AI
  5. Click Connect to authenticate to your Connect AI account.
  6. πŸ‘ Click Connect to authenticate
  7. Click Sign in with CData Connect AI to add Connect AI to your ChatGPT account.
  8. πŸ‘ Sign in with CData Connect AI
  9. After successful authorization, you will be redirected back to ChatGPT.
  10. Click Start Chat to start a new conversation in ChatGPT with Connect AI connected at the background.
  11. πŸ‘ Start Chat

Step 3: Explore Live Databricks Data with ChatGPT

  1. Start a new conversation in ChatGPT.
  2. Connect AI should ideally be automatically enabled in the chat. If not, navigate to -> More -> CData Connect AI, under the chatbox, to enable the connector from the dropdown
  3. πŸ‘ Enable CData Connect AI in the chat
  4. You can now start exploring your data with natural language prompts. ChatGPT will use the Connect AI MCP server to query your live Databricks data. Example prompts:
    • "Show me all customers from the last 30 days"
    • "What are my top performing products?"
    • "Analyze sales trends for this quarter"
    • "List all active projects and their current status"
    Refer to CData prompt library for more prompt ideas.
  5. πŸ‘ Using natural language to explore your Databricks data (Salesforce used here).
  6. Permit ChatGPT to access your Databricks data. Click Query Data to continue, or Deny to refuse.
  7. πŸ‘ Give permissions to ChatGPT.
  8. ChatGPT translates your natural language queries into SQL and execute them against your Databricks data through the Connect AI MCP server.
  9. πŸ‘ ChatGPT shows the desired results from your Databricks data based on the prompts

Get CData Connect AI

To get live data access to hundreds of SaaS, Big Data, and NoSQL sources directly from your cloud applications, try CData Connect AI today!

Ready to get started?

Learn more about CData Connect AI or sign up for free trial access:

Free Trial

In this article


Related articles