VOOZH about

URL: https://www.cdata.com/kb/tech/databricks-cloud-copilot.rst

⇱ Use Microsoft Copilot Studio to talk to your Databricks Data via CData Connect AI


Use Microsoft Copilot Studio to talk to your Databricks Data via CData Connect AI

πŸ‘ Cameron Leblanc
Cameron Leblanc
Senior Technology Evangelist
Leverage the CData Connect AI Remote MCP Server to enable Microsoft Copilot Studio to securely answer questions and take actions on your Databricks data for you.

Microsoft Copilot Studio is a no-code/low-code platform for creating AI Agents that can automate tasks, answer questions, and assist with various business processes. When combined with CData Connect AI Remote MCP, you can leverage Copilot Studio to interact with your Databricks data in real-time. This article outlines the process of connecting to Databricks using Connect AI Remote MCP and creating a connection in Copilot Studio to interact with your Databricks data.

CData Connect AI offers a dedicated cloud-to-cloud interface for connecting to Databricks data. The CData Connect AI Remote MCP Server enables secure communication between Microsoft Copilot Studio and Databricks. This allows you to ask questions and take actions on your Databricks data using Microsoft Copilot Studio, all without the need for data replication to a natively supported database. With its inherent optimized data processing capabilities, CData Connect AI efficiently channels all supported SQL operations, including filters and JOINs, directly to Databricks. This leverages server-side processing to swiftly deliver the requested Databricks data.

In this article, we show how to build a agent in Microsoft Copilot Studio to conversational explore (or Vibe Query) your data. The connectivity principals apply to any Copilot agent. With Connect AI you can build workflows and agents with access to live Databricks data, plus hundreds of other sources.

About Databricks Data Integration

Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:

  • Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
  • Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
  • Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
  • Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.

While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.

Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.


Getting Started


Step 1: Configure Databricks Connectivity for Microsoft Copilot Studio

Connectivity to Databricks from Microsoft Copilot Studio is made possible through CData Connect AI Remote MCP. To interact with Databricks data from Microsoft Copilot Studio, we start by creating and configuring a Databricks connection in CData Connect AI.

  1. Log into Connect AI, click Connections and click Add Connection πŸ‘ Adding a Connection
  2. Select "Databricks" from the Add Connection panel πŸ‘ Selecting a data source
  3. Enter the necessary authentication properties to connect to Databricks.

    To connect to a Databricks cluster, set the properties as described below.

    Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.

    • Server: Set to the Server Hostname of your Databricks cluster.
    • HTTPPath: Set to the HTTP Path of your Databricks cluster.
    • Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
    πŸ‘ Configuring a connection (Salesforce is shown)
    Click Save & Test
  4. Navigate to the Permissions tab in the Add Databricks Connection page and update the User-based permissions. πŸ‘ Updating permissions

With the connection configured, we are ready to connect to Databricks data from Microsoft Copilot Studio.

Step 2: Connect Microsoft Copilot Studio to CData Connect AI

Follow these steps to add a CData Connect AI MCP connection in Microsoft Copilot Studio:

  1. Under Tools, click Add tool, then click + New Tool. πŸ‘ Add a Tool to the Copilot agent
  2. In the Add Tool window, search for and click CData Connect AI. πŸ‘ Select Model Context Protocol
  3. In the Connect to CData Connect AI window, click Create to authenticate your connection CData Connect AI using OAuth authentication. πŸ‘ Configure the MCP Tool
  4. Click Add and configure to add the CData Connect AI Tool to your agent. πŸ‘ Create a new connection for the MCP Tool

Optional: Give the AI Agent context

This step establishes the AI Agent's role and provides context for the conversation through the Instructions property in the Agent. By providing instructions that explicitly informs the agent about its role as an MCP Server expert and lists the available tools, you can enhance the agent's understanding and response accuracy. For example, you can set the System Message to:

You are an expert at using the MCP Client tool connected which is the CData Connect AI MCP Server. Always search thoroughly and use the most relevant MCP Client tool for each query. Below are the available tools and a description of each:
queryData: Execute SQL queries against connected data sources and retrieve results. When you use the queryData tool, ensure you use the following format for the table name: catalog.schema.tableName
getCatalogs: Retrieve a list of available connections from CData Connect AI. The connection names should be used as catalog names in other tools and in any queries to CData Connect AI. Use the `getSchemas` tool to get a list of available schemas for a specific catalog.
getSchemas: Retrieve a list of available database schemas from CData Connect AI for a specific catalog. Use the `getTables` tool to get a list of available tables for a specific catalog and schema.
getTables: Retrieve a list of available database tables from CData Connect AI for a specific catalog and schema. Use the `getColumns` tool to get a list of available columns for a specific table.
getColumns: Retrieve a list of available database columns from CData Connect AI for a specific catalog, schema, and table.
getProcedures: Retrieve a list of stored procedures from CData Connect AI for a specific catalog and schema
getProcedureParameters: Retrieve a list of stored procedure parameters from CData Connect AI for a specific catalog, schema, and procedure.
executeProcedure: Execute stored procedures with parameters against connected data sources
 

Step 3: Explore Live Databricks Data with Microsoft Copilot Studio

With the Agent created in Microsoft Copilot Studio and the MCP tool connected, you can now interact with your Databricks data using Microsoft Copilot Studio. The MCP tool allows you to send queries and receive responses from the Databricks data source in real-time.

Open the chat window in your Microsoft Copilot Studio Agent to begin interacting with your Databricks data. You can ask questions, retrieve data, and perform actions on your Databricks data using the MCP tool: πŸ‘ Interact with your data using the MCP Tool in Microsoft Copilot Studio

Get CData Connect AI

To get live data access to hundreds of SaaS, Big Data, and NoSQL sources directly from your cloud applications, try CData Connect AI today!