VOOZH about

URL: https://www.cdata.com/kb/tech/azuredatalake-cloud-mistral.rst

⇱ How to Connect to Live Azure Data Lake Storage Data in Mistral AI Workflows and Agents (via CData Connect AI)


How to Connect to Live Azure Data Lake Storage Data in Mistral AI Workflows and Agents (via CData Connect AI)

πŸ‘ Mohsin Turki
Mohsin Turki
Technical Marketing Engineer
Leverage the CData Connect AI Remote MCP Server to enable Mistral AI to securely access, query, and take action on live Azure Data Lake Storage data without replication.

Mistral AI is a frontier AI company that builds enterprise-grade open-source and commercial large language models (LLMs). With Mistral, you can train, fine-tune, and deploy agents anywhere – on premises, in the cloud, or at the edge – while retaining full control of your data. Its agent-ready platform enables multilingual and multimodal AI that can search, create, code, automate, and collaborate securely, with support for memory and extended context handling.

CData Connect AI provides a secure cloud-to-cloud interface for easily integrating hundreds of enterprise data sources with Mistral AI. Through CData Connect AI, Mistral AI agents can query, analyze, and act on live Azure Data Lake Storage data in real time, without replication. Connect AI manages authentication, security, and query optimization so you can focus on building intelligent workflows, while Mistral handles reasoning and natural language interaction.

In this guide, we will use Mistral AI's Le Chat, Mistral's customizable conversational chatbot, along with CData Connect AI to connect to live Azure Data Lake Storage data. You will be able to interact with your live Azure Data Lake Storage data directly in Mistral AI workflows – running queries and automating tasks securely.

The setup takes just a few minutes, and once connected, you will have your own chatbot agent intelligently conversing with your live Azure Data Lake Storage data.

Let's begin.

Prerequisites

  1. A Mistral AI account – Sign up or log in here.
  2. A CData Connect AI account – Sign up or log in here.
  3. An active Azure Data Lake Storage account with valid credentials.

Overview

Here is a quick overview of the steps we will follow:

  1. Connect: Add a connection to Azure Data Lake Storage in CData Connect AI using your credentials.
  2. Configure: Create a custom MCP connection in Mistral AI Le Chat that points to your Azure Data Lake Storage connection in CData Connect AI.
  3. Query: Interact with live Azure Data Lake Storage data in Mistral AI workflows – running queries and taking actions using natural language.

Step 1: Configure Azure Data Lake Storage Connectivity for Mistral

Connectivity to Azure Data Lake Storage from Mistral AI is made possible through CData Connect AI Remote MCP. To interact with Azure Data Lake Storage data from Mistral, we start by creating and configuring a Azure Data Lake Storage connection in CData Connect AI.

  1. Log into Connect AI, click Sources, and then click Add Connection.
  2. πŸ‘ Adding a Connection
  3. Select "Azure Data Lake Storage" from the Add Connection panel.
  4. πŸ‘ Selecting a data source
  5. Enter the necessary authentication properties to connect to Azure Data Lake Storage.

    Authenticating to a Gen 1 DataLakeStore Account

    Gen 1 uses OAuth 2.0 in Entra ID (formerly Azure AD) for authentication.

    For this, an Active Directory web application is required. You can create one as follows:

    1. Sign in to your Azure Account through the
    2. Select "Entra ID" (formerly Azure AD).
    3. Select "App registrations".
    4. Select "New application registration".
    5. Provide a name and URL for the application. Select Web app for the type of application you want to create.
    6. Select "Required permissions" and change the required permissions for this app. At a minimum, "Azure Data Lake" and "Windows Azure Service Management API" are required.
    7. Select "Key" and generate a new key. Add a description, a duration, and take note of the generated key. You won't be able to see it again.

    To authenticate against a Gen 1 DataLakeStore account, the following properties are required:

    • Schema: Set this to ADLSGen1.
    • Account: Set this to the name of the account.
    • OAuthClientId: Set this to the application Id of the app you created.
    • OAuthClientSecret: Set this to the key generated for the app you created.
    • TenantId: Set this to the tenant Id. See the property for more information on how to acquire this.
    • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.

    Authenticating to a Gen 2 DataLakeStore Account

    To authenticate against a Gen 2 DataLakeStore account, the following properties are required:

    • Schema: Set this to ADLSGen2.
    • Account: Set this to the name of the account.
    • FileSystem: Set this to the file system which will be used for this account.
    • AccessKey: Set this to the access key which will be used to authenticate the calls to the API. See the property for more information on how to acquire this.
    • Directory: Set this to the path which will be used to store the replicated file. If not specified, the root directory will be used.
    πŸ‘ Configuring a connection (Salesforce is shown)
  6. Click Save & Test.
  7. Navigate to the Permissions tab in the Add Azure Data Lake Storage Connection page and update the user-based permissions. πŸ‘ Updating permissions

Add a Personal Access Token

A Personal Access Token (PAT) is used to authenticate the connection to Connect AI from Mistral AI. It is a best practice to create a separate PAT for each service to maintain fine-grained access control.

  1. Click on the gear icon () at the top right of the Connect AI app to open the Settings page.
  2. On the Settings page, go to the Access Tokens section and click Create PAT.
  3. Give the PAT a descriptive name and click Create. πŸ‘ Creating a new PAT
  4. Note: The PAT is only visible at creation, so be sure to copy and store it securely for future use.

With the connection configured and a PAT generated, you are ready to connect to Azure Data Lake Storage data from Mistral AI workflows.


Step 2: Configure the MCP Connector in Mistral Le Chat

With your Azure Data Lake Storage connection and PAT created in CData Connect AI, the next step is to configure a custom MCP connector inside Mistral Le Chat.

  1. Log into Le Chat. πŸ‘ Logging into Mistral Le Chat
  2. From the left-hand menu, click on Intelligence, then select Connectors. Click Add Connector. πŸ‘ Adding a new connector
  3. In the dialog, select Custom MCP Connector and enter the following details:
    • Connector Name: For example, CData_Remote_MCP.
    • Connector Server: https://mcp.cloud.cdata.com/mcp (found in the "Connect Data to AI" ribbon in Connect AI).
    • Authentication Method: API Token Authentication.
    • Header Name: Authorization.
    • Header Value: Basic [email protected]:YourPAT (replace "[email protected]" with your CData Connect AI email and "YourPAT" with the PAT created earlier in above format).
    πŸ‘ Configuring the MCP connection
  4. Click Connect to establish the connection.
  5. Scroll down to the bottom of the Connections section to confirm that your MCP connection is successfully established. πŸ‘ MCP connection established

This step ensures that Mistral Le Chat can securely route queries through the CData Remote MCP Server to your live Azure Data Lake Storage data.


Step 3: Query Live Azure Data Lake Storage Data from Mistral AI

Now that your MCP connector is configured in Le Chat, you can begin querying live Azure Data Lake Storage data directly in your conversations.

  1. In Le Chat, click on Chats in the left menu to start a new chat.
  2. Enable your MCP connector by clicking the Enable Tools button. πŸ‘ Enabling MCP tools in Le Chat
  3. Run a discovery query such as Get Catalogs or Get Tables to see the available data sources and schemas connected through CData Connect AI. πŸ‘ Listing connected catalogs and tables
  4. Test the connection by running a simple query. For example: "Compare the win rate of Opportunities across different industries." πŸ‘ Viewing visual query results (Salesforce is shown)
    πŸ‘ Viewing tabular query results (Salesforce is shown)

And that's it! You can now interact with live Azure Data Lake Storage data conversationally inside Mistral Le Chat.


Build Complex AI Agents with CData Connect AI

With the integration complete, you can go beyond simple queries and build complex, multi-step AI agents. These agents can combine reasoning from Mistral AI with secure, real-time access to your enterprise data through CData Connect AI, enabling workflows such as sales forecasting, support triage, customer trend analysis, and more.

Try CData Connect AI for free today and use the full power of Mistral AI agents with secure, live access to your enterprise data.

Ready to get started?

Learn more about CData Connect AI or sign up for free trial access:

Free Trial