URL: https://www.cdata.com/kb/tech/databricks-cloud-powerbi.rst

Visualize Live Databricks Data in Power BI (via CData Connect AI)

πŸ‘ Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Use the CData Power BI Connector and CData Connect AI to integrate live Databricks data into custom reports in Power BI.

Power BI transforms your company's data into rich visuals for you to collect and organize so you can focus on what matters to you. When paired with CData Connect AI, you get access to Databricks data for visualizations, dashboards, and more. This article shows how to use CData Connect AI to create a live connection to Databricks, connect to Databricks data from Power BI, and then create reports on Databricks data in Power BI.

About Databricks Data Integration

Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:

  • Access all versions of Databricks, from Runtime versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
  • Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
  • Securely authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
  • Upload data to Databricks using Databricks File System, Azure Blob Storage, and AWS S3 Storage.

While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or PolyBase to get live access to Databricks from within their existing RDBMS.

Read more about common Databricks use cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.


Getting Started


Configure Databricks Connectivity for Power BI

Connectivity to Databricks from Power BI is made possible through CData Connect AI. To work with Databricks data from Power BI, we start by creating and configuring a Databricks connection.

  1. Log into Connect AI, click Sources, and then click Add Connection.
  2. Select "Databricks" from the Add Connection panel.
  3. Enter the necessary authentication properties to connect to Databricks.

    To connect to a Databricks cluster, set the properties as described below.

    Note: The needed values can be found in your Databricks instance by navigating to Clusters, selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.

    • Server: Set to the Server Hostname of your Databricks cluster.
    • HTTPPath: Set to the HTTP Path of your Databricks cluster.
    • Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
  4. Click Save & Test.
  5. Navigate to the Permissions tab in the Add Databricks Connection page and update the User-based permissions.
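
Copy-paste mistakes in the three connection properties (for example, pasting a full URL into Server) are a common reason for a failed Save & Test. The sketch below is not part of CData Connect AI or Databricks; it is a hypothetical local helper, using only the Python standard library, that checks the Server, HTTPPath, and Token values for obvious problems before you enter them. The class name, function name, and the specific checks are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DatabricksConnectionSettings:
    """Hypothetical container for the three values entered in Connect AI."""
    server: str      # Server Hostname from the JDBC/ODBC tab
    http_path: str   # HTTP Path from the JDBC/ODBC tab
    token: str       # personal access token


def find_problems(settings: DatabricksConnectionSettings) -> List[str]:
    """Return readable problems; an empty list means nothing obvious is wrong."""
    problems = []

    server = settings.server.strip()
    if not server:
        problems.append("Server is empty")
    elif "/" in server:
        # A scheme such as https:// or a trailing path means a URL was pasted.
        problems.append("Server should be a bare hostname, without a scheme or path")

    http_path = settings.http_path.strip()
    if not http_path:
        problems.append("HTTPPath is empty")
    elif " " in http_path:
        problems.append("HTTPPath should not contain spaces")

    if not settings.token.strip():
        problems.append("Token is empty")

    return problems


if __name__ == "__main__":
    # All values below are placeholders, not real connection details.
    example = DatabricksConnectionSettings(
        server="https://example-workspace.cloud.databricks.com",
        http_path="sql/protocolv1/o/0/example-cluster",
        token="",
    )
    for problem in find_problems(example):
        print(problem)
```

These checks only catch formatting slips. Whether the values actually grant access is still confirmed by Save & Test in Connect AI.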

With the connection configured, you are ready to connect to Databricks data from Power BI.

Query Databricks Tables

Follow the steps below to build a query to pull Databricks data into the report:

  1. Open Power BI Desktop, click Get Data -> Online Services -> CData Connect AI, and then click "Connect".
  2. Click "Sign in" and authenticate with your CData Connect AI account.
  3. After signing in, click "Connect".
  4. Select tables in the Navigator dialog.
  5. Click Load to establish the connection to your Databricks data from Power BI.

Create Databricks Data Visualizations

After loading the data into Power BI, you can create data visualizations in the Report view by dragging fields from the Fields pane onto the canvas. Select the dimensions and measures you wish to visualize along with the chart type.

πŸ‘ Visualizing data in Power BI (Salesforce data is shown)

Click Refresh to synchronize your report with any changes to the data.

Live Access to Databricks Data from Data Applications

With CData Connect AI, you have a direct connection to Databricks data from Power BI. You can import more data, create new visualizations, build reports, and more, all without replicating Databricks data.

To get SQL data access to hundreds of SaaS, Big Data, and NoSQL sources (including Databricks) directly from your on-premises BI, reporting, ETL, and other data applications, visit the CData Connect AI page and start a free trial.
