With built-in support for ODBC on Microsoft Windows, the CData ODBC Drivers provide self-service integration with analytics tools such as Microsoft Power BI. The CData ODBC Driver for Databricks links your Power BI reports to operational Databricks data. You can monitor Databricks data through dashboards and keep your analysis current with Databricks data by scheduling refreshes or refreshing on demand. This article details how to use the ODBC driver to create real-time visualizations of Databricks data in Microsoft Power BI Desktop and then upload them to Power BI.
The CData ODBC Drivers offer unmatched performance for interacting with live Databricks data in Power BI due to optimized data processing built into the driver. When you issue complex SQL queries from Power BI to Databricks, the driver pushes supported SQL operations, like filters and aggregations, directly to Databricks and utilizes the embedded SQL Engine to process unsupported operations (often SQL functions and JOIN operations) client-side. With built-in dynamic metadata querying, you can visualize and analyze Databricks data using native Power BI data types.
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
While many customers use CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers use SQL Server Linked Servers or PolyBase to get live access to Databricks from within their existing RDBMS.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
If you have not already, first specify connection properties in an ODBC DSN (data source name). This is the last step of the driver installation. You can use the Microsoft ODBC Data Source Administrator to create and configure ODBC DSNs.
Configure ODBC DSN. (Salesforce is shown.)
To connect to a Databricks cluster, set the properties as described below.
Note: You can find the required values in your Databricks instance by navigating to Clusters, selecting the desired cluster, and opening the JDBC/ODBC tab under Advanced Options.
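The same connection properties can also be exercised outside Power BI as a quick check. The following is a minimal sketch, not the driver's documented procedure: the property names `Server`, `HTTPPath`, and `Token` are assumptions based on the values shown on the JDBC/ODBC tab, the placeholder values and the `Customers` table are hypothetical, and `build_connection_string` and `preview_rows` are illustrative helpers. The `connect` argument is expected to be a function such as `pyodbc.connect` from the third-party pyodbc package.

```python
def odbc_value(value):
    """Wrap a value in braces when it contains characters that are
    special in an ODBC connection string; a literal '}' is doubled."""
    text = str(value)
    needs_braces = any(ch in text for ch in ";{}") or text != text.strip()
    if needs_braces:
        return "{" + text.replace("}", "}}") + "}"
    return text


def build_connection_string(driver, properties):
    """Build a DSN-less ODBC connection string from a driver name
    and a dict of connection properties."""
    parts = ["Driver={" + driver + "}"]
    parts += [f"{key}={odbc_value(value)}" for key, value in properties.items()]
    return ";".join(parts) + ";"


def preview_rows(connect, connection_string, table, limit=5):
    """Open a connection, read up to `limit` rows from `table`, and close.

    `connect` is any DB-API style connect function (for example
    pyodbc.connect). `table` is interpolated directly into the SQL,
    so pass only trusted table names.
    """
    conn = connect(connection_string)
    try:
        cursor = conn.cursor()
        cursor.execute(f"SELECT * FROM {table}")
        return cursor.fetchmany(limit)
    finally:
        conn.close()


# Property names and placeholder values below are assumptions; use the
# names and values that your driver documentation and cluster provide.
conn_str = build_connection_string(
    "CData ODBC Driver for Databricks",
    {
        "Server": "<server-hostname>",
        "HTTPPath": "<http-path>",
        "Token": "<personal-access-token>",
    },
)
print(conn_str)

# Example usage (requires `pip install pyodbc` and the installed driver):
#   import pyodbc
#   for row in preview_rows(pyodbc.connect, conn_str, "Customers"):
#       print(row)
```

If you already created a DSN, passing a string of the form `DSN=<your DSN name>` to `preview_rows` in place of `conn_str` should exercise the same settings that Power BI will use.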
After creating an ODBC DSN, follow the steps below to connect to the Databricks ODBC DSN from Power BI Desktop:
Click Transform Data to edit the query. The table you imported is displayed in the Power Query Editor. In the Power Query Editor, you can enrich your local copy of Databricks data with other data sources, pivot Databricks columns, and more. Power BI detects each column's data type from the Databricks metadata retrieved by the driver.
Power BI records your modifications to the query in the Applied Steps section and adjusts the underlying data retrieval query that is executed against the remote Databricks data. When you click Close and Apply, Power BI executes the data retrieval query.
Otherwise, click Load to pull the data into Power BI.
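To make the Applied Steps behavior more concrete, the sketch below shows how a sequence of recorded steps could be combined into a single retrieval query. It is a conceptual illustration only, not Power BI's or the driver's actual implementation; the `Query` class, the `Customers` table, and its columns are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Query:
    """A retrieval query that accumulates steps before it is executed."""
    table: str
    columns: List[str] = field(default_factory=list)  # empty means all columns
    filters: List[str] = field(default_factory=list)

    def select_columns(self, *names):
        # Each step returns a new Query, mirroring a list of applied steps.
        return Query(self.table, list(names), list(self.filters))

    def filter_rows(self, condition):
        return Query(self.table, list(self.columns), self.filters + [condition])

    def to_sql(self):
        cols = ", ".join(self.columns) if self.columns else "*"
        sql = f"SELECT {cols} FROM {self.table}"
        if self.filters:
            sql += " WHERE " + " AND ".join(f"({f})" for f in self.filters)
        return sql


# Two steps are recorded, but only one query would be sent to the source.
steps = (
    Query("Customers")
    .select_columns("City", "CompanyName")
    .filter_rows("Country = 'US'")
)
print(steps.to_sql())
# SELECT City, CompanyName FROM Customers WHERE (Country = 'US')
```

The point of the design is that nothing runs while steps are being added; the combined query is produced once, which corresponds to the moment you click Close and Apply.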
After pulling the data into Power BI, you can create data visualizations in the Report view by dragging fields from the Fields pane onto the canvas. Follow the steps below to create a pie chart (Salesforce shown):
You can change sort options by clicking the ellipsis (...) button for the chart. Options to select the sort column and change the sort order are displayed.
You can use both highlighting and filtering to focus on data. Filtering removes unfocused data from visualizations; highlighting dims unfocused data. You can highlight fields by clicking them:
A highlighted account in a pie chart. (Salesforce is shown.)
You can apply filters at the page level, at the report level, or to a single visualization by dragging fields onto the Filters pane. To filter on the field's value, select one of the values that are displayed in the Filters pane.
Accounts and Annual Revenue filtered by Industry. (Salesforce is shown.)
Click Refresh to synchronize your report with any changes to the data.
If you are interested in connecting to your Databricks data from Microsoft Power BI, or from any application that supports ODBC connectivity, download a free, 30-day trial of the CData ODBC Driver for Databricks. As always, our world-class support team is ready to answer any questions you may have.
Download a free trial of the Databricks ODBC Driver to get started:
Download Now
Learn more:
The Databricks ODBC Driver is a powerful tool that allows you to connect with live data from Databricks, directly from any application that supports ODBC connectivity.
Access Databricks data like you would a database - read, write, and update through a standard ODBC Driver interface.