![]() |
VOOZH | about |
Tableau Cloud is an analytics platform fully hosted in the cloud. When paired with CData Connect AI, you get instant, cloud-to-cloud access to Databricks data for visualizations, dashboards, and more. This article shows how to connect to Databricks and build visualizations from Databricks data in Tableau Cloud.
CData Connect AI provides a pure cloud-to-cloud interface for Databricks, allowing you to easily build visualizations from live Databricks data in Tableau Cloud without installing connectors or publishing worksheets and data sources from Tableau Desktop. As you build visualizations, Tableau Cloud generates SQL queries to gather data. Using optimized data processing out of the box, CData Connect AI pushes all supported SQL operations (filters, JOINs, etc) directly to Databricks, leveraging server-side processing to quickly return Databricks data.
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
To work with Databricks data from Tableau Cloud, we need to connect to Databricks from Connect AI, provide user access to the connection, and create a Workspace for the Databricks data.
CData Connect AI uses a straightforward, point-and-click interface to connect to data sources.
To connect to a Databricks cluster, set the properties as described below.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
When connecting to Connect AI through the REST API, the OData API, or the Virtual SQL Server, a Personal Access Token (PAT) is used to authenticate the connection to Connect AI. It is best practice to create a separate PAT for each service to maintain granularity of access.
After connecting to Databricks, create a workspace for your desired table(s).
With the connection, PAT, and Workspace configured, you are ready to connect to Databricks data from Tableau Cloud.
The steps below outline creating a new data source in Tableau Cloud based on the virtual Databricks database in Connect AI and building a simple visualization from the data.
At this point, you have a direct, cloud-to-cloud connection to live Databricks data from your Tableau Cloud workbook. You can create new visualizations, build dashboards, and more, with no need to publish data sources and workbooks from Tableau Desktop. For more information on gaining live access to data from more than 100 SaaS, Big Data, and NoSQL sources from cloud applications like Tableau Cloud, refer to our Connect AI page.
Learn more about CData Connect AI or sign up for free trial access:
Free Trial