![]() |
VOOZH | about |
Denodo Platform is a data virtualization product providing a single point of contact for enterprise database data. When paired with the CData JDBC Driver for Databricks, Denodo users can work with live Databricks data alongside other enterprise data sources. This article explains how to create a virtual data source for Databricks in the Denodo Virtual DataPort Administrator.
With built-in optimized data processing, the CData JDBC Driver offers unmatched performance for interacting with live Databricks data. When you issue complex SQL queries to Databricks, the driver pushes supported SQL operations, like filters and aggregations, directly to Databricks and utilizes the embedded SQL engine to process unsupported operations client-side (often SQL functions and JOIN operations). Its built-in dynamic metadata querying allows you to work with and analyze Databricks data using native data types.
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
To connect to live Databricks data from Denodo, you need to copy the JDBC Driver JAR file to the external library directory for Denodo and create a new JDBC Data Source from the Virtual DataPort Administrator tool.
Database URI: Set this to a JDBC URL using the necessary connection properties. For example,
jdbc:databricks:Server=127.0.0.1;Port=443;TransportMode=HTTP;HTTPPath=MyHTTPPath;UseSSL=True;User=MyUser;Password=MyPassword;
π Configuring the JDBC connection (NetSuite is shown).Information on creating the Database URI follows:
For assistance in constructing the JDBC URL, use the connection string designer built into the Databricks JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.databricks.jar
Fill in the connection properties and copy the connection string to the clipboard.
To connect to a Databricks cluster, set the properties as described below.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
After creating the data source, you can create a base view of Databricks data for use in the Denodo Platform.
SELECT * FROM cdata_databricks_customers CONTEXT ('i18n'='us_est', 'cache_wait_for_load'='true')
π Configuring the query to view the data.With the base view created, you can now work with live Databricks data like you would any other data source in Denodo Platform, for example, querying Databricks in the Denodo Data Catalog.
Download a free, 30-day trial of the CData JDBC Driver for Databricks and start working with your live Databricks data in Denodo Platform. Reach out to our Support Team if you have any questions.
Download a free trial of the Databricks Driver to get started:
Download NowLearn more:
π Databricks IconRapidly create and deploy powerful Java applications that integrate with Databricks.