![]() |
VOOZH | about |
The CData JDBC Driver for Databricks enables you to connect to live Databricks data from business intelligence and data mining tools that support the JDBC standard. This article shows how to integrate Databricks data into a report in SpagoBI Studio and host it on SpagoBI Server.
Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:
While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.
Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.
Follow the steps to create a JDBC data source for Databricks in SpagoBI Server.
Add a Databricks driver resource to the context. The following resource definition can be added to the GlobalNamingResources element in server.xml:
<Resource name="jdbc/databricks" auth="Container" type="javax.sql.DataSource" driverclassname="cdata.jdbc.databricks.DatabricksDriver" factory="org.apache.tomcat.jdbc.pool.DataSourceFactory" maxactive="20" maxidle="10" maxwait="-1"/>
<ResourceLink global="jdbc/databricks" name="jdbc/databricks" type="javax.sql.DataSource"/>
After adding the driver to the resources for the SpagoBI server, add the data source: In SpagoBI, click Resources -> Data Source -> Add and enter the following information:
To connect to a Databricks cluster, set the properties as described below.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
For assistance in constructing the JDBC URL, use the connection string designer built into the Databricks JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.databricks.jar
Fill in the connection properties and copy the connection string to the clipboard.
π Using the built-in connection string designer to generate a JDBC URL (Salesforce is shown.)A typical JDBC URL is below:
jdbc:databricks:Server=127.0.0.1;Port=443;TransportMode=HTTP;HTTPPath=MyHTTPPath;UseSSL=True;User=MyUser;Password=MyPassword;
Follow the steps below to populate reports based on Databricks data in SpagoBI Studio. You will create a dataset that populates a chart with the results of an SQL query. In the next section, you will host this report on SpagoBI Server.
First, you will need to connect to Databricks data from a report in SpagoBI Studio:
jdbc:databricks:Server=127.0.0.1;Port=443;TransportMode=HTTP;HTTPPath=MyHTTPPath;UseSSL=True;User=MyUser;Password=MyPassword;See the "Getting Started" chapter of the driver help for a guide to obtaining the required connection properties. π The JDBC data source. (Salesforce is shown.)
After you have connected to Databricks data, create a dataset that contains the results of an SQL query:
SELECT City, CompanyName FROM Customers WHERE Country = 'US'π The query to be used to populate a chart. (Salesforce is shown.)
You can use the dataset to populate report objects. Follow the steps below to create a chart.
Follow the steps below to host documents based on live Databricks data on SpagoBI Server. You will use the report you created in the previous section as a template. To enable report users to access the live data, create placeholder parameters to be replaced by the Databricks JDBC data source on the server:
In the Property Binding node, set the JDBC Driver URL binding property to the url parameter: Click the box for the property. In the Category section, select Report Parameters. Select All in the Subcategory section and double-click the parameter.
You can also enter the following in the JavaScript syntax:
params["url"].valueπ Placeholder values in the report for the JDBC data source on the server.
Next, create a new document for the report on SpagoBI Server.
In the Template section, click Choose File. Navigate to the folder containing your report project. Select the .rptdesign file.
Note: You can find the path to the project in the project properties.
When you run the report on the server, the placeholder url parameter is replaced with the JDBC URL defined on the server.
π The chart running on the SpagoBI Server. (Salesforce is shown.)Download a free trial of the Databricks Driver to get started:
Download NowLearn more:
π Databricks IconRapidly create and deploy powerful Java applications that integrate with Databricks.