Databricks is a leading AI cloud-native platform that unifies data engineering, machine learning, and analytics at scale. Its powerful data lakehouse architecture combines the performance of data warehouses with the flexibility of data lakes. Integrating Databricks with CData Connect AI gives organizations live, real-time access to Adobe Analytics data without the need for complex ETL pipelines or data duplication—streamlining operations and reducing time-to-insights.
In this article, we'll walk through how to configure a secure, live connection from Databricks to Adobe Analytics using CData Connect AI. Once configured, you'll be able to access Adobe Analytics data directly from Databricks notebooks using standard SQL—enabling unified, real-time analytics across your data ecosystem.
Here is an overview of the simple steps:
Before you begin, make sure you have the following:
CData Connect AI uses a straightforward, point-and-click interface to connect to available data sources.
Adobe Analytics uses the OAuth authentication standard. To authenticate using OAuth, create an app to obtain the OAuthClientId, OAuthClientSecret, and CallbackURL connection properties. See the "Getting Started" section of the help documentation for a guide.
GlobalCompanyId is a required connection property. If you do not know your Global Company ID, you can find it in the request URL for the users/me endpoint on the Swagger UI. After logging into the Swagger UI URL, expand the users endpoint and then click the GET users/me button. Click the Try it out and Execute buttons. Note your Global Company ID, shown in the Request URL immediately preceding the users/me endpoint.
Report Suite ID (RSID) is also a required connection property. In the Adobe Analytics UI, navigate to Admin -> Report Suites and you will get a list of your report suites along with their identifiers next to the name.
After setting the GlobalCompanyId, RSID and OAuth connection properties, you are ready to connect to Adobe Analytics.
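The Global Company ID lookup described above can also be done programmatically once you have copied the Request URL from the Swagger UI. The following is a minimal sketch, not part of any CData or Adobe tooling: the function name `global_company_id` is hypothetical, and the example URL is only an assumed shape with a placeholder ID.

```python
from urllib.parse import urlparse


def global_company_id(request_url: str) -> str:
    """Return the path segment immediately preceding the users/me endpoint."""
    # Split the URL path into non-empty segments, e.g. ["api", "<id>", "users", "me"].
    segments = [s for s in urlparse(request_url).path.split("/") if s]
    # Look for "users" followed by "me" and return the segment before them.
    for i in range(1, len(segments) - 1):
        if segments[i] == "users" and segments[i + 1] == "me":
            return segments[i - 1]
    raise ValueError("no users/me endpoint found in URL path")


if __name__ == "__main__":
    # Placeholder URL for illustration only; paste your own Request URL here.
    print(global_company_id("https://analytics.adobe.io/api/examplco0/users/me"))
```

The returned value is what you would enter as the GlobalCompanyId connection property.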
When connecting to Connect AI through the REST API, the OData API, or the Virtual SQL Server, a Personal Access Token (PAT) is used to authenticate the connection to Connect AI. The PAT functions as an alternative to your login credentials for secure, token-based authentication. It is a best practice to create a separate PAT for each service to maintain granularity of access.
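Because the PAT stands in for your password, it is usually better kept out of notebook source. The sketch below assembles the JDBC URL in the same format this article uses in the notebook steps, reading the credentials from environment variables. The helper names and the variable names `CONNECT_AI_USER` and `CONNECT_AI_PAT` are assumptions for illustration, not CData conventions. In Databricks you might instead fetch the token from a secret scope.

```python
import os


def build_connect_url(user: str, pat: str, catalog: str) -> str:
    """Assemble a Connect AI JDBC URL in the format shown in this article."""
    # Fail early on missing values rather than sending an incomplete URL.
    for name, value in (("user", user), ("pat", pat), ("catalog", catalog)):
        if not value:
            raise ValueError(f"{name} must not be empty")
    return (
        "jdbc:connect:AuthScheme=Basic;"
        f"User={user};Password={pat};"
        "URL=https://cloud.cdata.com/api/;"
        f"DefaultCatalog={catalog};"
    )


def url_from_env(catalog: str, env=os.environ) -> str:
    """Read credentials from (hypothetical) environment variables."""
    return build_connect_url(
        env.get("CONNECT_AI_USER", ""),
        env.get("CONNECT_AI_PAT", ""),
        catalog,
    )
```

Calling `url_from_env("Your_Connection_Name")` raises a `ValueError` if either variable is unset, which surfaces a missing token before Spark attempts the connection.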
Follow these steps to establish a connection from Databricks to Adobe Analytics. You'll install the CData JDBC Driver for Connect AI, add the JAR file to your cluster, configure your notebooks, and run SQL queries to access live Adobe Analytics data.
C:\Program Files\CData\CData JDBC Driver for Connect AI\lib\cdata.jdbc.connect.jar
/Applications/CData/CData JDBC Driver for Connect AI/lib/cdata.jdbc.connect.jar
driver = "cdata.jdbc.connect.ConnectDriver"
url = "jdbc:connect:AuthScheme=Basic;User=your_username;Password=your_pat;URL=https://cloud.cdata.com/api/;DefaultCatalog=Your_Connection_Name;"
remote_table = spark.read.format("jdbc") \
.option("driver", "cdata.jdbc.connect.ConnectDriver") \
.option("url", "jdbc:connect:AuthScheme=Basic;User=your_username;Password=your_pat;URL=https://cloud.cdata.com/api/;DefaultCatalog=Your_Connection_Name;") \
.option("dbtable", "YOUR_SCHEMA.YOUR_TABLE") \
.load()
display(remote_table.select("ColumnName1", "ColumnName2"))
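To query the loaded data with SQL rather than DataFrame methods, one option is to register the DataFrame as a temporary view. The sketch below keeps the reader options and the preview query in two small helpers. The helper names and the view name `adobe_analytics_live` are hypothetical, and the commented notebook lines assume the `spark` and `display` objects that Databricks notebooks predefine.

```python
def connect_reader_options(url: str, table: str) -> dict:
    """Collect the JDBC reader options used in the steps above."""
    return {
        "driver": "cdata.jdbc.connect.ConnectDriver",
        "url": url,
        "dbtable": table,
    }


def preview_query(view_name: str, columns: list, limit: int = 10) -> str:
    """Build a simple Spark SQL preview query over a temporary view."""
    # Backticks quote column names that contain spaces or reserved words.
    cols = ", ".join(f"`{c}`" for c in columns)
    return f"SELECT {cols} FROM {view_name} LIMIT {int(limit)}"


# In a Databricks notebook, with `url` defined as in the earlier step:
# opts = connect_reader_options(url, "YOUR_SCHEMA.YOUR_TABLE")
# remote_table = spark.read.format("jdbc").options(**opts).load()
# remote_table.createOrReplaceTempView("adobe_analytics_live")
# display(spark.sql(preview_query("adobe_analytics_live",
#                                 ["ColumnName1", "ColumnName2"])))
```

The temporary view exists only for the current Spark session, so nothing is written to storage.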
You can now explore, join, and analyze live Adobe Analytics data directly within Databricks notebooks—without needing to know the complexities of the back-end API and without replicating Adobe Analytics data.
Ready to simplify real-time access to Adobe Analytics data? Start your free 14-day trial of CData Connect AI today and experience seamless, live connectivity from Databricks to Adobe Analytics.
Low code, zero infrastructure, zero replication — just seamless, secure access to your most critical data and insights.