![]() |
VOOZH | about |
Databricks is a unified data analytics platform that allows organizations to easily process, analyze, and visualize large amounts of data. It combines data engineering, data science, and machine learning capabilities in a single platform, making it easier for teams to collaborate and derive insights from their data.
The CData SSIS Components enhance SQL Server Integration Services by enabling users to easily import and export data from various sources and destinations.
In this article, we explore the data type mapping considerations when exporting to Databricks and walk through how to migrate BigQuery data to Databricks using the CData SSIS Components for BigQuery and Databricks.
| Databricks Schema | CData Schema |
|---|---|
|
int, integer, int32 |
int |
|
smallint, short, int16 |
smallint |
|
double, float, real |
float |
|
date |
date |
|
datetime, timestamp |
datetime |
|
time, timespan |
time |
|
string, varchar |
If length > 4000: nvarchar(max), Otherwise: nvarchar(length) |
|
long, int64, bigint |
bigint |
|
boolean, bool |
tinyint |
|
decimal, numeric |
decimal |
|
uuid |
nvarchar(length) |
|
binary, varbinary, longvarbinary |
binary(1000) or varbinary(max) after SQL Server 2000 |
CData simplifies access and integration of live Google BigQuery data. Our customers leverage CData connectivity to:
Most CData customers are using Google BigQuery as their data warehouse and so use CData solutions to migrate business data from separate sources into BigQuery for comprehensive analytics. Other customers use our connectivity to analyze and report on their Google BigQuery data, with many customers using both solutions.
For more details on how CData enhances your Google BigQuery experience, check out our blog post: https://www.cdata.com/blog/what-is-bigquery
Follow the steps below to specify properties required to connect to BigQuery.
Google uses the OAuth authentication standard. To access Google APIs on behalf of individual users, you can use the embedded credentials or you can register your own OAuth app.
OAuth also enables you to use a service account to connect on behalf of users in a Google Apps domain. To authenticate with a service account, register an application to obtain the OAuth JWT values.
In addition to the OAuth values, specify the DatasetId and ProjectId. See the "Getting Started" chapter of the help documentation for a guide to using OAuth.
π Configure the source connection (Salesforce is shown)With the BigQuery Source configured, we can configure the Databricks connection and map the columns.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
You can now run the project. After the SSIS Task has finished executing, data from your SQL table will be exported to the chosen table.
Download a free trial of the Google BigQuery SSIS Component to get started:
Download NowLearn more:
π Google BigQuery IconPowerful SSIS Source & Destination Components that allows you to easily connect SQL Server with live Google BigQuery data through SSIS Workflows.
Use the Google BigQuery Data Flow Components to synchronize with Google BigQuery Tables and Datasets. Perfect for data synchronization, local back-ups, workflow automation, and more!