![]() |
VOOZH | about |
Databricks is a unified data analytics platform that allows organizations to easily process, analyze, and visualize large amounts of data. It combines data engineering, data science, and machine learning capabilities in a single platform, making it easier for teams to collaborate and derive insights from their data.
The CData SSIS Components enhance SQL Server Integration Services by enabling users to easily import and export data from various sources and destinations.
In this article, we explore the data type mapping considerations when exporting to Databricks and walk through how to migrate Excel Online data to Databricks using the CData SSIS Components for Excel Online and Databricks.
| Databricks Schema | CData Schema |
|---|---|
|
int, integer, int32 |
int |
|
smallint, short, int16 |
smallint |
|
double, float, real |
float |
|
date |
date |
|
datetime, timestamp |
datetime |
|
time, timespan |
time |
|
string, varchar |
If length > 4000: nvarchar(max), Otherwise: nvarchar(length) |
|
long, int64, bigint |
bigint |
|
boolean, bool |
tinyint |
|
decimal, numeric |
decimal |
|
uuid |
nvarchar(length) |
|
binary, varbinary, longvarbinary |
binary(1000) or varbinary(max) after SQL Server 2000 |
Follow the steps below to specify properties required to connect to Excel Online.
You can connect to a workbook by providing authentication to Excel Online and then setting the following properties:
: Set this to the name or Id of the workbook.
If you want to view a list of information about the available workbooks, execute a query to the Workbooks view after you authenticate.
You use the OAuth authentication standard to authenticate to Excel Online. See the Getting Started section in the help documentation for a guide. Getting Started also guides you through executing SQL to worksheets and ranges.
π Configure the source connection (Salesforce is shown)With the Excel Online Source configured, we can configure the Databricks connection and map the columns.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
You can now run the project. After the SSIS Task has finished executing, data from your SQL table will be exported to the chosen table.
Download a free trial of the Excel Online SSIS Component to get started:
Download NowLearn more:
π SSIS Data Flow Components IconPowerful SSIS Source & Destination Components that allows you to easily connect SQL Server with live Excel Online data through SSIS Workflows.
Use the Excel Online Data Flow Components to synchronize with @@@, etc. Perfect for data synchronization, local back-ups, workflow automation, and more!