Databricks is a unified data analytics platform that allows organizations to easily process, analyze, and visualize large amounts of data. It combines data engineering, data science, and machine learning capabilities in a single platform, making it easier for teams to collaborate and derive insights from their data.
The CData SSIS Components enhance SQL Server Integration Services by enabling users to easily import and export data from various sources and destinations.
In this article, we explore the data type mapping considerations when exporting to Databricks and walk through how to migrate Impala data to Databricks using the CData SSIS Components for Impala and Databricks.
| Databricks Schema | CData Schema |
|---|---|
| int, integer, int32 | int |
| smallint, short, int16 | smallint |
| double, float, real | float |
| date | date |
| datetime, timestamp | datetime |
| time, timespan | time |
| string, varchar | If length > 4000: nvarchar(max), Otherwise: nvarchar(length) |
| long, int64, bigint | bigint |
| boolean, bool | tinyint |
| decimal, numeric | decimal |
| uuid | nvarchar(length) |
| binary, varbinary, longvarbinary | binary(1000) or varbinary(max) after SQL Server 2000 |
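
The mapping in the table can be expressed as a small lookup, which may help when checking column definitions before an export. The following is a minimal sketch, not part of the CData components: the function name `to_cdata_type`, the default `length` of 255, and the `legacy_sql_server` flag are all hypothetical, and the binary rule reflects one reading of the table (binary(1000) for SQL Server 2000 and earlier, varbinary(max) afterwards).

```python
# Types that map one-to-one, taken directly from the table above.
SIMPLE_TYPES = {
    "int": "int", "integer": "int", "int32": "int",
    "smallint": "smallint", "short": "smallint", "int16": "smallint",
    "double": "float", "float": "float", "real": "float",
    "date": "date",
    "datetime": "datetime", "timestamp": "datetime",
    "time": "time", "timespan": "time",
    "long": "bigint", "int64": "bigint", "bigint": "bigint",
    "boolean": "tinyint", "bool": "tinyint",
    "decimal": "decimal", "numeric": "decimal",
}
STRING_TYPES = {"string", "varchar"}
BINARY_TYPES = {"binary", "varbinary", "longvarbinary"}


def to_cdata_type(databricks_type, length=255, legacy_sql_server=False):
    """Return the CData Schema type for a Databricks Schema type.

    `length` (default 255 is an assumption) is only used for string and
    uuid columns. `legacy_sql_server` is a hypothetical flag standing in
    for "SQL Server 2000 or earlier".
    """
    name = databricks_type.strip().lower()
    if name in SIMPLE_TYPES:
        return SIMPLE_TYPES[name]
    if name in STRING_TYPES:
        # Strings longer than 4000 characters become nvarchar(max).
        return "nvarchar(max)" if length > 4000 else f"nvarchar({length})"
    if name == "uuid":
        return f"nvarchar({length})"
    if name in BINARY_TYPES:
        return "binary(1000)" if legacy_sql_server else "varbinary(max)"
    raise ValueError(f"No mapping listed for type: {databricks_type!r}")
```

For example, `to_cdata_type("string", length=5000)` yields `nvarchar(max)`, while `to_cdata_type("bool")` yields `tinyint`.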
Follow the steps below to specify properties required to connect to Impala.
In order to connect to Apache Impala, set the Server, Port, and ProtocolVersion. You may optionally specify a default Database. To connect using alternative methods, such as NOSASL, LDAP, or Kerberos, refer to the online Help documentation.
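
As an illustration, the properties named above can be assembled into a semicolon-delimited `key=value` string. This is a sketch under assumptions: the helper `build_impala_connection_string` is hypothetical, the delimiter format is assumed rather than taken from this article, and the host, port, and protocol version in the usage note are placeholders to replace with your own values.

```python
def build_impala_connection_string(server, port, protocol_version, database=None):
    """Assemble Server, Port, ProtocolVersion and an optional Database
    into a semicolon-delimited string (format is an assumption)."""
    if not server:
        raise ValueError("Server is required")
    parts = [
        ("Server", server),
        ("Port", port),
        ("ProtocolVersion", protocol_version),
    ]
    # Database is optional, so it is only appended when provided.
    if database:
        parts.append(("Database", database))
    return "".join(f"{key}={value};" for key, value in parts)
```

Calling `build_impala_connection_string("impala.example.com", 21050, 7, database="default")` returns `Server=impala.example.com;Port=21050;ProtocolVersion=7;Database=default;`. The values shown are placeholders; check the Help documentation for the settings that match your Impala deployment.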
[Image: Configure the source connection (Salesforce is shown)]

With the Impala Source configured, we can configure the Databricks connection and map the columns.
Note: The needed values can be found in your Databricks instance by navigating to Clusters, selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.
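
If the JDBC/ODBC tab presents the values as a single JDBC URL, the individual pieces can be pulled out with a few lines of code. The sketch below is illustrative only: `parse_jdbc_url` is a hypothetical helper, and it assumes a URL of the general shape `jdbc:<scheme>://<host>:<port>/<database>;key=value;...`, which may not match what your workspace displays.

```python
def parse_jdbc_url(url):
    """Split a JDBC-style URL into host, port, database and properties.

    Assumes the shape jdbc:<scheme>://host:port/database;key=value;...
    """
    prefix, separator, rest = url.partition("://")
    if not separator or not prefix.startswith("jdbc:"):
        raise ValueError("Not a JDBC URL")
    segments = rest.split(";")
    # The first segment holds host, optional port and optional database.
    host_port, _, database = segments[0].partition("/")
    host, _, port = host_port.partition(":")
    properties = {}
    for segment in segments[1:]:
        if not segment:
            continue  # tolerate a trailing semicolon
        key, _, value = segment.partition("=")
        properties[key] = value
    return {
        "host": host,
        "port": int(port) if port else None,
        "database": database or None,
        "properties": properties,
    }
```

Given a made-up URL such as `jdbc:databricks://adb-123.example.net:443/default;transportMode=http;httpPath=sql/protocolv1/o/0/0000-000000-abc`, the result carries `adb-123.example.net` as the host, `443` as the port, and the `httpPath` entry inside `properties`.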
You can now run the project. After the SSIS Task has finished executing, data from your Impala table will be exported to the chosen Databricks table.
Download a free trial of the Impala SSIS Component to get started:
Download Now

Learn more:

Powerful SSIS Source & Destination Components that allow you to easily connect SQL Server with Impala through SSIS Workflows.
Use the Impala Data Flow Components to synchronize with Impala data. Perfect for data synchronization, local back-ups, workflow automation, and more!