SQL Server databases are commonly used to store enterprise records. It is often necessary to move this data to other locations. The CData SSIS Components for BigQuery allow you to easily transfer data from SQL Server to BigQuery.
This article demonstrates how to use the CData SSIS Components for BigQuery inside of a SQL Server Integration Services workflow to export data from SQL Server to BigQuery, utilizing a lookup component to find and update any existing records or insert new records.
About BigQuery Data Integration
CData simplifies access and integration of live Google BigQuery data. Our customers leverage CData connectivity to:
- Simplify access to BigQuery with broad out-of-the-box support for authentication schemes, including OAuth, OAuth JWT, and GCP Instance.
- Enhance data workflows with Bi-directional data access between BigQuery and other applications.
- Perform key BigQuery actions like starting, retrieving, and canceling jobs; deleting tables; or insert job loads through SQL stored procedures.
Most CData customers are using Google BigQuery as their data warehouse and so use CData solutions to migrate business data from separate sources into BigQuery for comprehensive analytics. Other customers use our connectivity to analyze and report on their Google BigQuery data, with many customers using both solutions.
For more details on how CData enhances your Google BigQuery experience, check out our blog post: https://www.cdata.com/blog/what-is-bigquery
Getting Started
Add the Components
To get started, add a new OLE DB source, CData BigQuery lookup, and two CData BigQuery destinations (one to handle a lookup with a match and one to handle a lookup without a match) to a new data flow task.
π OLD DB Source, CData BigQuery Lookup and CData BigQuery Destinations
Configure the OLE DB Source
Follow the steps below to specify properties required to connect to the SQL Server instance.
- Open the OLE DB Source and add a new connection. Enter your server and database information here.
- In the Data access mode menu, select "Table or view" and select the table or view to export into BigQuery.
π The SQL Server table to export into BigQuery. (Salesforce is shown.)
- Close the OLE DB Source wizard and connect it to the CData BigQuery Destination.
Create a New Connection Manager for BigQuery
Follow the steps below to save connection properties in the Connection Manager.
- Create a new Connection manager: In the Connection Manager window, right-click and then click New Connection. The Add SSIS Connection Manager dialog is displayed.
- In the Connection Manager type menu, select the CData BigQuery Connection Manager.
- Configure the connection properties. A typical BigQuery connection requires:
- User
- Password
- AccessToken
π The Connection Manager. (Salesforce is shown.)
Configure the BigQuery Lookup
In the lookup component Transform Editor, define mappings from the SQL Server source table into the BigQuery lookup table.
- Double-click the BigQuery lookup to open the lookup component editor.
- In the Connection tab, select the Connection manager previously created.
- On the Connection tab, specify the Connection manager (or create a new one) and the table into which the data will be upserted. In this case, you will transfer Account records.
π The table into which the data will be exported. (Salesforce is shown.)
- On the Columns tab, configure the mapping of the primary key from the input columns to the primary key of the lookup columns.
π The mappings from SQL Server source to the SSIS lookup component. (Salesforce is shown.)
Configure the BigQuery Destinations
Using the lookup component, you know which entries from the data source already exist in BigQuery and which ones are new. Configure two BigQuery components to handle the situation where a match is found and a match is not found.
Lookup Match Destination
If the lookup component returns a match, update the existing record in BigQuery.
- Map the Lookup Match Output to a BigQuery destination component.
- Double-click the BigQuery destination to open the destination component editor.
- In the Connection Manager tab, select the Connection manager previously created.
- Specify the table which will be updated and set the Action to "Update."
π The table which will be updated. (Salesforce is shown.)
- On the Column Mappings tab, configure the mappings from the input columns to the destination columns (be sure to map the SQL Server primary key column to the BigQuery primary key column).
π The mappings from lookup component to the destination component. (Salesforce is shown.)
Lookup No Match Destination
If the lookup component does not return a match, add a new record to BigQuery.
- Map the Lookup No Match Output to a BigQuery destination component.
- Double-click the BigQuery destination to open the destination component editor.
- In the Connection Manager tab, select the Connection manager previously created.
- Specify the table into which the new record will be inserted and set the Action to "Insert."
π The table into which the new record will be inserted. (Salesforce is shown.)
- On the Column Mappings tab, configure the mappings from the input columns to the destination columns.
π The mappings from lookup component to the destination component. (Salesforce is shown.)
Run the Project
You can now run the project.
After the SSIS Task has finished executing, data from your SQL table will be exported to the chosen table.
π The completed upsert. (Salesforce is shown.)