![]() |
VOOZH | about |
Google Data Fusion allows users to perform self-service data integration to consolidate disparate data. Uploading the CData JDBC Driver for XML enables users to access live XML data from within their Google Data Fusion pipelines. While the CData JDBC Driver enables piping XML data to any data source natively supported in Google Data Fusion, this article explains how to pipe data from XML to Google BigQuery,
Upload the CData JDBC Driver for XML to your Google Data Fusion instance to work with live XML data. Due to the naming restrictions for JDBC drivers in Google Data Fusion, create a copy or rename the JAR file to match the following format driver-version.jar. For example: cdataxml-2020.jar
With the JDBC Driver uploaded, you are ready to work with live XML data in Google Data Fusion Pipelines.
NOTE: To use the JDBC Driver in Google Data Fusion, you will need a license (full or trial) and a Runtime Key (RTK). For more information on obtaining this license (or a trial), contact our sales team.
CData Drivers let you work with XML files stored locally and stored in cloud storage services like Box, Amazon S3, Google Drive, or SharePoint, right where they are.
Set the URI property to local folder path.
To connect to XML file(s) within Amazon S3, set the URI property to the URI of the Bucket and Folder where the intended XML files exist. In addition, at least set these properties:
To connect to XML file(s) within Box, set the URI property to the URI of the folder that includes the intended XML file(s). Use the OAuth authentication method to connect to Box.
To connect to XML file(s) within Dropbox, set the URI proprerty to the URI of the folder that includes the intended XML file(s). Use the OAuth authentication method to connect to Dropbox. Either User Account or Service Account can be used to authenticate.
To connect to XML file(s) within SharePoint with SOAP Schema, set the URI proprerty to the URI of the document library that includes the intended XML file. Set User, Password, and StorageBaseURL.
To connect to XML file(s) within SharePoint with REST Schema, set the URI proprerty to the URI of the document library that includes the intended XML file. StorageBaseURL is optional. If not set, the driver will use the root drive. OAuth is used to authenticate.
To connect to XML file(s) within Google Drive, set the URI property to the URI of the folder that includes the intended XML file(s). Use the OAuth authentication method to connect and set InitiateOAuth to GETANDREFRESH.
The property is the controlling property over how your data is represented into tables and toggles the following basic configurations.
See the Modeling XML Data chapter for more information on configuring the relational representation. You will also find the sample data used in the following examples. The data includes entries for people, the cars they own, and various maintenance services performed on those cars.
For assistance in constructing the JDBC URL, use the connection string designer built into the XML JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.
java -jar cdata.jdbc.xml.jar
Fill in the connection properties and copy the connection string to the clipboard.
π Using the built-in connection string designer to generate a JDBC URL (Salesforce is shown.)With the Source and Sink configured, you are ready to pipe XML data into Google BigQuery. Save and deploy the pipeline. When you run the pipeline, Google Data Fusion will request live data from XML and import it into Google BigQuery.
π Image
While this is a simple pipeline, you can create more complex XML pipelines with transforms, analytics, conditions, and more. Download a free, 30-day trial of the CData JDBC Driver for XML and start working with your live XML data in Google Data Fusion today.
Download a free trial of the XML Driver to get started:
Download NowLearn more:
π XML Documents IconRapidly create and deploy powerful Java applications that integrate with XML data stores.