VOOZH about

URL: https://www.cdata.com/kb/tech/googledatacatalog-jdbc-snaplogic.rst

⇱ Integrate Google Data Catalog with External Services using SnapLogic


Integrate Google Data Catalog with External Services using SnapLogic

πŸ‘ Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Use CData JDBC drivers in SnapLogic to integrate Google Data Catalog with External Services.

SnapLogic is an integration platform-as-a-service (iPaaS) that allows users to create data integration flows with no code. When paired with the CData JDBC Drivers, users get access to live data from more than 250+ SaaS, Big Data and NoSQL sources, including Google Data Catalog, in their SnapLogic workflows.

With built-in optimized data processing, the CData JDBC Driver offers unmatched performance for interacting with live Google Data Catalog data. When platforms issue complex SQL queries to Google Data Catalog, the driver pushes supported SQL operations, like filters and aggregations, directly to Google Data Catalog and utilizes the embedded SQL engine to process unsupported operations client-side (often SQL functions and JOIN operations). Its built-in dynamic metadata querying lets you work with Google Data Catalog data using native data types.

Connect to Google Data Catalog in SnapLogic

To connect to Google Data Catalog data in SnapLogic, download and install the CData Google Data Catalog JDBC Driver. Follow the installation dialog. When the installation is complete, the JAR file can be found in the installation directory (C:/Program Files/CData/CData JDBC Driver for Google Data Catalog/lib by default).

Upload the Google Data Catalog JDBC Driver

After installation, upload the JDBC JAR file to a location in SnapLogic (for example, projects/Jerod Johnson) from the Manager tab.

πŸ‘ Uploaded JDBC Driver (Salesforce & QuickBooks Online are shown)

Configure the Connection

Once the JDBC Driver is uploaded, we can create the connection to Google Data Catalog.

  1. Navigate to the Designer tab
  2. Expand "JDBC" from Snaps and drag a "Generic JDBC - Select" snap onto the designer πŸ‘ Adding a Generic JDBC snap onto the designer
  3. Click Add Account (or select an existing one) and click "Continue"
  4. In the next form, configure the JDBC connection properties:
    • Under JDBC JARs, add the JAR file we previously uploaded
    • Set JDBC Driver Class to cdata.jdbc.googledatacatalog.GoogleDataCatalogDriver
    • Set JDBC URL to a JDBC connection string for the Google Data Catalog JDBC Driver, for example:

      jdbc:googledatacatalog:ProjectId=YourProjectId;InitiateOAuth=GETANDREFRESH;RTK=XXXXXX;

      NOTE: RTK is a trial or full key. Contact our Support team for more information. πŸ‘ Configuring a connection (Salesforce is shown)

      Built-In Connection String Designer

      For assistance in constructing the JDBC URL, use the connection string designer built into the Google Data Catalog JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.

      java -jar cdata.jdbc.googledatacatalog.jar

      Fill in the connection properties and copy the connection string to the clipboard.

      Google Data Catalog uses the OAuth authentication standard. Authorize access to Google APIs on behalf on individual users or on behalf of users in a domain.

      Before connecting, specify the following to identify the organization and project you would like to connect to:

      • OrganizationId: The ID associated with the Google Cloud Platform organization resource you would like to connect to. Find this by navigating to the cloud console.

        Click the project selection drop-down, and select your organization from the list. Then, click More -> Settings. The organization ID is displayed on this page.

      • ProjectId: The ID associated with the Google Cloud Platform project resource you would like to connect to.

        Find this by navigating to the cloud console dashboard and selecting your project from the Select from drop-down. The project ID will be present in the Project info card.

      When you connect, the OAuth endpoint opens in your default browser. Log in and grant permissions to the application to completes the OAuth process. For more information, refer to the OAuth section in the Help documentation.

      πŸ‘ Using the built-in connection string designer to generate a JDBC URL (Salesforce is shown.)
  5. After entering the connection properties, click "Validate" and "Apply"

Read Google Data Catalog Data

In the form that opens after validating and applying the connection, configure your query.

  • Set Schema name to "GoogleDataCatalog"
  • Set Table name to a table for Google Data Catalog using the schema name, for example: "GoogleDataCatalog"."Schemas" (use the drop-down to see the full list of available tables)
  • Add Output fields for each item you wish to work with from the table
πŸ‘ Configuring a Select snap (Salesforce is shown)

Save the Generic JDBC - Select snap.

With connection and query configured, click the end of the snap to preview the data (highlighted below).

πŸ‘ Click the end of the snap to preview the data.

Once you confirm the results are what you expect, you can add additional snaps to funnel your Google Data Catalog data to another endpoint.

πŸ‘ Previewing data (Salesforce is shown).

Piping Google Data Catalog Data to External Services

For this article, we will load data in a Google Spreadsheet. You can use any of the supported snaps, or even use a Generic JDBC snap with another CData JDBC Driver, to move data into an external service.

  1. Start by dropping a "Worksheet Writer" snap onto the end of the "Generic JDBC - Select" snap.
  2. Add an account to connect to Google Sheets πŸ‘ Connecting to Google
  3. Configure the Worksheet Writer snap to write your Google Data Catalog data to a Google Spreadsheet πŸ‘ Writing to a Google Spreadsheet

You can now execute the fully configured pipeline to extract data from Google Data Catalog and push it into a Google Spreadsheet.

πŸ‘ Data written to Google Spreadsheets (Salesforce is shown)

More Information & Free Trial

Using the CData JDBC Driver for Google Data Catalog you can create a pipeline in SnapLogic for integrating Google Data Catalog data with external services. For more information about connecting to Google Data Catalog, check at our CData JDBC Driver for Google Data Catalog page. Download a free, 30 day trial of the CData JDBC Driver for Google Data Catalog and get started today.

Ready to get started?

Download a free trial of the Google Data Catalog Driver to get started:

 Download Now

Learn more:

πŸ‘ Google Data Catalog Icon
Google Data Catalog JDBC Driver

Rapidly create and deploy powerful Java applications that integrate with Google Data Catalog.