VOOZH about

URL: https://www.cdata.com/kb/tech/csv-jdbc-querysurge.rst

⇱ Validate CSV Data with QuerySurge


Validate CSV Data with QuerySurge

πŸ‘ Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Access and validate CSV data in QuerySurge using the CData JDBC Driver.

QuerySurge is a smart data testing solution that automates data validation and testing. When paired with the CData JDBC Driver for CSV, QuerySurge can work with live CSV data. This article explains how to connect to CSV data from QuerySurge.

With built-in optimized data processing, the CData JDBC Driver offers unmatched performance for interacting with live CSV data. When you issue complex SQL queries to CSV, the driver pushes supported SQL operations, like filters and aggregations, directly to CSV and utilizes the embedded SQL engine to process unsupported operations client-side (often SQL functions and JOIN operations). Its built-in dynamic metadata querying allows you to work with and analyze CSV data using native data types.

Connecting to CSV Data in QuerySurge

To connect to live CSV data from QuerySurge, you need to deploy the JDBC Driver JAR file to your QuerySurge Agent(s) and add a new connection from the QuerySurge Admin view.

Deploy the JDBC Driver

  1. Download the CData JDBC Driver for CSV installer, unzip the package, and run the JAR file to install the driver.
  2. Once the driver is installed, stop the Agent Service.
  3. Copy the JAR File (and license file if it exists) from the installation location (typically C:\Program Files\CData\CData JDBC Driver for CSV\lib\) to your Agent(s) (QuerySurge_install_dir\agent\jdbc).
  4. Restart the Agent Service.

For more information on deploying JDBC drivers for QuerySurge, refer to the QuerySurge Knowledge Base.

Configure a New Connection to CSV

  1. Log into QuerySurge and navigate to the Admin view.
  2. Click Configuration -> Connections in the Administration Tree.
  3. Click Add to create a new connection.
  4. In the QuerySurge Connection Wizard, click Next. πŸ‘ Creating a new connection
  5. Name the connection (e.g. CData JDBC Connection to CSV).
  6. Set the Data Source to "All Other JDBC Connections (Connection Extensibility)" and click Next. πŸ‘ Naming the connection and choosing a Data Source
  7. Set the Driver Class to cdata.jdbc.csv.CSVDriver and click Next. πŸ‘ Setting the Driver Class
  8. Set the Connection URL using the necessary connection properties to authenticate with CSV. Your Connection URL will look something like the following:

    jdbc:csv:URI=/PATH/TO/MyCSVFilesFolder;

    Built-in Connection String Designer

    For assistance in constructing the JDBC URL, use the connection string designer built into the CSV JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.

    java -jar cdata.jdbc.csv.jar
    	

    Fill in the connection properties and copy the connection string to the clipboard.

    Connecting to Local or Cloud-Stored (Box, Google Drive, Amazon S3, SharePoint) CSV Files

    CData Drivers let you work with CSV files stored locally and stored in cloud storage services like Box, Amazon S3, Google Drive, or SharePoint, right where they are.

    Setting connection properties for local files

    Set the URI property to local folder path.

    Setting connection properties for files stored in Amazon S3

    To connect to CSV file(s) within Amazon S3, set the URI property to the URI of the Bucket and Folder where the intended CSV files exist. In addition, at least set these properties:

    • AWSAccessKey: AWS Access Key (username)
    • AWSSecretKey: AWS Secret Key

    Setting connection properties for files stored in Box

    To connect to CSV file(s) within Box, set the URI property to the URI of the folder that includes the intended CSV file(s). Use the OAuth authentication method to connect to Box.

    Dropbox

    To connect to CSV file(s) within Dropbox, set the URI proprerty to the URI of the folder that includes the intended CSV file(s). Use the OAuth authentication method to connect to Dropbox. Either User Account or Service Account can be used to authenticate.

    SharePoint Online (SOAP)

    To connect to CSV file(s) within SharePoint with SOAP Schema, set the URI proprerty to the URI of the document library that includes the intended CSV file. Set User, Password, and StorageBaseURL.

    SharePoint Online REST

    To connect to CSV file(s) within SharePoint with REST Schema, set the URI proprerty to the URI of the document library that includes the intended CSV file. StorageBaseURL is optional. If not set, the driver will use the root drive. OAuth is used to authenticate.

    Google Drive

    To connect to CSV file(s) within Google Drive, set the URI property to the URI of the folder that includes the intended CSV file(s). Use the OAuth authentication method to connect and set InitiateOAuth to GETANDREFRESH.

    πŸ‘ Using the built-in connection string designer to generate a JDBC URL (Salesforce is shown.)
  9. Set the Test Query to enable the Test Connection button for the Connection (e.g. SELECT * FROM Customer LIMIT 1) and click Next. πŸ‘ Setting the Connection URL and Test Query (NetSuite is shown)
  10. Click Test Connection to ensure the connection is configured properly and click Save to add the connection.

Once the connection is added, you can write SQL queries against your CSV data in QuerySurge.

Compare CSV Data Queries with a QueryPair

With the connection configured, you can follow the steps below to compare querying CSV data with a QueryPair.

  1. Select "Design Library" from the Design Menu
  2. Click QueryPairs under QuerySurge Design
  3. Click "Create New QueryPair"
  4. Name the QueryPair and click Save
  5. In either the Source or Target panes, select the connection created above (select the same connection to query CSV twice or another connection to perform a comparison)
  6. Write queries in the Editor for each pane, e.g. SELECT * FROM Customer πŸ‘ Setting the Connection URL and Test Query (NetSuite is shown)
  7. Click the "Design-Time Run" tab to execute the queries
  8. When the query execution is finished, click "View Query Results" to see the CSV data returned by the query πŸ‘ Setting the Connection URL and Test Query (NetSuite is shown)

Download a free, 30-day trial of the CData JDBC Driver for CSV and start working with your live CSV data in QuerySurge. Reach out to our Support Team if you have any questions.

Ready to get started?

Download a free trial of the CSV Driver to get started:

 Download Now

Learn more:

πŸ‘ CSV/TSV Files Icon
CSV JDBC Driver

Rapidly create and deploy powerful Java applications that integrate with delimited flat-file (CSV/TSV) data.