
URL: https://www.cdata.com/kb/tech/excel-jdbc-talend.rst

Integrating Excel Data in Talend Cloud Data Management Platform

Dibyendu Datta
Lead Technology Evangelist
Connect Excel Data with Talend Cloud Data Management Platform using the CData JDBC Driver for Microsoft Excel.

Qlik's Talend Cloud Data Management Platform supports various data environments, enabling analytics for smarter decisions, operational data sharing, data and application modernization, and establishing data excellence for risk reduction. When paired with the CData JDBC Driver for Microsoft Excel, you can improve data integration, quality, and governance for your Excel data. This article shows how to connect to Excel from Talend Cloud Data Management Platform using the CData JDBC Driver, and then preview the data for use in your workflows.

Prerequisites

Before connecting the CData JDBC Driver to view and work with your data in Talend Cloud Data Management Platform, download and install the latest version of Talend Studio on your system. You also need the following:

  1. A Talend Cloud Data Management account with appropriate permissions.
  2. The CData JDBC Driver for Microsoft Excel, which can be downloaded from the CData website.

Connect to Excel in Talend as a JDBC data source

Access Talend Cloud Data Management Platform

  1. Run the Talend Cloud Data Management Platform installed on your local system and click on Other Login Mode.
  2. Click on Manage Connections. Set Repository to "Local" and enter the Name, Description, and User E-mail in the respective fields. Set the Workspace path and click on OK.
  3. Select the Create a new project radio button, enter a new project name, and click on Create.
  4. The new project should appear under the Select an existing project section. Click on Open. The Talend Cloud Data Management Platform workspace opens.

Create a new connection

  1. In the navigation pane, locate and expand the Metadata dropdown. Right-click on Db Connections and select Create Connection.
  2. Search for and select 'JDBC' in the Find section of the Database Connection window. Then, click on Finish.
  3. Enter the Name, Purpose and Description of the new database in Talend where you need to load the Excel data. Click on Next.
  4. Generate a JDBC URL for connecting to Excel, beginning with jdbc:excel: followed by a series of semicolon-separated connection string properties.

    Connecting to Local or Cloud-Stored (Box, Google Drive, Amazon S3, SharePoint) Excel Files

    CData Drivers let you work with Excel files right where they are, whether stored locally or in cloud storage services such as Box, Amazon S3, Google Drive, or SharePoint.

    Setting connection properties for local files

    Set the URI property to the local folder path.

    Setting connection properties for files stored in Amazon S3

    To connect to Excel file(s) within Amazon S3, set the URI property to the URI of the Bucket and Folder where the intended Excel files exist. In addition, set at least these properties:

    • AWSAccessKey: AWS Access Key (username)
    • AWSSecretKey: AWS Secret Key

    Setting connection properties for files stored in Box

    To connect to Excel file(s) within Box, set the URI property to the URI of the folder that includes the intended Excel file(s). Use the OAuth authentication method to connect to Box.

    Dropbox

    To connect to Excel file(s) within Dropbox, set the URI property to the URI of the folder that includes the intended Excel file(s). Use the OAuth authentication method to connect to Dropbox. Either a User Account or a Service Account can be used to authenticate.

    SharePoint Online (SOAP)

    To connect to Excel file(s) within SharePoint with SOAP Schema, set the URI property to the URI of the document library that includes the intended Excel file. Set User, Password, and StorageBaseURL.

    SharePoint Online (REST)

    To connect to Excel file(s) within SharePoint with REST Schema, set the URI property to the URI of the document library that includes the intended Excel file. StorageBaseURL is optional. If not set, the driver will use the root drive. OAuth is used to authenticate.

    Google Drive

    To connect to Excel file(s) within Google Drive, set the URI property to the URI of the folder that includes the intended Excel file(s). Use the OAuth authentication method to connect and set InitiateOAuth to GETANDREFRESH.

    Built-in Connection String Designer

    For assistance in constructing the JDBC URL, use the connection string designer built into the Excel JDBC Driver. Either double-click the JAR file or execute it from the command line.

    java -jar cdata.jdbc.excel.jar
    

    Fill in the connection properties and copy the connection string to the clipboard.

    A typical JDBC URL is below:

    jdbc:excel:URI='C:/MyExcelWorkbooks/SampleWorkbook.xlsx';
    
  5. Enter the JDBC URL copied from the CData JDBC Driver for Microsoft Excel in Edit new database.

  6. Click on the "(+) Add" button under Drivers. A 'newLine' will appear in the Drivers board. Click on (...) at the end of the new line, select the Install a new module radio button, and click on (...) to add the path to the JAR file, located in the 'lib' subfolder of the installation directory. Click on OK.
  7. Select the Driver Class as cdata.jdbc.excel.ExcelDriver (from the JAR file in the installation directory, added in the previous step).
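
The JDBC URL from step 4 is a fixed jdbc:excel: prefix followed by semicolon-separated name=value properties. The following is a minimal Java sketch of that assembly, not part of the driver or of Talend. The class name ExcelJdbcUrl, the bucket path, and the key values are hypothetical placeholders, and the s3:// form of the URI is an assumption to verify against the driver documentation. The property names URI, AWSAccessKey, and AWSSecretKey are the ones listed in this article. Values are appended as given, with no quoting or escaping.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class ExcelJdbcUrl {

    /** Joins the properties as name=value pairs, each terminated by a semicolon. */
    public static String build(Map<String, String> properties) {
        StringBuilder url = new StringBuilder("jdbc:excel:");
        for (Map.Entry<String, String> p : properties.entrySet()) {
            url.append(p.getKey()).append('=').append(p.getValue()).append(';');
        }
        return url.toString();
    }

    public static void main(String[] args) {
        // Local workbook: reproduces the sample URL shown in step 4.
        Map<String, String> local = new LinkedHashMap<>();
        local.put("URI", "'C:/MyExcelWorkbooks/SampleWorkbook.xlsx'");
        System.out.println(build(local));

        // Amazon S3: bucket path and keys are hypothetical placeholders.
        Map<String, String> s3 = new LinkedHashMap<>();
        s3.put("URI", "s3://my-bucket/my-folder");
        s3.put("AWSAccessKey", "MY_ACCESS_KEY");
        s3.put("AWSSecretKey", "MY_SECRET_KEY");
        System.out.println(build(s3));
    }
}
```

A LinkedHashMap keeps the properties in insertion order, so the generated string is stable and easy to compare with the one produced by the connection string designer.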

Test the new connection

  1. Click on Check Connection. If the entered details are correct, a "Connection successful" confirmation prompt will appear. Click on "OK" and "Finish".
  2. Once the connection is established, right-click on the newly created connection and select Retrieve Schema. You can also use the filters to limit what is retrieved. Click on Next.
  3. Expand the "CData" catalog in the Schema window and select the tables you want to import from the Excel schema. Click on Next.
  4. In the next step, select the columns you want to view from the table and click on Finish.
  5. All the selected tables from the Excel schema are now populated under the Table Schemas section of the JDBC connection.
  6. Right-click on any of the selected tables and click on Data Viewer to preview the data from the data source.
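
The Check Connection and Retrieve Schema steps above can be approximated outside Talend with plain JDBC, which may help when troubleshooting a URL before entering it in the wizard. The sketch below uses only standard java.sql calls. The class and method names (ExcelConnectionCheck, listTables, check) are hypothetical, and it assumes the driver JAR is on the classpath and registers itself through JDBC 4 service loading. If it does not, load the driver class selected in Talend with Class.forName first.

```java
import java.sql.Connection;
import java.sql.DatabaseMetaData;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.List;

class ExcelConnectionCheck {

    /** Opens a connection and returns the table names reported by the driver. */
    public static List<String> listTables(String jdbcUrl) throws SQLException {
        List<String> tables = new ArrayList<>();
        try (Connection conn = DriverManager.getConnection(jdbcUrl)) {
            DatabaseMetaData meta = conn.getMetaData();
            // null catalog, schema, and types mean "do not filter"; "%" matches any table name.
            try (ResultSet rs = meta.getTables(null, null, "%", null)) {
                while (rs.next()) {
                    tables.add(rs.getString("TABLE_NAME"));
                }
            }
        }
        return tables;
    }

    /** Returns a one-line status instead of throwing, similar to a connection check dialog. */
    public static String check(String jdbcUrl) {
        try {
            return "Connection successful, tables: " + listTables(jdbcUrl);
        } catch (SQLException e) {
            return "Connection failed: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Default is the sample URL from this article; pass your own URL as the first argument.
        String url = args.length > 0
                ? args[0]
                : "jdbc:excel:URI='C:/MyExcelWorkbooks/SampleWorkbook.xlsx';";
        System.out.println(check(url));
    }
}
```

If no registered driver accepts the URL, DriverManager raises an SQLException and check reports a failure rather than a stack trace. This usually means the JAR is missing from the classpath or the URL prefix is mistyped.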

Get Started Today

Download a free, 30-day trial of the CData JDBC Driver for Microsoft Excel and integrate Excel data into Talend Cloud Data Management Platform. Reach out to our Support Team if you have any questions.
