VOOZH about

URL: https://www.cdata.com/kb/tech/postgresql-jdbc-rapidminer.rst

⇱ Connect to PostgreSQL Data in RapidMiner


Connect to PostgreSQL Data in RapidMiner

πŸ‘ Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Integrate PostgreSQL data with standard components and data source configuration wizards in RapidMiner Studio.

This article shows how you can easily integrate the CData JDBC driver for PostgreSQL into your processes in RapidMiner. This article uses the CData JDBC Driver for PostgreSQL to transfer PostgreSQL data to a process in RapidMiner.

Connect to PostgreSQL in RapidMiner as a JDBC Data Source

You can follow the procedure below to establish a JDBC connection to PostgreSQL:

  1. Add a new database driver for PostgreSQL: Click Connections -> Manage Database Drivers.
  2. In the resulting wizard, click the Add button and enter a name for the connection.
  3. Enter the prefix for the JDBC URL:
    jdbc:postgresql:
    
  4. Enter the path to the cdata.jdbc.postgresql.jar file, located in the lib subfolder of the installation directory.
  5. Enter the driver class:
    cdata.jdbc.postgresql.PostgreSQLDriver
    
    πŸ‘ The JDBC driver configuration. (Salesforce is shown.)
  6. Create a new PostgreSQL connection: Click Connections -> Manage Database Connections.
  7. Enter a name for your connection.
  8. For Database System, select the PostgreSQL driver you configured previously.
  9. Enter your connection string in the Host box.

    To connect to PostgreSQL, set the Server, Port (the default port is 5432), and Database connection properties and set the User and Password you wish to use to authenticate to the server. If the Database property is not specified, the data provider connects to the user's default database.

    SSH Connectivity for PostgreSQL

    You can use SSH (Secure Shell) to authenticate with PostgreSQL, whether the instance is hosted on-premises or in supported cloud environments. SSH authentication ensures that access is encrypted (as compared to direct network connections).

    SSH Connections to PostgreSQL in Password Auth Mode

    To connect to PostgreSQL via SSH in Password Auth mode, set the following connection properties:

    • User: PostgreSQL User name
    • Password: PostgreSQL Password
    • Database: PostgreSQL database name
    • Server: PostgreSQL Server name
    • Port: PostgreSQL port number like 3306
    • UserSSH: "true"
    • SSHAuthMode: "Password"
    • SSHPort: SSH Port number
    • SSHServer: SSH Server name
    • SSHUser: SSH User name
    • SSHPassword: SSH Password

    SSH Connections to PostgreSQL in Public Key Auth Mode

    To connect to PostgreSQL via SSH in Password Auth mode, set the following connection properties:

    • User: PostgreSQL User name
    • Password: PostgreSQL Password
    • Database: PostgreSQL database name
    • Server: PostgreSQL Server name
    • Port: PostgreSQL port number like 3306
    • UserSSH: "true"
    • SSHAuthMode: "Public_Key"
    • SSHPort: SSH Port number
    • SSHServer: SSH Server name
    • SSHUser: SSH User name
    • SSHClientCret: the path for the public key certificate file

    Built-in Connection String Designer

    For assistance in constructing the JDBC URL, use the connection string designer built into the PostgreSQL JDBC Driver. Either double-click the JAR file or execute the jar file from the command-line.

    java -jar cdata.jdbc.postgresql.jar
    

    Fill in the connection properties and copy the connection string to the clipboard.

    πŸ‘ Using the built-in connection string designer to generate a JDBC URL (Salesforce is shown.)

    A typical connection string is below:

    User=postgres;Password=admin;Database=postgres;Server=127.0.0.1;Port=5432;
    
  10. Enter your username and password if necessary. πŸ‘ The connection to the JDBC data source. (Salesforce is shown.)

You can now use your PostgreSQL connection with the various RapidMiner operators in your process. To retrieve PostgreSQL data, drag the Retrieve operator from the Operators view. πŸ‘ A Retrieve operation to select data. (Salesforce is shown.)
With the Retrieve operator selected, you can then define which table to retrieve in the Parameters view by clicking the folder icon next to the "repository entry." In the resulting Repository Browser, you can expand your connection node to select the desired example set.

πŸ‘ The Repository Browser window you can use to select an example set. (Salesforce is shown.)

Finally, wire the output to the Retrieve process to a result, and run the process to see the PostgreSQL data.

πŸ‘ The results of the Retrieve operation. (Salesforce is shown.)

Ready to get started?

Download a free trial of the PostgreSQL Driver to get started:

 Download Now

Learn more:

πŸ‘ PostgreSQL Icon
PostgreSQL JDBC Driver

Rapidly create and deploy powerful Java applications that integrate with PostgreSQL-compatible database engines.