URL: https://www.cdata.com/kb/tech/athena-odbc-spss-modeler.rst

How to Seamlessly Import Amazon Athena Data into IBM SPSS Modeler

πŸ‘ Mohsin Turki
Mohsin Turki
Technical Marketing Engineer
Integrate Amazon Athena data into IBM SPSS Modeler using the CData ODBC Driver for real-time insights and advanced data analysis.

IBM SPSS Modeler is a data mining and predictive analytics platform that enables organizations to extract insights from their data. By connecting Amazon Athena data to SPSS Modeler via the CData ODBC Driver for Amazon Athena, you get real-time access for data mining, predictive modeling, and statistical analysis.

This guide takes you through the steps of connecting IBM SPSS Modeler to Amazon Athena data so that you can import, prepare, and analyze it. With the CData ODBC Driver for Amazon Athena, you can work with your Amazon Athena data directly within IBM SPSS Modeler.

About Amazon Athena Data Integration

CData provides the easiest way to access and integrate live data from Amazon Athena. Customers use CData connectivity to:

  • Authenticate securely using a variety of methods, including IAM credentials, access keys, and Instance Profiles, catering to diverse security needs and simplifying the authentication process.
  • Streamline their setup and quickly resolve issues with detailed error messaging.
  • Enhance performance and minimize strain on client resources with server-side query execution.

Users frequently integrate Athena with analytics tools like Tableau, Power BI, and Excel to run in-depth analysis in the tools they already use.

To learn more about unique Amazon Athena use cases with CData, check out our blog post: https://www.cdata.com/blog/amazon-athena-use-cases.


Getting Started


Overview

Here is an overview of the steps:

  1. CONFIGURE THE ODBC DRIVER: Set up a connection to Amazon Athena data in the CData ODBC Driver for Amazon Athena by entering the required connection properties.
  2. SET UP ODBC CONNECTION IN SPSS MODELER: Establish the ODBC connection within IBM SPSS Modeler by selecting the configured DSN.
  3. IMPORT AND PROCESS DATA: Import the Amazon Athena data into SPSS Modeler, then review, filter, transform, and prepare the data for predictive analytics and statistical modeling.

Configure the Amazon Athena DSN Using the CData ODBC Driver

To start, configure the DSN (Data Source Name) for Amazon Athena data on your system using the CData ODBC Driver. Download and install the 30-day free trial of the driver, which includes all features.

Once installed, launch the ODBC Data Source Administrator:

  • On Windows: Search for ODBC Data Source Administrator in the Start menu and open the application.
  • On Mac: Open Applications, go to Utilities, and select ODBC Manager.
  • On Linux: Use the command line to launch ODBC Data Source Administrator or use unixODBC if installed.
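
On Linux, unixODBC keeps its DSN definitions in INI-style files (typically /etc/odbc.ini or ~/.odbc.ini). As a rough sketch of how you might check which DSNs are defined, the snippet below parses odbc.ini-style text with Python's standard configparser. The DSN name and driver path in the sample are hypothetical placeholders, not values taken from the driver's installer.

```python
import configparser


def list_odbc_dsns(ini_text: str) -> dict:
    """Return {dsn_name: driver} for each DSN section in odbc.ini-style text."""
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(ini_text)
    dsns = {}
    for section in parser.sections():
        if section == "ODBC Data Sources":
            continue  # index section that lists DSNs; it is not a DSN itself
        # Option lookup is case-insensitive by default, so "Driver"/"driver" both match.
        dsns[section] = parser.get(section, "Driver", fallback="")
    return dsns


# Hypothetical sample content; real names and paths depend on your installation.
SAMPLE = """\
[ODBC Data Sources]
CData AmazonAthena Source = CData ODBC Driver for Amazon Athena

[CData AmazonAthena Source]
Driver = /path/to/driver.so
Description = Example entry
"""

if __name__ == "__main__":
    for name, driver in list_odbc_dsns(SAMPLE).items():
        print(f"{name}: {driver}")
```

To inspect a real file, read its contents and pass the text to `list_odbc_dsns`.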
πŸ‘ ODBC Data Source Administrator

Once launched, double-click the CData Amazon Athena data source and enter the required values to establish a connection:

Authenticating to Amazon Athena

To authorize Amazon Athena requests, provide the credentials for an administrator account or for an IAM user with custom permissions: set the access key connection property to the access key Id, and set the secret key connection property to the secret access key.

Note: Though you can connect as the AWS account administrator, it is recommended to use IAM user credentials to access AWS services.
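
If you prefer to pass credentials at connection time instead of storing them in the DSN, an ODBC connection string can carry them as key=value pairs. The sketch below is a minimal, hedged example of assembling such a string: it wraps values that contain ODBC special characters in braces, following the general ODBC connection-string convention. The property names `AccessKey` and `SecretKey` and the DSN name are assumptions for illustration; confirm the exact names in the driver's documentation. The environment variable names are the ones AWS tools conventionally use.

```python
import os

# Characters that the ODBC connection-string syntax treats specially.
ODBC_SPECIAL = set("[]{}(),;?*=!@")


def odbc_escape(value: str) -> str:
    """Wrap a value in braces if it contains special characters or edge whitespace."""
    if any(ch in ODBC_SPECIAL for ch in value) or value != value.strip():
        # A literal closing brace inside a braced value is written twice.
        return "{" + value.replace("}", "}}") + "}"
    return value


def build_connection_string(properties: dict) -> str:
    """Join properties into 'Key=Value;' form, escaping values as needed."""
    return ";".join(f"{k}={odbc_escape(str(v))}" for k, v in properties.items()) + ";"


if __name__ == "__main__":
    conn_str = build_connection_string({
        "DSN": "CData AmazonAthena Source",                        # assumed DSN name
        "AccessKey": os.environ.get("AWS_ACCESS_KEY_ID", ""),      # assumed property name
        "SecretKey": os.environ.get("AWS_SECRET_ACCESS_KEY", ""),  # assumed property name
    })
    # Avoid printing secrets; report only that the string was built.
    print(f"Built a connection string of {len(conn_str)} characters")
```

Reading the keys from the environment keeps them out of scripts and saved stream files.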

Obtaining the Access Key

To obtain the credentials for an IAM user, follow the steps below:

  1. Sign into the IAM console.
  2. In the navigation pane, select Users.
  3. To create or manage the access keys for a user, select the user and then select the Security Credentials tab.

To obtain the credentials for your AWS root account, follow the steps below:

  1. Sign into the AWS Management console with the credentials for your root account.
  2. Select your account name or number and select My Security Credentials in the menu that is displayed.
  3. Click Continue to Security Credentials and expand the Access Keys section to manage or create root account access keys.

Authenticating from an EC2 Instance

If you are using the CData ODBC Driver for Amazon Athena from an EC2 instance that has an IAM role assigned, you can use that role to authenticate. To do so, set the driver's EC2 roles property to true and leave the access key and secret key properties empty. The driver will automatically obtain your IAM role credentials and authenticate with them.

Authenticating as an AWS Role

In many situations it is preferable to authenticate with an IAM role instead of the direct security credentials of an AWS root user. To use an AWS role, specify it in the driver's role connection property; the driver then attempts to retrieve credentials for the specified role. If you are connecting to AWS from outside (rather than already being connected, such as on an EC2 instance), you must additionally specify the access key and secret key of an IAM user who will assume the role. Roles cannot be used when specifying the access key and secret key of an AWS root user.

Authenticating with MFA

For users and roles that require multi-factor authentication (MFA), specify the driver's MFA connection properties. The driver then submits the MFA credentials in a request to retrieve temporary authentication credentials. Note that the duration of the temporary credentials can be controlled via the corresponding duration property (default 3600 seconds).
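
The rules above about which credentials go together are easy to get wrong, so it can help to encode them as a small pre-flight check. The following is only an illustrative sketch: the `AuthSettings` fields are hypothetical stand-ins for the driver's connection properties, not its actual property names, and the checks cover only the access key, EC2 role, and root-user rules stated in this section.

```python
from dataclasses import dataclass


@dataclass
class AuthSettings:
    # Hypothetical field names standing in for the driver's connection properties.
    access_key: str = ""
    secret_key: str = ""
    use_ec2_roles: bool = False
    role: str = ""
    keys_belong_to_root: bool = False


def check_auth_settings(s: AuthSettings) -> list:
    """Return human-readable problems; an empty list means the rules are satisfied."""
    problems = []
    has_keys = bool(s.access_key or s.secret_key)
    if s.use_ec2_roles:
        # On EC2 the instance's IAM role supplies credentials, so keys stay empty.
        if has_keys:
            problems.append("leave the access key and secret key empty when using the EC2 role")
    elif not (s.access_key and s.secret_key):
        problems.append("provide both the access key and the secret key")
    # A role cannot be assumed with the keys of an AWS root user.
    if s.role and has_keys and s.keys_belong_to_root:
        problems.append("roles cannot be used with root account keys; use an IAM user's keys")
    return problems


if __name__ == "__main__":
    print(check_auth_settings(AuthSettings(use_ec2_roles=True, access_key="example")))
```

Running a check like this before saving the DSN can surface a conflicting combination earlier than a failed connection attempt would.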

Connecting to Amazon Athena

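In addition to the access key and secret key properties, specify the region, the database, and the S3 staging location. Set the region property to the region where your Amazon Athena data is hosted. Set the S3 staging property to a folder in S3 where you would like to store the results of queries.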

If the database property is not set in the connection, the driver connects to the default database set in Amazon Athena.
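
A mistyped S3 folder is a common reason for a failed first connection. As a small sketch, the helper below checks that a staging location looks like an `s3://bucket/prefix` URI and splits it into bucket and prefix. The bucket name and path in the example are made up, and appending a trailing slash to the prefix is a convention chosen for this sketch, not a documented requirement of the driver.

```python
from urllib.parse import urlsplit


def parse_s3_folder(uri: str) -> tuple:
    """Split an s3://bucket/prefix URI into (bucket, prefix); raise on other shapes."""
    parts = urlsplit(uri)
    if parts.scheme != "s3" or not parts.netloc:
        raise ValueError(f"expected an s3://bucket/folder URI, got {uri!r}")
    prefix = parts.path.lstrip("/")
    if prefix and not prefix.endswith("/"):
        prefix += "/"  # treat the prefix as a folder (a convention of this sketch)
    return parts.netloc, prefix


if __name__ == "__main__":
    # Hypothetical bucket and folder names.
    print(parse_s3_folder("s3://example-athena-results/queries"))
```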

πŸ‘ Configuring ODBC DSN (Salesforce is shown)

Set Up an ODBC Connection in IBM SPSS Modeler

After configuring the DSN, it's time to connect to it in IBM SPSS Modeler:

You are now ready to process and analyze the Amazon Athena data in IBM SPSS Modeler.
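
If an import into SPSS Modeler returns nothing or fails, it can be useful to confirm outside the tool that the data source returns rows. The sketch below previews a table through any Python DB-API connection. It is demonstrated with an in-memory SQLite database so that it runs without a driver installed; the table name and rows are invented for the demo.

```python
import sqlite3


def preview_table(conn, table: str, limit: int = 5):
    """Return (column_names, rows) for up to `limit` rows of `table`."""
    # Accept only simple identifiers, since the name is interpolated into the SQL.
    if not table.replace("_", "").isalnum():
        raise ValueError(f"unexpected table name: {table!r}")
    cur = conn.cursor()
    cur.execute(f"SELECT * FROM {table} LIMIT {int(limit)}")
    columns = [d[0] for d in cur.description]
    rows = cur.fetchall()
    cur.close()
    return columns, rows


if __name__ == "__main__":
    # Stand-in database for the demo; not an Athena connection.
    demo = sqlite3.connect(":memory:")
    demo.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
    demo.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Ada"), (2, "Grace")])
    columns, rows = preview_table(demo, "customers", limit=5)
    print(columns, len(rows))
```

With the pyodbc package installed, a connection opened with `pyodbc.connect("DSN=<your DSN name>")` could be passed to the same function, assuming the DSN configured earlier and a table that exists in your Athena database.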


Process Data: Filter, Categorize, and Model

Once the tables are imported, you can refine, filter, categorize, and model your Amazon Athena data in SPSS Modeler:

You have now performed a simple analysis, enabling SPSS Modeler to process and display insights from your database.


Unlock the Potential of Your Amazon Athena Data with CData

With the CData ODBC Driver for Amazon Athena, you can connect Amazon Athena data to IBM SPSS Modeler with minimal setup. Start your free trial today and put your real-time data to work for advanced analytics and decision-making.

Amazon Athena ODBC Driver

The Amazon Athena ODBC Driver allows you to connect to live data from Amazon Athena directly from any application that supports ODBC connectivity.

Access Amazon Athena interactive query services data like you would a database, through a standard ODBC Driver interface.