URL: https://www.cdata.com/kb/tech/huggingface-ssis-task-import-2008.rst

Build Data Flows from Hugging Face to SQL Server using SSIS

๐Ÿ‘ Jerod Johnson
Jerod Johnson
Director, Technology Evangelism
Easily back up Hugging Face data to SQL Server using the SSIS components for Hugging Face.

Using SQL Server as a backup for critical business data provides an essential safety net against loss. Backing up data to SQL Server enables business users to more easily connect that data with features like reporting, analytics, and more.

This example demonstrates how to use the CData SSIS components for Hugging Face inside a SQL Server SSIS workflow to transfer Hugging Face data into a Microsoft SQL Server database.

Add the Components

To get started, add a new Hugging Face source and SQL Server ADO.NET destination to a new data flow task.

๐Ÿ‘ The Data Flow task used in this example. (Salesforce is shown.)

Create a New Connection Manager

Follow the steps below to save Hugging Face connection properties in a connection manager.

  1. In the Connection Manager window, right-click and then click New Connection. The Add SSIS Connection Manager dialog is displayed.
  2. In the Connection Manager type menu, select API. The CData Hugging Face Connection Manager is displayed.
  3. Configure connection properties.

    Hugging Face Hub uses token-based authentication to control access to its API. The API provides access to machine learning models, datasets, spaces, papers, and other resources on the Hugging Face Hub platform.

    Using API Key Authentication

    To authenticate to Hugging Face Hub, provide an API key (access token). To obtain your access token:

    1. Log in to your Hugging Face account at https://huggingface.co
    2. Navigate to Settings > Access Tokens
    3. Click "New token" to create a new access token
    4. Select the appropriate permissions (read or write)
    5. Copy the token value

    After obtaining your access token, set the following connection properties:

    • AuthScheme: Set this to APIKey.
    • APIKey: Set this to your Hugging Face access token.

    Example connection string

    Profile=C:\profiles\HuggingFace.apip;ProfileSettings='APIKey=hf_xxxxxxxxxxxxxxxxxxxx';
    
    ๐Ÿ‘ Configuring a connection (Salesforce is shown).

Configure the Hugging Face Source

Follow the steps below to specify the query to be used to extract Hugging Face data.

  1. Double-click the Hugging Face source to open the source component editor.
  2. In the Connection Manager menu, select the connection manager previously created.
  3. Specify the query to use for the data extraction. For example:
    SELECT <column1>, <column2> FROM Collections WHERE <column> = '<value>'
    
    ๐Ÿ‘ The SQL query to retrieve records. (Salesforce is shown.)
  4. Close the Hugging Face Source control and connect it to the ADO.NET Destination.
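A source query of the shape used in step 3 (a column list, the Collections table, and a string filter) can be composed programmatically before pasting it into the source editor. The sketch below is a hypothetical helper, not part of the CData component. The column names `Id`, `Title`, and `Owner` are invented for illustration and may not exist in the Collections table, and doubling single quotes is the standard SQL escaping convention, assumed here to apply to the component's SQL dialect as well.

```python
def quote_literal(value: str) -> str:
    """Wrap a value in single quotes, doubling any embedded single quote."""
    return "'" + value.replace("'", "''") + "'"


def build_query(table, columns, filter_column, filter_value):
    """Compose a SELECT statement with one equality filter on a string value."""
    if not columns:
        raise ValueError("at least one column is required")
    column_list = ", ".join(columns)
    return (
        f"SELECT {column_list} FROM {table} "
        f"WHERE {filter_column} = {quote_literal(filter_value)}"
    )


if __name__ == "__main__":
    # Hypothetical column names; check the table's actual columns in the
    # source component editor before using a query like this.
    print(build_query("Collections", ["Id", "Title"], "Owner", "example-user"))
```

Only the filter value is escaped here. Table and column names are inserted as given, so they should come from a trusted source such as the column list shown in the editor.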

Configure the SQL Server Destination

Follow the steps below to specify the SQL Server table into which the Hugging Face data is loaded.

  1. Open the ADO.NET Destination and add a New Connection. Enter your server and database information here.
  2. In the Data access mode menu, select "table or view".
  3. In the Table Or View menu, select the table or view to populate.
  4. On the Mappings screen, configure the column mappings as needed.

    Figure: The mappings from the SSIS source component to SQL Server. (Salesforce is shown.)

Run the Project

You can now run the project. After the SSIS Task has finished executing, your database will be populated with Hugging Face data.

๐Ÿ‘ The completed import. (Salesforce is shown.)