Always-on applications rely on automatic failover capabilities and real-time data access. CData Sync integrates live Hugging Face data into your Google BigQuery instance, allowing you to consolidate all of your data into a single location for archiving, reporting, analytics, machine learning, artificial intelligence and more.
Using CData Sync, you can replicate Hugging Face data to Google BigQuery. To add a replication destination, navigate to the Connections tab.
You are now connected to Google BigQuery and can use it as both a source and a destination.
NOTE: You can use the Label feature to add a label for a source or a destination.
[Image: Add a label.]
In this article, we demonstrate how to load Hugging Face data into Google BigQuery and use BigQuery as the replication destination.
To add a connection to your Hugging Face account, navigate to the Connections tab and configure the connection there.
HuggingFace Hub uses token-based authentication to enable access to its API. The API provides access to machine learning models, datasets, spaces, papers, and other resources on the HuggingFace Hub platform.
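As a rough illustration of what token-based authentication looks like at the HTTP level, the sketch below builds (but does not send) a request that carries the access token as a Bearer token. The `whoami-v2` endpoint, the helper name `build_whoami_request`, and the placeholder token are assumptions made for this example; CData Sync handles this exchange for you, so this is only a way to sanity-check a token outside of Sync.

```python
import urllib.request

# Placeholder value; substitute the access token from your own account.
HF_TOKEN = "hf_xxxxxxxxxxxxxxxxxxxx"


def build_whoami_request(token: str) -> urllib.request.Request:
    """Build, without sending, a request that asks the Hub who owns the token.

    The endpoint URL is an assumption for this sketch.
    """
    return urllib.request.Request(
        "https://huggingface.co/api/whoami-v2",
        headers={"Authorization": f"Bearer {token}"},
    )


request = build_whoami_request(HF_TOKEN)

# Sending the request needs network access and a real token:
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode("utf-8"))
```

If the token is valid, the response should describe the account it belongs to; an invalid token should be rejected with an HTTP error.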
To authenticate to HuggingFace Hub, you will need to provide an API Key (Access Token). To obtain your access token:
After obtaining your access token, set the following connection properties:
Profile=C:\profiles\HuggingFace.apip;ProfileSettings='APIKey=hf_xxxxxxxxxxxxxxxxxxxx';
[Image: Configuring a Source connection (Salesforce is shown).]
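If you generate connection strings for several environments, a small helper can keep the format consistent. The following is a minimal sketch that only reproduces the `Profile=...;ProfileSettings='APIKey=...';` shape shown above; the function name `build_connection_string` and its validation rule are hypothetical and not part of CData Sync.

```python
def build_connection_string(profile_path: str, api_key: str) -> str:
    """Assemble a connection string in the Profile/ProfileSettings format.

    Hypothetical helper: it rejects keys containing characters that would
    break the quoting of the ProfileSettings value.
    """
    if "'" in api_key or ";" in api_key:
        raise ValueError("API key must not contain quotes or semicolons")
    return f"Profile={profile_path};ProfileSettings='APIKey={api_key}';"


# Placeholder path and token taken from the example above.
connection_string = build_connection_string(
    r"C:\profiles\HuggingFace.apip",
    "hf_xxxxxxxxxxxxxxxxxxxx",
)
print(connection_string)
```

In practice you would read the API key from a secret store or environment variable rather than hard-coding it.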
CData Sync enables you to control replication with a point-and-click interface and with SQL queries. For each replication you wish to configure, navigate to the Jobs tab and click Add Job. Select the Source and Destination for your replication.
[Image: Select Source and Destination connections for the replication.]
To replicate an entire table, navigate to the Task tab in the Job, click Add Tasks, choose the table(s) from the list of Hugging Face tables you wish to replicate into Google BigQuery, and click Add Tasks again.
[Image: Choose entire tables to replicate (Salesforce is shown).]
Select the Overview tab in the Job, and click Configure under Schedule. You can schedule a job to run automatically by configuring it to run at specified intervals, ranging from once every 10 minutes to once every month.
[Image: Schedule your job to run automatically.]
Once you have configured the replication job, click Save Changes. You can configure any number of jobs to manage the replication of your Hugging Face data to Google BigQuery.
Once all the required configurations are made for the job, select the Hugging Face table you wish to replicate and click Run. After the replication completes successfully, a notification appears, showing the time taken to run the job and the number of rows replicated.
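To cross-check the row count reported in the notification, you could count the rows in the destination table yourself. The sketch below assumes the `google-cloud-bigquery` client library is installed and that application default credentials are configured; the project, dataset, and table names are hypothetical placeholders, since the actual names depend on your BigQuery connection and the tables you selected.

```python
def count_rows_sql(project: str, dataset: str, table: str) -> str:
    """Return a query that counts the rows in one BigQuery table."""
    return f"SELECT COUNT(*) AS row_count FROM `{project}.{dataset}.{table}`"


def count_rows(project: str, dataset: str, table: str) -> int:
    """Run the count query against BigQuery and return the result.

    Imported lazily so the SQL helper above works without the client library.
    """
    from google.cloud import bigquery

    client = bigquery.Client(project=project)
    rows = client.query(count_rows_sql(project, dataset, table)).result()
    return next(iter(rows))["row_count"]


# Hypothetical identifiers; replace them with your own.
sql = count_rows_sql("my-project", "my_dataset", "my_table")
print(sql)
```

Comparing the value returned by `count_rows` with the number shown in the job notification gives a quick sanity check that the replication wrote what it reported.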
[Image: Run the job.]
Now that you have seen how to replicate Hugging Face data into Google BigQuery, visit the CData Sync page to learn more and download a free 30-day trial. Start consolidating your enterprise data today!
As always, our world-class Support Team is ready to answer any questions you may have.