![]() |
VOOZH | about |
Elasticsearch is a popular distributed full-text search engine. By centrally storing data, you can perform ultra-fast searches, fine-tuning relevance, and powerful analytics with ease. Elasticsearch has a pipeline tool for loading data called "Logstash". You can use CData JDBC Drivers to easily import data from any data source into Elasticsearch for search and analysis.
This article explains how to use the CData JDBC Driver for Google Data Catalog to load data from Google Data Catalog into Elasticsearch via Logstash.
Now, let's create a configuration file for Logstash to transfer Google Data Catalog data to Elasticsearch.
Google Data Catalog uses the OAuth authentication standard. Authorize access to Google APIs on behalf on individual users or on behalf of users in a domain.
Before connecting, specify the following to identify the organization and project you would like to connect to:
Click the project selection drop-down, and select your organization from the list. Then, click More -> Settings. The organization ID is displayed on this page.
Find this by navigating to the cloud console dashboard and selecting your project from the Select from drop-down. The project ID will be present in the Project info card.
When you connect, the OAuth endpoint opens in your default browser. Log in and grant permissions to the application to completes the OAuth process. For more information, refer to the OAuth section in the Help documentation.
Now let's run Logstash using the created "logstash.conf" file.
logstash-7.8.0\bin\logstash -f logstash.conf
A log indicating success will appear. This means the Google Data Catalog data has been loaded into Elasticsearch.
For example, let's view the data transferred to Elasticsearch in Kibana.
GET googledatacatalog_table/_search
{
"query": {
"match_all": {}
}
}
👁 Querying the Google Data Catalog data loaded into ElasticsearchWe have confirmed that the data is stored in Elasticsearch.
👁 Confirming the Google Data Catalog data loaded into ElasticsearchBy using the CData JDBC Driver for Google Data Catalog with Logstash, it functions as a Google Data Catalog connector, making it easy to load data into Elasticsearch. Please try the 30-day free trial.
Download a free trial of the Google Data Catalog Driver to get started:
Download NowLearn more:
👁 Google Data Catalog IconRapidly create and deploy powerful Java applications that integrate with Google Data Catalog.