![]() |
VOOZH | about |
Elasticsearch is a popular distributed full-text search engine. By centrally storing data, you can perform ultra-fast searches, fine-tuning relevance, and powerful analytics with ease. Elasticsearch has a pipeline tool for loading data called "Logstash". You can use CData JDBC Drivers to easily import data from any data source into Elasticsearch for search and analysis.
This article explains how to use the CData JDBC Driver for Impala to load data from Impala into Elasticsearch via Logstash.
Now, let's create a configuration file for Logstash to transfer Impala data to Elasticsearch.
In order to connect to Apache Impala, set the Server, Port, and ProtocolVersion. You may optionally specify a default Database. To connect using alternative methods, such as NOSASL, LDAP, or Kerberos, refer to the online Help documentation.
Now let's run Logstash using the created "logstash.conf" file.
logstash-7.8.0\bin\logstash -f logstash.conf
A log indicating success will appear. This means the Impala data has been loaded into Elasticsearch.
For example, let's view the data transferred to Elasticsearch in Kibana.
GET apacheimpala_table/_search
{
"query": {
"match_all": {}
}
}
👁 Querying the Impala data loaded into ElasticsearchWe have confirmed that the data is stored in Elasticsearch.
👁 Confirming the Impala data loaded into ElasticsearchBy using the CData JDBC Driver for Impala with Logstash, it functions as a Impala connector, making it easy to load data into Elasticsearch. Please try the 30-day free trial.
Download a free trial of the Impala Driver to get started:
Download NowLearn more:
👁 Apache Impala IconRapidly create and deploy powerful Java applications that integrate with Impala.