Elasticsearch is a popular distributed full-text search engine. By storing data centrally, you can run ultra-fast searches, fine-tune relevance, and perform powerful analytics with ease. Elasticsearch has a pipeline tool for loading data called "Logstash". You can use CData JDBC Drivers to import data from the data sources they support into Elasticsearch for search and analysis.
This article explains how to use the CData JDBC Driver for ScrapingBee to load data from ScrapingBee into Elasticsearch via Logstash.
Now, let's create a configuration file for Logstash to transfer ScrapingBee data to Elasticsearch.
ScrapingBee uses API key authentication, so you need an API key from your ScrapingBee account before connecting.
After obtaining your API key, set the following connection properties:
Profile=C:\profiles\ScrapingBee.apip;AuthScheme=APIKey;ProfileSettings="APIKey=your_api_key";
Once the authentication is configured, you can connect to ScrapingBee and query data from any of the available tables. All tables require at least one input parameter (such as a search query or product ID) to retrieve data.
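The contents of "logstash.conf" are not reproduced here, so the following is only a minimal sketch of what such a file might look like, rendered from Python so the connection properties above are filled in consistently. The settings under jdbc and elasticsearch are standard Logstash plugin options. Everything else is an assumption to check against your installation: the "jdbc:api:" URL prefix, the driver class name, the JAR path, the Elasticsearch host, and the table and column names in the SQL statement are hypothetical placeholders. Only the index name api_table is taken from the query used later in this article.

```python
from string import Template

# Skeleton of logstash.conf: a jdbc input feeding an elasticsearch output.
# The connection string is wrapped in single quotes because it contains
# double quotes around the ProfileSettings value.
LOGSTASH_CONF = Template("""\
input {
  jdbc {
    jdbc_driver_library => "$driver_jar"
    jdbc_driver_class => "$driver_class"
    jdbc_connection_string => '$jdbc_url'
    jdbc_user => ""
    statement => "$statement"
  }
}

output {
  elasticsearch {
    hosts => ["$es_host"]
    index => "$index"
  }
}
""")


def render_conf(profile_path, api_key, statement, driver_jar, index="api_table"):
    """Return the text of a logstash.conf for the given connection settings."""
    jdbc_url = (
        "jdbc:api:"  # ASSUMPTION: URL prefix used by the CData API Driver
        f"Profile={profile_path};AuthScheme=APIKey;"
        f'ProfileSettings="APIKey={api_key}";'
    )
    return LOGSTASH_CONF.substitute(
        driver_jar=driver_jar,
        driver_class="Java::cdata.jdbc.api.APIDriver",  # ASSUMPTION: class name
        jdbc_url=jdbc_url,
        statement=statement,
        es_host="http://localhost:9200",  # ASSUMPTION: local, unsecured node
        index=index,
    )


conf = render_conf(
    profile_path=r"C:\profiles\ScrapingBee.apip",
    api_key="your_api_key",
    # HYPOTHETICAL table and input parameter; use a real table from the driver.
    statement="SELECT * FROM ExampleTable WHERE Query = 'coffee'",
    # HYPOTHETICAL location of the driver JAR.
    driver_jar="C:/drivers/cdata.jdbc.api.jar",
)
print(conf)
```

Saving the printed text as "logstash.conf" gives a file that can be passed to Logstash with the -f option. The WHERE clause reflects the requirement that every table needs at least one input parameter.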
Now let's run Logstash using the created "logstash.conf" file.
logstash-7.8.0\bin\logstash -f logstash.conf
When the pipeline runs, Logstash prints a log indicating success. This means the ScrapingBee data has been loaded into Elasticsearch.
For example, let's view the data transferred to Elasticsearch in Kibana.
GET api_table/_search
{
  "query": {
    "match_all": {}
  }
}
We have confirmed that the data is stored in Elasticsearch.
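The same match_all check can also be run outside Kibana. The sketch below uses only the Python standard library and assumes an Elasticsearch node at http://localhost:9200 with security disabled. Both the host and the helper names are illustrative assumptions, not part of the CData or Elastic tooling.

```python
import json
import urllib.request


def build_search_request(base_url: str, index: str) -> urllib.request.Request:
    """Build the HTTP request equivalent to the Kibana match_all query."""
    body = json.dumps({"query": {"match_all": {}}}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/{index}/_search",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",  # _search accepts POST when a request body is sent
    )


def fetch_hits(base_url: str = "http://localhost:9200",
               index: str = "api_table") -> list:
    """Send the search and return the list of matching documents."""
    req = build_search_request(base_url, index)
    with urllib.request.urlopen(req, timeout=10) as resp:
        payload = json.load(resp)
    return payload["hits"]["hits"]
```

Calling fetch_hits() against a running node returns the documents that Logstash indexed. An empty list suggests the pipeline has not written to api_table yet.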
Used with Logstash, the CData JDBC Driver for ScrapingBee functions as a ScrapingBee connector, making it easy to load data into Elasticsearch. Please try the 30-day free trial.