![]() |
VOOZH | about |
The Apache Solr platform is a popular, blazing-fast, open source enterprise search solution built on Apache Lucene.
Apache Solr is equipped with the Data Import Handler (DIH), which can import data from databases and, XML, CSV, and JSON files. When paired with the CData JDBC Driver for BigQuery, you can easily import BigQuery data to Apache Solr. In this article, we show step-by-step how to use CData JDBC Driver in Apache Solr Data Import Handler and import BigQuery data for use in enterprise search.
CData simplifies access and integration of live Google BigQuery data. Our customers leverage CData connectivity to:
Most CData customers are using Google BigQuery as their data warehouse and so use CData solutions to migrate business data from separate sources into BigQuery for comprehensive analytics. Other customers use our connectivity to analyze and report on their Google BigQuery data, with many customers using both solutions.
For more details on how CData enhances your Google BigQuery experience, check out our blog post: https://www.cdata.com/blog/what-is-bigquery
> solr create -c CDataCoreFor this article, Solr is running as a standalone instance in the local environment and you can access the core at this URL: http://localhost:8983/solr/#/CDataCore/core-overview
GoogleBigQueryUniqueKeyπ Define schema in Solr for BigQuery data.
Now we are ready to use BigQuery data in Solr.
In this section, we walk through configuring the Data Import Handler.
<lib dir="${solr.install.dir:../../../..}/dist/" regex="solr-dataimporthandler-.*\.jar" />
<requestHandler name="/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler">
<lst name="defaults">
<str name="config">solr-data-config.xml</str>
</lst>
</requestHandler>
<dataConfig>
<dataSource driver="cdata.jdbc.googlebigquery.GoogleBigQueryDriver" url="jdbc:googlebigquery:DataSetId=MyDataSetId;ProjectId=MyProjectId;InitiateOAuth=GETANDREFRESH;">
</dataSource>
<document>
<entity name="Orders"
query="SELECT Id,GoogleBigQueryColumn1,GoogleBigQueryColumn2,GoogleBigQueryColumn3,GoogleBigQueryColumn4,GoogleBigQueryColumn5,GoogleBigQueryColumn6,GoogleBigQueryColumn7,LastModifiedDate FROM Orders"
deltaQuery="SELECT Id FROM Orders where LastModifiedDate >= '${dataimporter.last_index_time}'"
deltaImportQuery="SELECT Id,GoogleBigQueryColumn1,GoogleBigQueryColumn2,GoogleBigQueryColumn3,GoogleBigQueryColumn4,GoogleBigQueryColumn5,GoogleBigQueryColumn6,GoogleBigQueryColumn7,LastModifiedDate FROM Orders where Id=${dataimporter.delta.Id}">
<field column="Id" name="Id" ></field>
<field column="GoogleBigQueryColumn1" name="GoogleBigQueryColumn1" ></field>
<field column="GoogleBigQueryColumn2" name="GoogleBigQueryColumn2" ></field>
<field column="GoogleBigQueryColumn3" name="GoogleBigQueryColumn3" ></field>
<field column="GoogleBigQueryColumn4" name="GoogleBigQueryColumn4" ></field>
<field column="GoogleBigQueryColumn5" name="GoogleBigQueryColumn5" ></field>
<field column="GoogleBigQueryColumn6" name="GoogleBigQueryColumn6" ></field>
<field column="GoogleBigQueryColumn7" name="GoogleBigQueryColumn7" ></field>
<field column="LastModifiedDate" name="LastModifiedDate" ></field>
</entity>
</document>
</dataConfig>> solr stop -all > solr start
Using the CData JDBC Driver for BigQuery you are able to create an automated import of BigQuery data into Apache Solr. Download a free, 30 day trial of any of the hundreds of CData JDBC Drivers and get started today.
Download a free trial of the Google BigQuery Driver to get started:
Download NowLearn more:
π Google BigQuery IconRapidly create and deploy powerful Java applications that integrate with Google BigQuery data including Tables and Datasets.