![]() |
VOOZH | about |
The CData Excel Add-In for HDFS provides formulas that can query HDFS data. The following three steps show how you can automate the following task: Search HDFS data for a user-specified value and then organize the results into an Excel spreadsheet.
The syntax of the CDATAQUERY formula is the following:
=CDATAQUERY(Query, [Connection], [Parameters], [ResultLocation]);
This formula requires three inputs:
Connection: Either the connection name, such as HDFSConnection1, or a connection string. The connection string consists of the required properties for connecting to HDFS data, separated by semicolons.
In order to authenticate, set the following connection properties:
The procedure below results in a spreadsheet that organizes all the formula inputs in the first column.
=CDATAQUERY("SELECT * FROM Files WHERE FileId = '"&B5&"'","Host="&B1&";Port="&B2&";Path="&B3&";User="&B4&";Provider=HDFS",B6)
👁 Formula inputs used in this example. (Google Apps is shown.)Download a free trial of the Excel Add-In for HDFS to get started:
Download NowLearn more:
👁 HDFS IconThe HDFS Excel Add-In is a powerful tool that allows you to connect with live HDFS data, directly from Microsoft Excel.
Use Excel to read, write, and update HDFS file, etc. Perfect for mass imports / exports / updates, data cleansing & de-duplication, Excel based data analysis, and more!