![]() |
VOOZH | about |
Power BI transforms your company's data into rich visuals for you to collect and organize so you can focus on what matters to you. When paired with CData Connect AI, you get instant access to Impala data for visualizations, dashboards, and more. This article shows how to build and publish a dataset from Impala data in Power BI and then create reports on Impala data in the Power BI service.
CData Connect AI provides a pure SQL interface for Impala, allowing you to easily build reports from live Impala data in Power BI β with no need to replicate the data. As you build visualizations, Power BI generates SQL queries to gather data. Using optimized data processing out of the box, CData Connect AI pushes all supported SQL operations (filters, JOINs, etc) directly to Impala, leveraging server-side processing to quickly return Impala data.
NOTE: You can also import Impala data into Power BI through Connect AI (instead of using the on-premise gateway). Read how in the related Knowledge Base article.
Connectivity to Impala from Power BI is made possible through CData Connect AI. To work with Impala data from Power BI, we start by creating and configuring a Impala connection.
In order to connect to Apache Impala, set the Server, Port, and ProtocolVersion. You may optionally specify a default Database. To connect using alternative methods, such as NOSASL, LDAP, or Kerberos, refer to the online Help documentation.
π Configuring a connection (Salesforce is shown)When connecting to Connect AI through the REST API, the OData API, or the Virtual SQL Server, a Personal Access Token (PAT) is used to authenticate the connection to Connect AI. It is best practice to create a separate PAT for each service to maintain granularity of access.
With the connection configured and a PAT generated, you are ready to connect to Impala data from Power BI.
To connect to and visualize live Impala data in the Power BI service, install the on-premise data gateway, add a data source to the gateway from the Power BI service, and publish a dataset from Power BI Desktop to the service.
The Microsoft on-premise data gateway provides secure data transfer between connected data sources and various cloud-based Microsoft tools and platforms. You can read more about the gateway in the Microsoft documentation.
You can download and install the gateway from the Power BI service:
Once you have installed the data gateway, you add Connect AI as a data source to the Power BI service:
With the gateway installed and Connect AI added as a datasource to the Power BI service, you can publish a dataset from Power BI Desktop to the service.
Power BI detects each column's data type from the Impala metadata reported by Connect AI.
Power BI records your modifications to the query in the Applied Steps section, adjusting the underlying data retrieval query that is executed to the remote Impala data. When you click Close and Apply, Power BI executes the data Retrieval query.
Otherwise, click Load to pull the data into Power BI.
Now that you have published a dataset to the Power BI service, you can create new reports and dashboards based on the published data:
Now you have a direct connection to live Impala data from the Power BI service. You can create more data sources and new visualizations, build reports, and more β all without replicating Impala data.
To get live data access to hundreds of SaaS, Big Data, and NoSQL sources directly from your cloud applications, sign up for a free trial of CData Connect AI.
Learn more about CData Connect AI or sign up for free trial access:
Free Trial