![]() |
VOOZH | about |
Databricks Lakehouse Federation enables organizations to query and integrate data from multiple sources without requiring data movement. It allows federated queries across databases, data warehouses, and lakehouses, providing a unified interface for data analysis and management within Databricks. When combined with CData Connect AI, it enables seamless access to MongoDB data for data virtualization, while also supporting data lineage and fine-grained access control.
This article explains how to use CData Connect AI to establish a live connection to MongoDB and how to access live MongoDB data from the Databricks platform.
Accessing and integrating live data from MongoDB has never been easier with CData. Customers rely on CData connectivity to:
MongoDB's flexibility means that it can be used as a transactional, operational, or analytical database. That means CData customers use our solutions to integrate their business data with MongoDB or integrate their MongoDB data with their data warehouse (or both). Customers also leverage our live connectivity options to analyze and report on MongoDB directly from their preferred tools, like Power BI and Tableau.
For more details on MongoDB use case and how CData enhances your MongoDB experience, check out our blog post: The Top 10 Real-World MongoDB Use Cases You Should Know in 2024.
CData Connect AI offers a seamless SQL Server, cloud-to-cloud interface for MongoDB, enabling you to effortlessly create dashboards and visualizations using live MongoDB data in Databricks. While building visualizations, Databricks requires SQL queries to retrieve the necessary data. With built-in optimized data processing, CData Connect AI pushes all supported SQL operations (such as filters and JOINs) directly to MongoDB, utilizing server-side processing for fast and efficient data retrieval of MongoDB data.
To work with MongoDB data in Databricks - Lakehouse Federation, you need to connect to MongoDB from Connect AI and provide user access to the connection.
Set the Server, Database, User, and Password connection properties to connect to MongoDB. To access MongoDB collections as tables you can use automatic schema discovery or write your own schema definitions. Schemas are defined in .rsd files, which have a simple format. You can also execute free-form queries that are not tied to the schema.
π Configuring a connection (Salesforce is shown)When connecting to Connect AI through the REST API, the OData API, or the Virtual SQL Server, a Personal Access Token (PAT) is used to authenticate the connection to Connect AI. It is best practice to create a separate PAT for each service to maintain granularity of access.
With the connection configured and a PAT generated, you are ready to connect to MongoDB data from Databricks.
Follow these steps to establish a connection from Databricks to the CData Connect AI Virtual SQL Server API.
To access the newly created catalog and create a dashboard to visualize live MongoDB data in Databricks, follow these steps:
At this stage, you have established a direct, cloud-to-cloud connection to live MongoDB data in Databricks. This enables you to create dashboards to monitor and visualize your data seamlessly.
For more details on accessing live data from over 100 SaaS, Big Data, and NoSQL sources through cloud applications like Databricks, visit our Connect AI page. As always, let us know if you have any questions during your evaluation. Our world-class CData Support Team is always available to help!
Learn more about CData Connect AI or sign up for free trial access:
Free Trial