![]() |
VOOZH | about |
Columnar databases are those where the data is stored in columns instead of rows as is done in the traditional row-based databases as they offer impressive benefits in certain types of queries and data manipulation operations. Therefore, databases that are organized in columns provide better performances for operations that require applications to read data such as data analytics and data warehousing. This approach enhances the rate of query processing and reduces disk I/O making it suitable for cases that involve a large number of queries that are processed on large amounts of data.
Both Columnar and Row databases are a few methods used for processing big data analytics and data warehousing. But their approach is different from each other.
For example:
Here is an example of a simple database table with four columns and three rows.
| ID Number | Last Name | First Name | Bonus |
|---|---|---|---|
| 534782 | Miller | Ginny | 6000 |
| 585523 | Parker | Peter | 8000 |
| 479148 | Stacy | Gwen | 2000 |
In a Columnar DBMS, the data stored is in this format:
534782, 585523, 479148; Miller, Parker, Stacy; Ginny, Peter, Gwen; 6000, 8000, 2000.In a Row-oriented DBMS, the data stored is in this format:
534782, Miller, Ginny, 6000; 585523, Parker, Peter, 8000; 479148, Stacy, Gwen, 2000.Columnar databases are very beneficial for the use cases that have many queries per data record and insert only operations. Due to their characteristic of being able to read data and compress it, they are well suited to analytics and data warehouse. However; they are not suitable for transactional systems or any application where constant update is required.