![]() |
VOOZH | about |
We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.
Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.
Follow TNS on your favorite social media networks.
Become a TNS follower on LinkedIn.
Check out the latest featured and trending stories while you wait for your first TNS newsletter.
Distributed Postgres vendor pgEdge continues to tackle the complexities of logical replication with its latest version, known as the “Constellation Release,” which offers enhanced parallel processing, large object support and error handling.
The greater throughput, flexibility and control make pgEdge a viable open source alternative for legacy database workloads requiring multimaster capability, according to Phillip Merrick, pgEdge CEO. Today these workloads are typically running on legacy platforms such as Oracle Goldengate, he said in an email.
These workloads need the ability to take both read and write traffic at multiple nodes in a distributed database cluster to achieve low data latency or very high levels of availability (four or five nines) or both. An example might be a financial services application that operates across multiple regions or a critical e-commerce application that can never be down, he explained.
Merrick and his cofounder Denis Lussier, who together previously created EnterpriseDB, built pgEdge on open source PostgreSQL with the idea that a few nodes in a geographically distributed cluster, each handling reads and writes, can offer low latency, high availability, resiliency and performance.
Though a powerful feature, logical replication in Postgres presents challenges including consistency, synchronization, conflict resolution and overhead, affecting performance.
The Constellation Release features include:
While counting these features as part of this release, the company announced automated Data Definition Language (DDL) replication and Snowflake sequences in April.
DDL is used to create and modify Postgres objects through command statements like `CREATE`, `ALTER` and `DROP`. Postgres traditionally required modifications to table definitions through DDL commands be made manually on each node. With this automation, you can update the database schema on a single node, and the changes are seamlessly propagated to other nodes within the cluster.
Snowflake sequences address the complexity of managing sequences in multimaster replication scenarios. In a distributed multimaster Postgres system, the sequence must be updated across different regions and if each node updates the sequence independently it creates conflicts that cannot be resolved. This replacement for PostgreSQL sequence definitions provides a unique sequence — a timestamp, a counter and a unique node identifier — within a cluster that can be used across different regions without the need to write code or modify schema.
The company announced its free tier Cloud Developer Edition in January and its vector search capability using open source extension pgvector last October.
In the fourth quarter, pgEdge said it will add high-performance parallel replication, which it calls “a game-changer in the industry” to its capabilities.
High-performance parallel replication is the ability to run multiple data replication streams on each network connection between nodes, using multiple CPUs at each node for significantly higher levels of data throughput, Merrick explained. Current replication architectures for Postgres only permit one stream between nodes, placing an upper limit on replication performance.
This boost in replication throughput in high-traffic, cross-region transactional workloads will enable users to manage larger volumes of data replication across distributed clusters in high-demand environments while reducing lag times and ensuring timely synchronization.