VOOZH about

URL: https://thenewstack.io/introducing-tigergraph-native-parallel-graph-database/

⇱ Introducing TigerGraph, a Native Parallel Graph Database - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2017-09-19 09:00:31
Introducing TigerGraph, a Native Parallel Graph Database
contributed,

Introducing TigerGraph, a Native Parallel Graph Database

Sep 19th, 2017 9:00am by Yu Xu
👁 Featued image for: Introducing TigerGraph, a Native Parallel Graph Database
Feature image via Pixabay.
Yu Xu
Dr. Yu Xu is the founder and CEO of TigerGraph, the world’s first native parallel graph database. Dr. Xu received his Ph.D. in Computer Science and Engineering from the University of California San Diego. He is an expert in big data and parallel database systems and has over 26 patents in parallel data management and optimization. Prior to founding GraphSQL, Dr. Xu worked on Twitter’s data infrastructure for massive data analytics. Before that, he worked as Teradata’s Hadoop architect where he led the company’s big data initiatives.

Graph databases are the fastest growing category in all of data management, according to DB Engines, a database consultancy. They have gained quick adoption for several reasons. First and foremost, they overcome the challenge of storing massive, complex and interconnected data by storing data in a graph format, including nodes, edges and properties. They also offer advantages over both traditional RDBMS and newer big data products — including, but not limited to the following:

  • Faster at “joining” related data objects,
  • Better scalability to larger data sets,
  • More flexibility for evolving structures.

Indeed, over the past few years, the industry has seen a considerable adoption of graph databases by enterprises, mostly in vertical sectors. Leading graph vendor players include DataStax and Neo4j.

While graph technology offers clear competitive advantages, these current solutions are not without their limitations. The most critical of which is the inability to support real-time analytics — a sore point given that enterprise data is always growing and becoming more complex. This is because solutions thus far have been either storage or compute-focused, feature-limited analytics capabilities and are unable to update graphs in real time.

Introducing TigerGraph: The First Native Parallel Graph (NPG)

As the world’s first Native Parallel Graph (NPG) database, TigerGraph sets out to solve these challenges. Unlike other technologies, the TigerGraph NPG focuses on both storage and computation, supporting real-time graph updates and offering built-in parallel computation. NPGs represent what’s next in the graph database evolution: a complete, distributed, parallel graph computing platform supporting web-scale data analytics in real-time.

Unifying the MapReduce and parallel graph processing paradigms, TigerGraph’s computing platform is based on the BSP (Bulk Synchronous Parallel) programming model which enables developers to implement a scalable parallel graph algorithm quickly and easily. A SQL-like graph query language (GSQL) provides for ad-hoc exploration and interactive analysis of Big Data.

Unique Advantages

The platform is developed considering real-time processing of massive data, extreme scalability, and performance demands from modern data-driven organizations. The architecture is modular and supports both scale-up and scale-out deployment models for distributed applications. General advantages of the NPG include:

  • Fast data loading speed to build graph,
  • Fast execution of parallel graph algorithms,
  • Real-time capability for streaming updates and inserts using REST,
  • Ability to unify real-time analytics with large-scale offline data processing.

Specific performance features include:

  • Ability to traverse hundreds of millions of vertices/edges per second per machine,
  • Ability to load 50 to 150 GB of data per hour, per machine.
  • Ability to stream 2B+ daily events in real-time.

This last point was proven on a graph with 100 billion vertices and 600 billion edges on an only 20 commodity machine cluster, battle-tested by the world’s largest e-payment company with over two years in production.

A Closer Look Under the Hood

Fresh Development

The NPG’s core system was developed from scratch using C++ and system programming concepts to provide an integrated data technology stack. A native graph storage engine (GSE) was developed to co-locate with the graph processing engine (GPE) for fast and efficient processing of data and algorithms.

GPE is designed to provide built-in parallelism for a MapReduce-based computing model available via APIs. The graph is optimally stored both on disk and in-memory, allowing the system to take advantage of the data locality on disk, in-memory and CPU cache.

High Compression Rates

The TigerGraph system performs efficient data compression to take further advantage of the memory and CPU cache. The compression ratio, input data size to output graph size, varies with the data and structure; however, a 10x compression ratio is very common. For example, 1TB of input data, when transformed and loaded in the graph, requires only 100GB of system memory.

Such compression reduces not only the memory footprint but also cache misses, speeding up overall query performance.

MPP Computational Model

In the TigerGraph system, the graph is both a storage and a computational model. Each vertex and edge can be associated with a compute function. Therefore, it acts as a parallel unit of storage and computation simultaneously. This is unlike existing graph databases such as Neo4j.

With this approach, vertices and edges in the graph are not just static data units anymore; they become active computing elements. These vertices can send messages to each other via edges and respond to the messages.

A vertex or an edge in the graph can store any amount of arbitrary information. The TigerGraph system executes compute functions in parallel on every vertex/edge, taking advantage of multi-core CPU machines and in-memory computing.

Graph Partitioning

The TigerGraph system supports a variety of graph partitioning algorithms to deliver the best performance. In most cases, the partitioning is automatically performed on the input data and it delivers great results without requiring optimization and tuning. But the flexibility in the TigerGraph system allows application-specific and other mixed partitioning strategies to achieve even greater application performance.

Built into the TigerGraph system is the ability to run multiple graph engines as an active-active network. Each graph engine can host identical graphs with different partitioning algorithms tailored for different types of application queries. The front-end server (generally a REST server) can route application queries to different graph engines based on the types of query.

A Transformational Technology

Whereas for large quantities of data, loading speed is slow, the TigerGraph NPG can load certain sized data in one hour — compared to over a day with other database systems. Naturally, this offers positive implications for use cases with massive amounts of data.

Further, by offering parallelism for large-scale graph analytics, the NPG supports graph parallel algorithms for Very Large Graphs (VLGs) — providing a considerable technological advantage which grows as graphs inevitably grow larger. The NPG works for limited, fast queries that touch anywhere from a small portion of the graph to millions of vertices and edges, as well as more complex analysis that must touch every single vertex in the graph itself. Additionally, the NPG’s real-time incremental graph updates make it suitable for real-time graph analytics, unlike other solutions.

An advantage with NPG lies in the fact that it represents graphs as a computational model. As discussed previously, compute functions can be associated with each vertex and edge in the graph, transforming them into active parallel compute-storage elements, in a behavior identical to what neurons exhibit in human brains. Vertices in the graph can exchange messages via edges, facilitating massively parallel and fast computation.

To learn more about the TigerGraph Native Parallel Graph, visit the Wiki here.

TRENDING STORIES
Dr. Yu Xu is the founder and CEO of TigerGraph, the world’s first native parallel graph database. Dr. Xu received his Ph.D in Computer Science and Engineering from the University of California San Diego. He is an expert in big...
Read more from Yu Xu
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Real.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.