The Convergence of Data Stores

July 24, 2013

Big Data, Data Grid, NOSQL

The line between data grids and NoSQL database is disappearing. The line between NoSQL databases and Big Data platforms is disappearing.

Data Grids & NoSQL Databases

There are data grids and NoSQL databases that support distributed data, search (index / query), and map / reduce tasks.

There are data grids that can persist data held in memory.
There are NoSQL databases that can cache persisted data.

NoSQL databases are better suited to semi-structured data (e.g. documents), and data grids are better suited to structured data (e.g objects).

However, data grids will evolve to support both object and document models.

There are data grids that can read / write to NoSQL databases. JBoss Data Grid

There are data grids that can integrate with Big Data platforms. GridGain

NoSQL Databases & Big Data Platforms

There are NoSQL databases and Big Data platforms that support distributed data, search (index / query), and map / reduce tasks.

NoSQL databases are better suited to semi-structured data (e.g. documents), and Big Data platforms are better suited to unstructured data (e.g. files).

However, there are NoSQL databases that support files.

There are NoSQL databases built on top of Big Data platforms. Apache HBase
There are Big Data platforms built on top of NoSQL databases. DataStax

NoSQL databases are better suited to real time processing, and Big Data platforms are better suited to batch processing.

However, there are Big Data platforms that support real time processing.

For example, Cloudera Impala.

SQL

There are data grids that support SQL. GridGain
There are Big Data platforms that support SQL. Apache Hadoop (Hive)

There are in-memory relational databases. VoltDB
There are massively parallel processing (MPP) databases. Teradata

Use Cases & Convergence

If the line between data grids and NoSQL databases is disappearing and…
the line between NoSQL databases and Big Data platforms is disappearing…

Data Grid, NoSQL, and Big Data are just use cases.

The next generation of data stores will support structured (objects), semi-structured (documents), and unstructured data (files).

The data will be both persistent and in-memory.

They will support search (index / query), map / reduce tasks, and SQL.

Until then we will see integrated stacks such as JBoss Data Grid and MongoDB, Redis and Riak, or Apache Hadoop and GridGain.

About Shane K Johnson

Technical Marketing Manager, Red Hat Inc.

View all posts by Shane K Johnson

Trackbacks/Pingbacks

  1. The Convergence of Data Stores | I can explain ... - July 25, 2013

    […] The line between data grids and NoSQL database is disappearing. The line between NoSQL databases and Big Data platforms is disappearing.  […]

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: