Tag Archives: Distributed

AMP’d for Hadoop Alternatives

September 4, 2013

1 Comment

Apache Hadoop is not the only game in town.

I’ve been following AMPLab (UC Berkeley) for some time now. The goal of this collaboration is to tame big data by integrating algorithms, machines, and people (AMP).

[…]

Topic of Interest – Distributed Stream Processing

August 29, 2013

1 Comment

I’ve been interested in distributed stream processing since the Storm and S4 announcements. Storm was open sourced by Twitter, and S4, now an Apache Incubator project, was open sourced by Yahoo!. Now LinkedIn has open sourced Samza.

[…]

ACID v BASE

May 21, 2013

1 Comment

It started with a post on High Scalability about ATMs, ACID transactions, and BASE. It continued on Hacker News.

It ended with this:

The system is BASE. The components are ACID.

[…]

To the Moon

April 17, 2013

0 Comments

Last week HP released the HP ProLiant Moonshot Server, based on the HP Moonshot 1500 Chassis. Two things occurred to me: it has an awesome name, and it’s an ideal platform for distributed systems.

[…]

Big Data and the Flying V

January 29, 2013

2 Comments

Big Data in Theory

What is it? It’s big data. Right?

I’m not sure if I like the term Big Data. I think it’s right up there with the term Cloud.

I do, however, like the framework created by Doug Laney: Volume, Velocity, and Variety. It’s the de facto description of Big Data, and it predates the Big Data phenomenon. That, and I like both alliteration and the KISS principle. Who doesn’t?

[…]

Data Grid. Why?

November 7, 2012

1 Comment

Why use a data grid? This post aims to answer that very question.

[…]

Data Grid, JBoss Data Grid

October 31, 2012

2 Comments

This post gives an overview of the features that are unique to JBoss Data Grid or that are implemented differently than in Oracle Coherence. These features range from low-level configuration options (e.g. single node locking) to programming models (e.g. batching).

[…]

We, Data Grid

October 25, 2012

8 Comments

This post gives an overview of the standard features, functionality, and configuration options of a data grid. It does so by establishing a baseline with two data grids: JBoss Data Grid (6.0.1) and Oracle Coherence (3.7.1). It begins with core concepts and proceeds directly to the intermediate and advanced concepts implemented by both data grids.

[…]

Data Grid – Cache Evolved

October 17, 2012

3 Comments

A data grid is one part cache and one part NoSQL. This post discusses clustered caches, their limitations, and how data grids have incorporated NoSQL concepts to overcome them. It concludes with a couple of use cases related to application server clusters and data grids.

[…]
