Tag Archives: Apache Hadoop

BDAS – Berkeley Data Analytics Stack

September 5, 2013

0 Comments

Yesterday, I highlighted AMPLab and a set of projects that can be used as an alternative to Apache Hadoop (link).

Today, I’m highlighting BDAS: Berkeley Data Analytics Stack.

[…]

Continue reading...

AMP’d for Hadoop Alternatives

September 4, 2013

1 Comment

Apache Hadoop is not the only game in town.

I’ve been following AMPLab (UC Berkeley – link) for some time now. The goal of this collaboration is to tame big data by integrating algorithms, machines, and people (AMP).

[…]

Continue reading...

Topic of Interest – Distributed Stream Processing

August 29, 2013

1 Comment

I’ve been interested in distributed stream processing since the Storm and S4 announcements. Storm was open sourced by Twitter and S4, now an Apache Incubator project, was open sourced by Yahoo! Now, LinkedIn has open sourced Samza.
[…]

Continue reading...

Faster Big Data

August 29, 2013

3 Comments

Volume is one thing. Velocity is another.

Apache Hadoop
It is the big data platform for storing and processing lots of data, later.

What about storing and processing a subset of that data, now, by…
[…]

Continue reading...