Archive | Big Data

Poll Results: Do you have a data grid?

September 10, 2013

0 Comments

I created a poll last October: Do you have a data grid?

I removed it in July after creating two new polls, but here are the results.

[…]

Continue reading...

BDAS – Berkeley Data Analytics Stack

September 5, 2013

0 Comments

Yesterday, I highlighted AMPLab and a set of projects that can be used as an alternative to Apache Hadoop (link).

Today, I’m highlighting BDAS: Berkeley Data Analytics Stack.

[…]

Continue reading...

AMP’d for Hadoop Alternatives

September 4, 2013

1 Comment

Apache Hadoop is not the only game in town.

I’ve been following AMPLab (UC Berkeley – link) for some time now. The goal of this collaboration is to tame big data by integrating algorithms, machines, and people (AMP).

[…]

Continue reading...

Topic of Interest – Distributed Stream Processing

August 29, 2013

1 Comment

I’ve been interested in distributed stream processing since the Storm and S4 announcements. Twitter open sourced Storm, and Yahoo! open sourced S4, now an Apache Incubator project. Now, LinkedIn has open sourced Samza.

[…]

Continue reading...

Faster Big Data

August 29, 2013

3 Comments

Volume is one thing. Velocity is another.

Apache Hadoop
It is the big data platform for storing and processing lots of data, later.

What about storing and processing a subset of that data, now, by…
[…]

Continue reading...

The Convergence of Data Stores

July 24, 2013

1 Comment

The line between data grids and NoSQL databases is disappearing. The line between NoSQL databases and Big Data platforms is disappearing.

[…]

Continue reading...

GridLogZ – Build, Install, & Configure

June 21, 2013

0 Comments

This screencast demonstrates how to build, install, and configure GridLogZ. It does not include an audio soundtrack.

Prior to the screencast, the following items were downloaded from the Red Hat Customer Portal (link):

  • JBoss EAP 6.0
  • JBoss EAP 6.1
  • JBoss EAP 6.1 Maven Repository
  • JBoss Data Grid 6.1 Maven Repository

[…]

Continue reading...

GridLogZ: In-Memory Log Analysis

June 20, 2013

2 Comments

History

I wanted to create a proof of concept that demonstrated how to leverage JBoss Data Grid (JDG) with JBoss EAP. I wanted to leverage the map / reduce framework in JDG. However, I needed a data source: a big data source.

I decided to use log data because analyzing log files (with Apache Hadoop, for example) is an established use case for big data / analytics.

[…]

Continue reading...

To the Moon

April 17, 2013

0 Comments

Last week, HP released the HP ProLiant Moonshot Server based on the HP Moonshot 1500 Chassis (link). Two things occurred to me: it has an awesome name, and it’s an ideal platform for distributed systems.

[…]

Continue reading...

Boost performance with Red Hat JBoss Data Grid and Intel

March 5, 2013

1 Comment

This afternoon Tony Hamilton, Enterprise Marketing Manager for Big Data & Analytics, and I will be giving a webinar on Big Data and data grids.

Please join us as we cover everything from recent Red Hat / Intel announcements to the components of Big Data solutions to hybrid Big Data / data grid architectures.

[…]

Continue reading...