onerupya.blogspot.com: Running Hadoop on Ubuntu Linux (Single-Node Cluster)

Friday, 21 November 2014

Running Hadoop on Ubuntu Linux (Single-Node Cluster)

Hadoop is a framework written in Java for running applications on large clusters of commodity hardware and incorporates features similar to those of the Google File System (GFS) and of the MapReduce computing paradigm. Hadoop’s HDFS is a highly fault-tolerant distributed file system and, like Hadoop in general, designed to be deployed on low-cost hardware. It provides high throughput access to application data and is suitable for applications that have large data sets. The main goal of this tutorial is to get a simple Hadoop installation up and running so that you can play around with the software and learn more about it.

original post can be found michael-noll.com here

1 comment:

Unknown21 November 2014 at 21:12
http://archive.ics.uci.edu/ml/
ReplyDelete
Replies

Add comment

onerupya.blogspot.com

Friday, 21 November 2014

Running Hadoop on Ubuntu Linux (Single-Node Cluster)

1 comment:

About Me

Search This Blog

Translate