category

home / developersection / category

HDFS Architecture in Hadoop
hadoop 05-May-2016
HDFS Architecture in Hadoop

The core concept of HDFS is that it can be made up of dozens, hundreds, or even thousands of individual computers, where the system’s files are stored in directly attached disk drives.

Three modes of Hadoop Cluster Architecture
hadoop 04-May-2016
Three modes of Hadoop Cluster Architecture

Hadoop is primarily structured and  designed to be deployed on a massive cluster of networked systems or nodes, featuring master nodes (which host the

Why we need Map-Reduce in Hadoop?
hadoop 04-May-2016
Why we need Map-Reduce in Hadoop?

After we have stored piles and piles of data in HDFS (a distributed storage system spread over an expandable cluster of individual slave nodes), the f

Various Data compression codecs in Hadoop
hadoop 04-May-2016
Various Data compression codecs in Hadoop

Here we enlist and identify some common codecs that are supported by the Hadoop framework. Be sure to opt for the codec that most closely matches the

Hadoop Toolbox
hadoop 02-May-2016
Hadoop Toolbox

Besides, the major contribution of Amazon EMR services and its other related tools, many other companies also provide certain useful Hadoop Tools enlisted as following:

Concept of Map Reduce in Hadoop
hadoop 01-May-2016
Concept of Map Reduce in Hadoop

Though MapReduce as a technology is relatively new, it builds upon much of the fundamental work from both mathematics and computer science, particular

Hadoop Distributions: EMC, HotonWork and MapR
hadoop 30-Apr-2016
Hadoop Distributions: EMC, HotonWork and MapR

Besides Cloudera, there are few other popular Hadoop distribution which are well implemented for commercial and development purposes.EMC: Pivotal HD,

Hadoop Distributed processing with MapReduce
hadoop 30-Apr-2016
Hadoop Distributed processing with MapReduce

MapReduce comprises the sequential processing of operations on distributed volumes of data sets. The data comprises of key-value pairs, and the overal