Previously, we were introduced to the HBase architecture and examined its most basic component, the RegionServer.
An HBase data store comprises one or more tables that are indexed by row keys. Data is stored in rows with columns, and rows can have multiple versions.
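The data model described above can be pictured with a small in-memory sketch. This is plain Python, not the HBase client API: the `ToyTable` class, its method names, and the `info:email` column name are hypothetical, chosen only for illustration. It also assumes that versions are kept per cell and distinguished by timestamp.

```python
from collections import defaultdict


class ToyTable:
    """Hypothetical stand-in for a table: row key -> column -> {timestamp: value}."""

    def __init__(self):
        self._rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row_key, column, value, timestamp):
        # Writing the same cell again with a new timestamp adds a version
        # instead of overwriting the old value.
        self._rows[row_key][column][timestamp] = value

    def get(self, row_key, column, max_versions=1):
        # Return up to max_versions (timestamp, value) pairs, newest first.
        versions = self._rows.get(row_key, {}).get(column, {})
        newest_first = sorted(versions.items(), reverse=True)
        return newest_first[:max_versions]

    def scan(self):
        # Row keys come back in sorted order, since the table is indexed by row key.
        return sorted(self._rows)


table = ToyTable()
table.put("user#002", "info:email", "c@example.com", timestamp=1)
table.put("user#001", "info:email", "a@example.com", timestamp=1)
table.put("user#001", "info:email", "b@example.com", timestamp=2)

latest = table.get("user#001", "info:email")
history = table.get("user#001", "info:email", max_versions=2)
```

Here `latest` holds only the newest version of the cell, while `history` holds both versions, newest first.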
As we might have guessed, Google’s BigTable distributed data storage system (DDSS) was designed to meet the demands of big data. Big data applications store massive amounts of data, but big data content is also often variable.
I think everybody remembers their first surfing experience on the World Wide Web; we just knew that it was an incredible innovation for the IT industry.
Selecting a database that fits our application's requirements is a daunting task, since “one size does not fit all”.
Pig Latin has a simple syntax with powerful semantics that we will use to carry out two primary operations:
Developers and programmers continue to explore various approaches to leveraging the distributed computation benefits of MapReduce and the almost limitless storage capabilities of HDFS in intuitive ways that can be exploited by R.
Machine learning refers to a field of artificial intelligence (AI) that provides tools enabling computers to improve their analysis on the basis of previous events.
The MapReduce API has two main classes that do the MapReduce work for us: the Mapper class and the Reducer class.
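The division of work between the two classes can be sketched with a word count. In Hadoop these are Java classes; the code below is plain Python and does not use the Hadoop API. The names `WordCountMapper`, `SumReducer`, and `run_job` are hypothetical, and the single-process `run_job` driver only imitates the map, group-by-key, and reduce steps that a real cluster would distribute.

```python
from collections import defaultdict


class WordCountMapper:
    """Hypothetical mapper: turns one input record into (word, 1) pairs."""

    def map(self, key, value):
        # key is the record's position (unused here); value is a line of text.
        for word in value.lower().split():
            yield word, 1


class SumReducer:
    """Hypothetical reducer: collapses all values for one key into a total."""

    def reduce(self, key, values):
        yield key, sum(values)


def run_job(lines, mapper, reducer):
    # Map phase: feed each record to the mapper and group its output by key.
    grouped = defaultdict(list)
    for offset, line in enumerate(lines):
        for out_key, out_value in mapper.map(offset, line):
            grouped[out_key].append(out_value)

    # Reduce phase: call the reducer once per key, in sorted key order.
    result = {}
    for key in sorted(grouped):
        for out_key, out_value in reducer.reduce(key, grouped[key]):
            result[out_key] = out_value
    return result


counts = run_job(["big data big", "data store"], WordCountMapper(), SumReducer())
```

The mapper never sees more than one record at a time and the reducer never sees more than one key at a time, which is what lets a framework run many copies of each in parallel.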
“Simple” often means “elegant” when it comes to those remarkable architectural drawings for that new Silicon Valley mansion we have planned for when the money starts rolling in after we implement Hadoop.
With the release of Hadoop 2, however, YARN was introduced, which opened the door to a whole new world of data processing opportunities.
The Mapper class is responsible for providing the implementation of the map tasks in a MapReduce job.