Concept of Data compression in Hadoop
The massive data volumes
that are very comman in a typical Hadoop deployment make compression a
necessity. Data compression definitely saves us a great deal of storage space
and it also makes sure to accelerate the movement of that data throughout our
cluster. It’s not a big surprise that a numerous available compression schemes,
called codecs, are out there for us to consider.