Latest blog tagged on "Bigdata"

What Is the Future of Data Warehousing?

No one can deny it that today, we live in The Age of the Customer. Consumers all over the world are now digitally empowered, and they have the means to decide which businesses will succeed and grow, and which ones will fail.


3 Pillars of the Data Economy

Our industry continues to see an ongoing upsurge in data volumes. This comes with the insatiable need by industry-leading companies to exploit existing data to foster business development and growth.

By Simond Gear posted   10 days ago

Analytics Bots: What Are They and How Do They Work?

In media, the word “bot” tends to bring to mind a twitter account run by a computer program that spits out mostly nonsensical tweets at other users.

By Jayden Bell posted   21 days ago

How You Can Use Data to Improve the Customer Experience

When you want to improve user experience with your product, or when you want to improve customer experience with your service, how do you go about doing it?

By AMANDEEP SINGH posted   22 days ago

Switching from Java to Big Data/Hadoop Career: The Whys and Hows

Once in a while, there may come a feeling that you are stuck in the same job profile and living a monotonous professional life. This generally leads to the realization that a change in your profile is much needed.

By Simond Gear posted   one month ago

Three Ways AI Will Change IT Operations and Data Center Management

Artificial Intelligence, and machine learning in particular, is predicted to have an enormous impact on many industries over the next few years, not least the industry that builds and manages the infrastructure on which machine learning algorithms run.

By Simond Gear posted   one month ago

What is the Impact of Big Data on Mobile Marketing?

Every business in one way or the other has access to data corresponding to their customers, competition, and market. Naturally, to stand out from other businesses and to ensure competitive advantage they need insights from other areas.

By Ashish srivastava posted   one month ago

Big Data In Banking: Advantages and Challenges

Because of the confidential nature of data in banking services, most of the financial institutions have been slow in adapting to big data even though they do realize that there are huge benefits in terms of customer centricity.

By Royce Roy posted   3 months ago

Big Data Analytics

Big data analytics is the process of inspecting large and different data sets i.e., big data – to discover hidden patterns, market trends, unknown correlations, customer preferences and other useful information that can help organizations.

By AMANDEEP SINGH posted   5 months ago

HBase Architecture: Introduction and RegionServers(Part-1)

The reason that folks such as chief financial officers are excited by the thought of using Hadoop is that it lets us store massive amounts of data across a cluster of low cost commodity servers — that’s music to the ears of financially minded people.

By Jayden Bell posted   one year ago

Big Data: HBase as Distributed, Persistent, Multidimensional Sorted Map

Now we are very well familiar with the power packed characteristics and nature of Hbase.

By marcel ethan posted   one year ago

Big Data: ACID versus BASE Data Stores

I think back in our school days, almost all of us have studied about difference between “ACID” and “BASE” in chemistry.

By zack mathews posted   one year ago

Pig Script Interfaces and Mode of Running in Hadoop

All scripts are run on a single machine without requiring Hadoop MapReduce and HDFS. This can be useful for developing and testing Pig logic.

By Chris S posted   one year ago

Introduction to Ooize in Hadoop

Moving data and running different kinds of applications in Hadoop is great stuff, but it’s only half the battle. For Hadoop’s efficiencies to truly start paying off for us, start thinking about how we can tie together a number of these actions to for

By Jayden Bell posted   one year ago

Clustering and Classification with Mahout

Unlike the supervised learning method described earlier for Mahout’s recommendation engine feature, clustering is a kind of unsupervised learning — where the data labels points are not known ahead of time and should be inferred from the data without

By Tom Cruser posted   one year ago

Statistical Analysis in Hadoop

Big data is all about applying analytics to more data, for more people. To carry out this task, big data practitioners use new tools — such as Hadoop — to explore and understand data in ways that previously might not have been possible (challenges that were “too complex,” “too expensive,” or “too slow”). Some of the “bigger analytics” that we often hear mentioned when Hadoop comes up in a conversation revolve around concepts such as machine learning, data mining, and predictive analytics.

By Andrew Watson posted   one year ago

MapReduce Driver Class:

Although the mapper and reducer implementations are all we need to perform the MapReduce job, there is one more piece of code necessary in MapReduce:

By Andrew Watson posted   one year ago

Pig Data Types in Hadoop

We have already seen the Pig architecture and Pig Latin Application flow. We also learn the Pig Design principle in the previous post.

By Ailsa Singh posted   one year ago

Pig Design Principles in Hadoop

Pig Latin is the programing platform which provides a language for Pig programs. Pig helps to convert the Pig Latin script into MapReduce tasks that can be run within Hadoop cluster.

By Glen Martin posted   one year ago

Introduction to Pig in Hadoop

Java MapReduce programs and the Hadoop Distributed File System (HDFS) provide us with a powerful distributed computing framework, but they come with one major drawback — relying on them limits the use of Hadoop to Java programmers who can think in Map and Reduce terms when writing programs.

By Glen Martin posted   one year ago

YARN’s Resource Management

The most key component of YARN is the Resource Manager, which governs and maintains all the data processing resources in the Hadoop cluster. In other words, the Resource Manager is a dedicated scheduler who has a task to assigns resources to requesti

By Mikki Halpin posted   one year ago

Hadoop File System Commands: ls Command Output Analysis

In this post, I have explained about Hadoop File System Commands.

By David Miller posted   one year ago

HDFS Architecture in Hadoop

The core concept of HDFS is that it can be made up of dozens, hundreds, or even thousands of individual computers, where the system’s files are stored in directly attached disk drives.

By Jayden Bell posted   one year ago

Data Replication in Hadoop: Slave node disk failures (Part -2)

Hadoop was originally designed with an intention to store petabyte data at the scale, with any Potential limitations to scaling out are minimized.

By Elena Glibart posted   one year ago

Three modes of Hadoop Cluster Architecture

Hadoop is primarily structured and designed to be deployed on a massive cluster of networked systems or nodes, featuring master nodes (which host the services that maintains Hadoop’s storage and manipulating power ) and slave nodes (where the data sets are stored and processed). We can, however, run Hadoop on a single computer, which is a great way to learn the basics of Hadoop by experimenting in a controlled space.

By Felix Pickles posted   one year ago

Don't want to miss updates? Please click the below button!

Follow MindStick