Big Data Archive

NoSQL Databases

NoSQL databases typically store non-relational data at very large scale and can handle problems that relational databases struggle with, such as predicting subscriber behavior, indexing the entire Internet, or targeting ads on a large platform such as Facebook.

A list of popular NoSQL databases can be found below:

[Figure: Types of NoSQL Databases]

Hadoop Overview

Hadoop allows distributed processing of large datasets across clusters of computers using simple programming models.

[Figure: Hadoop basics]

Hadoop mainly consists of:

(a) Processing/Computation layer (MapReduce),

(b) Storage layer (Hadoop Distributed File System).


MapReduce is a parallel programming model for writing distributed applications that process large amounts of data (multi-terabyte datasets) on large clusters of commodity hardware in a reliable, fault-tolerant manner. MapReduce programs run on Hadoop, which is an Apache open-source framework.
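To make the model concrete, here is a minimal single-process sketch of the map, shuffle, and reduce steps, using word counting as the example. It is plain Python and does not use the Hadoop API. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are assumptions chosen for illustration. On a real cluster, Hadoop would run many map and reduce tasks in parallel across machines and handle the grouping step itself.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence over a list of documents."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical sample input: each string stands in for one input record.
counts = word_count(["big data big clusters", "data on clusters"])
print(counts)
```

Because each call to `map_phase` depends only on its own record, and each call to `reduce_phase` depends only on its own key, both steps could be split across many machines without changing the result.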


5Vs of Big Data

The Structure of Big Data

Big Data is characterized by huge volume, high velocity, and a wide variety of data. The data falls into the following three types.

  • Structured data: Relational data.
  • Semi-structured data: XML data.
  • Unstructured data: Word documents, PDFs, text, media logs.
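The difference between the three types shows up in how much a program can assume before reading the data. The sketch below uses only the Python standard library. All sample values (the names, the email address, the log line) are made up for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured: every row has the same fixed columns, as in a relational table.
table = io.StringIO("id,name\n1,Ada\n2,Lin\n")
rows = list(csv.DictReader(table))

# Semi-structured: XML elements are tagged, but records may carry different fields.
xml_text = (
    "<users>"
    '<user id="1"><name>Ada</name></user>'
    '<user id="2"><name>Lin</name><email>lin@example.com</email></user>'
    "</users>"
)
root = ET.fromstring(xml_text)
users = [
    {"id": user.get("id"), **{child.tag: child.text for child in user}}
    for user in root.findall("user")
]

# Unstructured: free text has no schema; any structure must be extracted by the program.
log_line = "User Ada opened the report at 10:42 and exported a PDF"
words = log_line.split()

print(rows)
print(users)
print(words)
```

In this example the second XML record has an `email` field that the first lacks, which a fixed-column table could not represent without adding an empty column to every row.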

The Applications of Big Data Analytics