NoSQL databases usually store non-relational type of data on a super large scale and can solve problems which relational databases can not manage such as predicting subscriber behavior, indexing the entire Internet, or targeting ads on a large platform namely Facebook.
A list of popular NoSQL databases can be found below:
Types of NoSQL Databases
Hadoop allows distributed processing of large datasets across clusters of computers using comprehensive programming models.
Hadoop mainly consists of:
(a) Processing/Computation layer (MapReduce),
(b) Storage layer (Hadoop Distributed File System).
MapReduce is a comprehensive parallel programming model for writing distributed applications of large amounts of data (multi-terabyte data-sets), on large clusters of commodity hardware in a fault-tolerant and reliable manner. The MapReduce program runs on Hadoop which is an Apache open-source framework.
5Vs of Big Data
The Structure of Big Data
Big Data consists of huge volume, high velocity, and extensible variety of data. The data in it will be of following three types.
- Structured data: Relational data.
- Semi Structured data: XML data.
- Unstructured data: Word, PDF, Text, Media Logs.
The Applications of Big Data Analytics