Abstract
Big Data is a huge collection of data that comprises both structured data found in traditional databases and unstructured data like text documents, video and audio. Big Data is not merely data but also a collection of various tools, techniques, frameworks and platforms.. Different sources and the system at various rates are used to generate the data’s approach. HADOOP is the popular tool for implementing BIG DATA. HADOOP is an open source technology that enables the distributing process of large data sets of fault tolerance with a very high degree. This paper deals with the technology aspects of BIG DATA for its implementation in organizations by using HADOOP MapReduce technique.
Conflict of Interest
The authors declare no conflict of interest.
Ethical Approval
Not applicable
Data Availability
The datasets used in this study are openly available at [repository link] and the source code is available on GitHub at [GitHub link].
Funding
This work did not receive any external funding.