Evaluation of Big Data Tools: A Comparative Study

Authors

  • Prince Hamza Shafique Department of Computer Science, NFC Institute of Engineering and technology, Multan, Pakistan.
  • Mubashar Hussain Department of Computer Science, University of Engineering and Technology, Lahore, Pakistan.
  • Salahuddin Department of Computer Science, NFC Institute of Engineering and technology, Multan, Pakistan.
  • Meiraj Aslam Department of Computer Science, NFC Institute of Engineering and technology, Multan, Pakistan.
  • Muhammad Sufyan Department of Computer Science, NFC Institute of Engineering and technology, Multan, Pakistan.
  • Syed Shahid Abbas Department of Computer Science, NFC Institute of Engineering and technology, Multan, Pakistan.

Keywords:

Java Execution Environment, Apache Software Foundation, IBM Workbooks, Computing Cluster

Abstract

Due to increasing usage of internet huge volume of data is available online. Main source of this gigantic volume of data are social networking sites like Facebook and tweeter etc. It is difficult to handle this huge volume of data. This growing data affects business badly. This data is called Big Data. There are many tools for Big data analytics in this research our focus is on four Big data tools 1) Hadoop, 2) IBM InfoSphere BigInsights, 3) High Performance Computing Cluster (HPCC) and 4) Apache Spark. In this research I have studied architectures, file systems, shortcomings and solutions of those problems. In future this research could be enhanced by running an algorithm on all these tools and then comparing the results. These tools can also be compared by setting some parameters.

Downloads

Published

2024-10-25

How to Cite

Prince Hamza Shafique, Mubashar Hussain, Salahuddin, Meiraj Aslam, Muhammad Sufyan, & Syed Shahid Abbas. (2024). Evaluation of Big Data Tools: A Comparative Study. Journal of Computing & Biomedical Informatics. Retrieved from https://jcbi.org/index.php/Main/article/view/727

Issue

Section

Articles