Journal IJCRT UGC-CARE, UGCCARE( ISSN: 2320-2882 ) | UGC Approved Journal | UGC Journal | UGC CARE Journal | UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, International Peer Reviewed Journal and Refereed Journal, ugc approved journal, UGC CARE, UGC CARE list, UGC CARE list of Journal, UGCCARE, care journal list, UGC-CARE list, New UGC-CARE Reference List, New ugc care journal list, Research Journal, Research Journal Publication, Research Paper, Low cost research journal, Free of cost paper publication in Research Journal, High impact factor journal, Journal, Research paper journal, UGC CARE journal, UGC CARE Journals, ugc care list of journal, ugc approved list, ugc approved list of journal, Follow ugc approved journal, UGC CARE Journal, ugc approved list of journal, ugc care journal, UGC CARE list, UGC-CARE, care journal, UGC-CARE list, Journal publication, ISSN approved, Research journal, research paper, research paper publication, research journal publication, high impact factor, free publication, index journal, publish paper, publish Research paper, low cost publication, ugc approved journal, UGC CARE, ugc approved list of journal, ugc care journal, UGC CARE list, UGCCARE, care journal, UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, ugc care list of journal, ugc care list 2020, ugc care approved journal, ugc care list 2020, new ugc approved journal in 2020, ugc care list 2021, ugc approved journal in 2021, Scopus, web of Science.
How start New Journal & software Book & Thesis Publications
Submit Your Paper
Login to Author Home
Communication Guidelines

WhatsApp Contact
Click Here

  Published Paper Details:

  Paper Title

A COMPARATIVE ANALYSIS ON PARALLEL IMPLEMENTATIONS OF DECISION TREE LEARNING FOR LARGE SCALE COMPLEX DATASETS IN APACHE SPARK

  Authors

  Surekha Samsani

  Keywords

Decision Tree classification, Machine Learning, Parallel implementation, Big datasets, Apache Spark.

  Abstract


Decision Tree classification is one of the most widely used machine learning algorithm in dozens of sectors due to its classification effectiveness and model interpretability. However, the voluminous training data and the inherent problem complexity are the two major factors that strongly influence the time and computational efficiencies of the decision tree classification models. As the complexity of the problem increases, the number of computations required to learn every significant pattern will also increase. These enormous computations upsurge the training time and necessitate extensive computing resources especially when the training dataset is huge. To analyze such voluminous and complex datasets at a rapid speed, the need to shift to parallel processing approaches is essential for lowering overall computational costs. In this scenario, this paper provides a comparative analysis of various parallel implementations of decision tree classification models in Apache Spark on three UCI big datasets namely SUSY, PokerHand, and RLCP. The performance exhibited by various parallel implementations is compared in terms of scalability, time complexity and classification performances.

  IJCRT's Publication Details

  Unique Identification Number - IJCRT2105359

  Paper ID - 207049

  Page Number(s) - d248-d255

  Pubished in - Volume 9 | Issue 5 | May 2021

  DOI (Digital Object Identifier) -   

  Publisher Name - IJCRT | www.ijcrt.org | ISSN : 2320-2882

  E-ISSN Number - 2320-2882

  Cite this article

  Surekha Samsani,   "A COMPARATIVE ANALYSIS ON PARALLEL IMPLEMENTATIONS OF DECISION TREE LEARNING FOR LARGE SCALE COMPLEX DATASETS IN APACHE SPARK", International Journal of Creative Research Thoughts (IJCRT), ISSN:2320-2882, Volume.9, Issue 5, pp.d248-d255, May 2021, Available at :http://www.ijcrt.org/papers/IJCRT2105359.pdf

  Share this article

  Article Preview

  Indexing Partners

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
Call For Paper May 2024
Indexing Partner
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
DOI Details

Providing A Free digital object identifier by DOI.one How to get DOI?
For Reviewer /Referral (RMS) Earn 500 per paper
Our Social Link
Open Access
This material is Open Knowledge
This material is Open Data
This material is Open Content
Indexing Partner

Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 7.97 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer