Journal IJCRT UGC-CARE, UGCCARE( ISSN: 2320-2882 ) | UGC Approved Journal | UGC Journal | UGC CARE Journal | UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, International Peer Reviewed Journal and Refereed Journal, ugc approved journal, UGC CARE, UGC CARE list, UGC CARE list of Journal, UGCCARE, care journal list, UGC-CARE list, New UGC-CARE Reference List, New ugc care journal list, Research Journal, Research Journal Publication, Research Paper, Low cost research journal, Free of cost paper publication in Research Journal, High impact factor journal, Journal, Research paper journal, UGC CARE journal, UGC CARE Journals, ugc care list of journal, ugc approved list, ugc approved list of journal, Follow ugc approved journal, UGC CARE Journal, ugc approved list of journal, ugc care journal, UGC CARE list, UGC-CARE, care journal, UGC-CARE list, Journal publication, ISSN approved, Research journal, research paper, research paper publication, research journal publication, high impact factor, free publication, index journal, publish paper, publish Research paper, low cost publication, ugc approved journal, UGC CARE, ugc approved list of journal, ugc care journal, UGC CARE list, UGCCARE, care journal, UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, ugc care list of journal, ugc care list 2020, ugc care approved journal, ugc care list 2020, new ugc approved journal in 2020, ugc care list 2021, ugc approved journal in 2021, Scopus, web of Science.
How start New Journal & software Book & Thesis Publications

INTERNATIONAL JOURNAL OF CREATIVE RESEARCH THOUGHTS - IJCRT (IJCRT.ORG)

International Peer Reviewed & Refereed Journals, Open Access Journal

IJCRT Peer-Reviewed (Refereed) Journal as Per New UGC Rules.

ISSN Approved Journal No: 2320-2882 | Impact factor: 7.97 | ESTD Year: 2013

Call For Paper - Volume 14 | Issue 3 | Month- March 2026

Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 7.97 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(CrossRef DOI)

Submit Your Paper
Login to Author Home
Communication Guidelines

WhatsApp Contact
Click Here

  Published Paper Details:

  Paper Title

A DJANGO-BASED MACHINE LEARNING WEB APPLICATION TO ENSURE DATA QUALITY IN CSV FILES AND TRAIN A PREDICTIVE MODEL

  Authors

  Mrs G Amala,  Mohammed Saad Mansoor,  Mohammed Huzaifa Mansoor,  Thejas H J

  Keywords

Data Quality, Data Cleaning, Django, Machine Learning, Predictive Model, CSV Data, Web Application.

  Abstract


Ensuring the quality of tabular data is a critical preprocessing step before training reliable predictive models. We present a web-based application built with Django that automates data quality assessment and cleaning on uploaded CSV datasets, followed by training of a supervised learning model. The system ingests user-provided CSV files and analyzes issues such as missing values, outliers, and skewed features, providing visual dashboards summarizing key data quality metrics. Automated cleaning operations (mean/mode imputation, outlier clipping via the IQR rule, log-transform for skewed features, and one-hot encoding of categoricals) are performed using Python libraries (Pandas, SciPy, scikit-learn), producing a cleaned dataset ready for analysis. Users can then select a target variable and train multiple regression/classification algorithms (e.g. linear regression, random forest, k-NN), with the best model highlighted by evaluation metrics (e.g. mean squared error). In a case study on a housing price dataset, the application identified approximately 5.97% missing values, 1.47% outliers, and 25.0% highly skewed features, yielding a composite data quality score of 67.56%. A Random Forest regressor achieved the lowest error among models, consistent with known performance. The interface displays feature correlations and per-unit effect sizes, along with interactive charts. This integrated solution streamlines the machine learning pipeline by combining statistical data cleaning and model training in a single Django-based framework.

  IJCRT's Publication Details

  Unique Identification Number - IJCRT2505289

  Paper ID - 285271

  Page Number(s) - c553-c561

  Pubished in - Volume 13 | Issue 5 | May 2025

  DOI (Digital Object Identifier) -   

  Publisher Name - IJCRT | www.ijcrt.org | ISSN : 2320-2882

  E-ISSN Number - 2320-2882

  Cite this article

  Mrs G Amala,  Mohammed Saad Mansoor,  Mohammed Huzaifa Mansoor,  Thejas H J,   "A DJANGO-BASED MACHINE LEARNING WEB APPLICATION TO ENSURE DATA QUALITY IN CSV FILES AND TRAIN A PREDICTIVE MODEL", International Journal of Creative Research Thoughts (IJCRT), ISSN:2320-2882, Volume.13, Issue 5, pp.c553-c561, May 2025, Available at :http://www.ijcrt.org/papers/IJCRT2505289.pdf

  Share this article

  Article Preview

  Indexing Partners

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
Call For Paper March 2026
Indexing Partner
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
DOI Details

Providing A digital object identifier by DOI.org How to get DOI?
For Reviewer /Referral (RMS) Earn 500 per paper
Our Social Link
Open Access
This material is Open Knowledge
This material is Open Data
This material is Open Content
Indexing Partner

Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 7.97 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer