Journal IJCRT UGC-CARE, UGCCARE( ISSN: 2320-2882 ) | UGC Approved Journal | UGC Journal | UGC CARE Journal | UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, International Peer Reviewed Journal and Refereed Journal, ugc approved journal, UGC CARE, UGC CARE list, UGC CARE list of Journal, UGCCARE, care journal list, UGC-CARE list, New UGC-CARE Reference List, New ugc care journal list, Research Journal, Research Journal Publication, Research Paper, Low cost research journal, Free of cost paper publication in Research Journal, High impact factor journal, Journal, Research paper journal, UGC CARE journal, UGC CARE Journals, ugc care list of journal, ugc approved list, ugc approved list of journal, Follow ugc approved journal, UGC CARE Journal, ugc approved list of journal, ugc care journal, UGC CARE list, UGC-CARE, care journal, UGC-CARE list, Journal publication, ISSN approved, Research journal, research paper, research paper publication, research journal publication, high impact factor, free publication, index journal, publish paper, publish Research paper, low cost publication, ugc approved journal, UGC CARE, ugc approved list of journal, ugc care journal, UGC CARE list, UGCCARE, care journal, UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, ugc care list of journal, ugc care list 2020, ugc care approved journal, ugc care list 2020, new ugc approved journal in 2020, ugc care list 2021, ugc approved journal in 2021, Scopus, web of Science.
How start New Journal & software Book & Thesis Publications
Submit Your Paper
Login to Author Home
Communication Guidelines

WhatsApp Contact
Click Here

  Published Paper Details:

  Paper Title

CAPTION GENERATION FROM IMAGES AND VIDEOS TO AID PATIENTS WITH VISUAL AGNOSIA

  Authors

  Chandan Kumar S,  Ujwal T R,  Sudipth,  Dr. Leena Giri G

  Keywords

CLIP, CNN, NLP

  Abstract


This project represents an initiative to enhance accessibility and inclusivity for individuals grappling with medical conditions like visual agnosia, a neurological condition characterized by difficulties in recognizing and interpreting visual information. The technical foundation is built upon a sophisticated two-stage architecture. Firstly, the Image Encoder leverages the CLIP encoder, to extract high-level features from images. These features serve as a rich representation of the visual content and are subsequently passed to RNN. The RNN employs LSTM network well-suited for sequential data processing. The LSTM is responsible for decoding the extracted image features into coherent and descriptive textual captions. Furthermore, an integral part of this project is the integration of the gTTS (Google Text-toSpeech) library, which introduces text-to-speech capabilities. This in addition lets the transformation of retrieved textual captions into spoken words, thereby creating a comprehensive and accessible experience for individuals with visual agnosia. The deployment of gTTS not only facilitates generating of audio descriptions but also enables users to customize speech speed, language preferences, and output formats. The system's overarching objective is to provide individuals with visual agnosia a robust and adaptable toolset for interpreting visual content. By combining image feature extraction with sequence generation and auditory synthesis, this project aims to bridge the gap in understanding visual stimuli, empowering users with detailed textual descriptions and spoken narratives. The intricate interplay of deeplearning method, neural network architectures, and library integrations underscores the project's technical complexity and potential impact on lives of individuals facing challenges in visual recognition.

  IJCRT's Publication Details

  Unique Identification Number - IJCRT2406169

  Paper ID - 263272

  Page Number(s) - b583-b588

  Pubished in - Volume 12 | Issue 6 | June 2024

  DOI (Digital Object Identifier) -   

  Publisher Name - IJCRT | www.ijcrt.org | ISSN : 2320-2882

  E-ISSN Number - 2320-2882

  Cite this article

  Chandan Kumar S,  Ujwal T R,  Sudipth,  Dr. Leena Giri G,   "CAPTION GENERATION FROM IMAGES AND VIDEOS TO AID PATIENTS WITH VISUAL AGNOSIA", International Journal of Creative Research Thoughts (IJCRT), ISSN:2320-2882, Volume.12, Issue 6, pp.b583-b588, June 2024, Available at :http://www.ijcrt.org/papers/IJCRT2406169.pdf

  Share this article

  Article Preview

  Indexing Partners

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
Call For Paper July 2024
Indexing Partner
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
DOI Details

Providing A Free digital object identifier by DOI.one How to get DOI?
For Reviewer /Referral (RMS) Earn 500 per paper
Our Social Link
Open Access
This material is Open Knowledge
This material is Open Data
This material is Open Content
Indexing Partner

Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 7.97 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer