CAPTION GENERATION FROM IMAGES AND VIDEOS TO AID PATIENTS WITH VISUAL AGNOSIA

This project represents an initiative to enhance accessibility and inclusivity for individuals grappling with medical conditions like visual agnosia, a neurological condition characterized by difficulties in recognizing and interpreting visual information. The technical foundation is built upon a sophisticated two-stage architecture. Firstly, the Image Encoder leverages the CLIP encoder, to extract high-level features from images. These features serve as a rich representation of the visual content and are subsequently passed to RNN. The RNN employs LSTM network well-suited for sequential data processing. The LSTM is responsible for decoding the extracted image features into coherent and descriptive textual captions. Furthermore, an integral part of this project is the integration of the gTTS (Google Text-toSpeech) library, which introduces text-to-speech capabilities. This in addition lets the transformation of retrieved textual captions into spoken words, thereby creating a comprehensive and accessible experience for individuals with visual agnosia. The deployment of gTTS not only facilitates generating of audio descriptions but also enables users to customize speech speed, language preferences, and output formats. The system's overarching objective is to provide individuals with visual agnosia a robust and adaptable toolset for interpreting visual content. By combining image feature extraction with sequence generation and auditory synthesis, this project aims to bridge the gap in understanding visual stimuli, empowering users with detailed textual descriptions and spoken narratives. The intricate interplay of deeplearning method, neural network architectures, and library integrations underscores the project's technical complexity and potential impact on lives of individuals facing challenges in visual recognition.

IJCRT's Publication Details

Unique Identification Number - IJCRT2406169

Paper ID - 263272

Page Number(s) - b583-b588

Pubished in - Volume 12 | Issue 6 | June 2024

DOI (Digital Object Identifier) -

Publisher Name - IJCRT | www.ijcrt.org | ISSN : 2320-2882

E-ISSN Number - 2320-2882

Cite this article

Chandan Kumar S, Ujwal T R, Sudipth, Dr. Leena Giri G, "CAPTION GENERATION FROM IMAGES AND VIDEOS TO AID PATIENTS WITH VISUAL AGNOSIA", International Journal of Creative Research Thoughts (IJCRT), ISSN:2320-2882, Volume.12, Issue 6, pp.b583-b588, June 2024, Available at :http://www.ijcrt.org/papers/IJCRT2406169.pdf

Share this article

Article Preview

Indexing Partners

Call For Paper July 2024

Call For Papers
July 2024
Volume 12 | Issue 7
Last Date :
31-Jul-2024
Submit Manuscript Online
Impact Factor: 7.97

Review Results : Within 02-03 Days
Paper Publication : Within 02-03 Days

Published Issue Details

Current Issue Past Issue Conference Proceedings Sample Certificate Sample Publication letter Sample Hardcopy of Journal Sample Paper format CopyRight Transfer Form Undertaking Form

For Authors

Call For Paper Track Submitted Paper Submit Manuscript online Publication Guidelines Publication Charges Pay Charges Online Hardcopy Related DOI List of Research Area

Forms / Downloads

Sample Paper format CopyRight Transfer Form Undertaking Form

Other IMP Links

START A NEW JOURNAL &
JOURNAL SUPPORTING SOFTWARE Publish BOOK, DISSERTATION AND THESIS Best Research Paper Award Conference/ Special Issue Praposal

Indexing Partner

Research Area

Engineering Science & Technology Pharmacy Science All Commerce Arts Medical Science Life Sciences Health Science Social Science and Humanities Managment and Tourism LAW & Education

LICENSE

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

ISSN and 7.97 Impact Factor Details

ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013

Facts & Figures

Impact Factor: 7.97 Issues Per Year: 12 Article Submitted: 35967 Article Published: 7580 No. of contributors: 27154 Total Reviewers: 3580 Total Countries: 52

ISSN and 7.97 Impact Factor Details

ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013

DOI Details

Providing A Free digital object identifier by DOI.one How to get DOI?

CONFERENCE

CONFERENCE MANAGMENT & PUBLICATION CONFERENCE PROPOSAL

RECENT CONFERENCE

CONFERENCE PROPOSAL

CONFERENCE PROCEEDINGS

For Reviewer /Referral (RMS) Earn 500 per paper

About RMS
Editorial Board
Login into RMS Account
Join reviewer/RMS Member

Important Links

All Policy
Major Indexing
Payment Terms
FAQ
Privacy Policy
Copyright infringement claims

NEWS & Conference

Impact Factor: 7.97 Year: 2017

Impact Factor: 7.97 and ISSN Approved

Submit Paper online

Impact Factor: 7.97 and ISSN Approved

Impact Factor: 7.97 Year: 2017

Impact Factor: 7.97 and ISSN Approved

Impact Factor: 7.97 Year: 2017

Submit Paper online

Impact Factor: 7.97 and ISSN Approved

Submit Paper online

Impact Factor: 7.97 Year: 2017

Publication Guidelines

Submit Paper online

Impact Factor: 7.97 Year: 2017

Impact Factor: 7.97 and ISSN Approved

Digital Library

Search Your Paper Details.

IJCRT RMS | Earn 500 Per Paper.

RMS Join and earn 500 per paper.

See Image of rms

Our Social Link

Open Access

LICENSE

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Indexing Partner

Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 7.97 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

IJCRT
Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 7.97 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

Submit Your Paper Any time There No deadline.

Publication fees with free DOI: 1500 INR for Indian author & 55$ for foreign International author.

INTERNATIONAL JOURNAL OF CREATIVE RESEARCH THOUGHTS - IJCRT (IJCRT.ORG)

International Peer Reviewed & Refereed Journals, Open Access Journal

Call For Paper - Volume 12 | Issue 7 | Month- July 2024