Journal IJCRT UGC-CARE, UGCCARE( ISSN: 2320-2882 ) | UGC Approved Journal | UGC Journal | UGC CARE Journal | UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, International Peer Reviewed Journal and Refereed Journal, ugc approved journal, UGC CARE, UGC CARE list, UGC CARE list of Journal, UGCCARE, care journal list, UGC-CARE list, New UGC-CARE Reference List, New ugc care journal list, Research Journal, Research Journal Publication, Research Paper, Low cost research journal, Free of cost paper publication in Research Journal, High impact factor journal, Journal, Research paper journal, UGC CARE journal, UGC CARE Journals, ugc care list of journal, ugc approved list, ugc approved list of journal, Follow ugc approved journal, UGC CARE Journal, ugc approved list of journal, ugc care journal, UGC CARE list, UGC-CARE, care journal, UGC-CARE list, Journal publication, ISSN approved, Research journal, research paper, research paper publication, research journal publication, high impact factor, free publication, index journal, publish paper, publish Research paper, low cost publication, ugc approved journal, UGC CARE, ugc approved list of journal, ugc care journal, UGC CARE list, UGCCARE, care journal, UGC-CARE list, New UGC-CARE Reference List, UGC CARE Journals, ugc care list of journal, ugc care list 2020, ugc care approved journal, ugc care list 2020, new ugc approved journal in 2020, ugc care list 2021, ugc approved journal in 2021, Scopus, web of Science.
How start New Journal & software Book & Thesis Publications
Submit Your Paper
Login to Author Home
Communication Guidelines

WhatsApp Contact
Click Here

  Published Paper Details:

  Paper Title

VISION & VOICE-ENABLED AI DOCTOR: AN INTELLIGENT DIAGNOSTIC FRAMEWORK

  Authors

  C.Rambabu,  Musanalli Bugude Devendra Kumar,  Banda Sekar,  Karanam Lakshmi Narasimha Bhargava,  Dasari Mahesh

  Keywords

Voice-based diagnosis, Medical image analysis, Artificial Intelligence (AI), OpenAI API, ElevenLabs, Deep learning, Speech-to-text, Vision AI, Multimodal input, Text-to-speech (TTS), Flask, Gradio, Real-time healthcare, Diagnosis automation, Python, Remote medical consultation

  Abstract


AI Doctor 2.0 is an AI-based diagnostic framework that integrates both voice and vision inputs to simulate intelligent real-time doctor-like consultations. The system uses OpenAI's GPT and Vision models for understanding natural language and interpreting medical images like Xrays or skin rashes. ElevenLabs API is used to generate human-like speech for voice-based diagnosis feedback. The platform allows patients to describe symptoms through speech or upload an image, which is then analyzed using pretrained AI models. Voice input is processed via speech-to-text conversion, and images are encoded and analyzed using a multimodal large language model. The AI returns a diagnosis, which is both displayed and read out to the user, increasing accessibility. The system is implemented using Python, Flask/FastAPI, and Gradio, allowing real-time interaction and easy deployment. Testing included real-world inputs to evaluate system accuracy, performance, and output quality. The results demonstrate the effectiveness of combining NLP and vision analysis in healthcare applications. This framework can be extended to include multilingual support, medical report generation, and IoT health device integration in future versions. AI Doctor 2.0 serves as a promising step towards accessible, intelligent, and scalable AI-powered digital healthcare platforms for remote diagnosis and primary consultation support

  IJCRT's Publication Details

  Unique Identification Number - IJCRT2504480

  Paper ID - 282136

  Page Number(s) - e130-e135

  Pubished in - Volume 13 | Issue 4 | April 2025

  DOI (Digital Object Identifier) -   

  Publisher Name - IJCRT | www.ijcrt.org | ISSN : 2320-2882

  E-ISSN Number - 2320-2882

  Cite this article

  C.Rambabu,  Musanalli Bugude Devendra Kumar,  Banda Sekar,  Karanam Lakshmi Narasimha Bhargava,  Dasari Mahesh,   "VISION & VOICE-ENABLED AI DOCTOR: AN INTELLIGENT DIAGNOSTIC FRAMEWORK", International Journal of Creative Research Thoughts (IJCRT), ISSN:2320-2882, Volume.13, Issue 4, pp.e130-e135, April 2025, Available at :http://www.ijcrt.org/papers/IJCRT2504480.pdf

  Share this article

  Article Preview

  Indexing Partners

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
Call For Paper November 2025
Indexing Partner
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
ISSN and 7.97 Impact Factor Details


ISSN
ISSN
ISSN: 2320-2882
Impact Factor: 7.97 and ISSN APPROVED
Journal Starting Year (ESTD) : 2013
ISSN
DOI Details

Providing A digital object identifier by DOI.org How to get DOI?
For Reviewer /Referral (RMS) Earn 500 per paper
Our Social Link
Open Access
This material is Open Knowledge
This material is Open Data
This material is Open Content
Indexing Partner

Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 7.97 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer
indexer