Detecting and Captioning Images Using Deep Neural Networks and Flask
Keywords:
RNN, CNN, LSTM, API, Flask, MSCOCO, NLP Model, Flask REST API, Transfer Learning, VGG Model, TensorFlow, Keras
Abstract
Describing a scene is one of the most important functions of the human visual system. An application that automatically captions the scenes around its user and converts each caption to a plain message has numerous benefits. In this study, we present a model based on CNN-LSTM neural networks that recognizes objects in photos and automatically generates descriptions of them. Object detection relies on multiple pre-trained models applied through transfer learning, and the captions are generated using a CNN and an LSTM. The model therefore performs two tasks: the first is to recognize objects in the image, and the second is to generate a caption for it.
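The caption-generation step can be illustrated with a minimal sketch of a greedy decoding loop. The abstract does not state which decoding strategy is used, so greedy decoding is an assumption here, and `greedy_caption`, `toy_scores`, and the `<start>`/`<end>` tokens are hypothetical names. The `toy_scores` function is only a stand-in for a trained CNN-LSTM decoder, not the authors' model.

```python
from typing import Callable, Dict, List, Sequence

START, END = "<start>", "<end>"  # hypothetical sentence-boundary tokens

ScoreFn = Callable[[Sequence[float], List[str]], Dict[str, float]]


def greedy_caption(features: Sequence[float],
                   next_word_scores: ScoreFn,
                   max_len: int = 20) -> str:
    """Build a caption one word at a time, always taking the top-scoring word."""
    words = [START]
    for _ in range(max_len):
        # In a real system this call would run the LSTM decoder on the
        # CNN image features and the words generated so far.
        scores = next_word_scores(features, words)
        best = max(scores, key=scores.get)
        if best == END:
            break
        words.append(best)
    return " ".join(words[1:])  # drop the start token


def toy_scores(features: Sequence[float], prefix: List[str]) -> Dict[str, float]:
    """Stand-in decoder: replays a fixed word sequence, ignoring the features."""
    script = ["a", "dog", "on", "grass", END]
    target = script[min(len(prefix) - 1, len(script) - 1)]
    return {word: (1.0 if word == target else 0.0) for word in script}


if __name__ == "__main__":
    print(greedy_caption([0.1, 0.2], toy_scores))
```

In a deployed system, `features` would come from a pre-trained CNN such as VGG, and a function like `greedy_caption` could be called from a Flask route that returns the caption as a plain message.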
License
Copyright (c) 2021 Mohammed Saif Altaf Ahmed, Mohurle Vaibhav Jitkumar, Dhumale Kajal Gangadhar
This work is licensed under a Creative Commons Attribution 4.0 International License.