Image Caption Generator Using CNN and LSTM
Davis S Cherian
International Journal of Computational and Electronic Aspects in Engineering
Volume 3: Issue 2, June 2022, pp 26-31
Author's Information
Davis S Cherian1
Corresponding Author
1Student, Department of Computer Science & Engineering, Christian College of Engineering & Technology, Bhilai, Chhattisgarh, India
davischerian29@gmail.com
Abstract:-
This project entitled “Image Caption Generator Using CNN and LSTM” is a work that demonstrates the automated generation of captions for a wide variety of images. This technology is used by major tech-giants like Google, Microsoft, IBM, etc. to generate captions for the huge dataset of images produced over various platforms and social media websites. This project embodies the use of various Artificial Neural Networks namely CNN (Convolutional Neural Networks), RNN (Recurrent Neural Networks) and LSTM (Long Short-Term Memory) Units. The functionality of the model developed using these neural nets. has been rendered to an interactive Web Application for the users to understand the methodology.Index Terms:-
Convolutional Neural Networks, Recurrent Neural Networks, Long Short-Term Memory, Computer Vision, Natural Language Processing.REFERENCES
- J. D. Kelleher, Deep Learning, The MIT Press, 2019.
Online - P. Kim, "Convolutional Neural Network," MATLAB Deep Learning, p. 121–147, 2017.
Crossref - U. Karn, "An Intuitive Explanation of Convolutional Neural Networks," 2016.
Online - M. Glossary, "Activation Functions," 2017.
Online - A. L. Caterini and D. Chang, "Recurrent Neural Networks," Deep Neural Networks in a Mathematical Framework, pp. 59-79, 2018.
Online - Van Houdt, G., Mosquera, C. & Nápoles, G. A review on the long short-term memory model. Artif Intell Rev, Volume 53, 5929–5955, 2020.
Crossref - A. Gulli and S. Pal, Deep Learning with Keras, Packt Publishing Ltd., 2017.
Online
To view full paper, Download here