National Institute of Technology Warangal / B.Tech
Image Caption Generation Using PoS Guidance (BTP)
Given an image I, the task is to generate a meaningful English sentence S, that correctly describes the contents of the image with proper grammar. It uses a model based on the encoder-decoder framework, convolution neural network (CNN) as the encoder, recurrent neural network (RNN) as the decoding unit.