Image Captioning with CNN + LSTM

A PyTorch image captioning model with a CNN encoder and an LSTM decoder, trained and evaluated on the VizWiz dataset (7,750 images from people who are blind). Two architectures were designed and compared using BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores.
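
To illustrate what the BLEU-1 through BLEU-4 comparison measures, here is a minimal pure-Python sketch of sentence-level BLEU: clipped n-gram precision, a uniform geometric mean, and a brevity penalty. It is not necessarily how this project computes its scores. The function names (ngrams, modified_precision, bleu) and the example captions are hypothetical. Reported results usually come from a corpus-level implementation, often with smoothing, which this sketch omits.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision of one candidate against several references."""
    cand_counts = ngrams(candidate, n)
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    # Credit each n-gram at most as often as it occurs in any single reference.
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / total


def bleu(candidate, references, max_n):
    """Sentence-level BLEU-max_n with uniform weights and no smoothing."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0  # without smoothing, one zero precision zeroes the score
    log_mean = sum(math.log(p) for p in precisions) / max_n
    # Brevity penalty uses the reference length closest to the candidate length.
    c = len(candidate)
    r = min((len(ref) for ref in references),
            key=lambda length: (abs(length - c), length))
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(log_mean)


# Hypothetical captions, tokenized by whitespace.
candidate = "a man holding a phone".split()
references = [
    "a man is holding a phone".split(),
    "a person holds a phone".split(),
]
for n in range(1, 5):
    print(f"BLEU-{n}: {bleu(candidate, references, n):.3f}")
```

Higher-order scores fall quickly because a generated caption must reproduce longer exact word sequences from a reference, so BLEU-4 separates two architectures more sharply than BLEU-1 does.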