Nelkit Chavez
AI & Data Science · 2026 · MSc Data Science @ UTS

Image Captioning with CNN + LSTM

A PyTorch image captioning model that pairs a CNN encoder with an LSTM decoder, trained and evaluated on the VizWiz dataset (7,750 images taken by people who are blind). Two architectures were designed and compared using BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores.
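
For readers unfamiliar with the metric, the following is a minimal pure-Python sketch of how cumulative BLEU-1 through BLEU-4 can be computed for one caption. It is not the project's evaluation code: it assumes uniform n-gram weights, whitespace tokenisation, and no smoothing, and the function names, the example caption, and the reference captions are all hypothetical.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it appears in any single reference."""
    cand = ngram_counts(candidate, n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in ngram_counts(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand.items())
    return clipped / total


def brevity_penalty(candidate, references):
    """Penalise candidates shorter than the closest reference length."""
    c = len(candidate)
    if c == 0:
        return 0.0
    # Closest reference length; ties are broken toward the shorter reference.
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    return 1.0 if c >= r else math.exp(1 - r / c)


def bleu(candidate, references, max_n):
    """Cumulative BLEU-max_n with uniform weights and no smoothing (assumed)."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0  # without smoothing, one zero precision zeroes the score
    log_mean = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty(candidate, references) * math.exp(log_mean)


# Hypothetical caption and references, not taken from VizWiz.
candidate = "a bottle of water on a table".split()
references = [
    "a bottle of water on a wooden table".split(),
    "a water bottle sitting on a table".split(),
]
scores = {n: bleu(candidate, references, n) for n in range(1, 5)}
for n, score in scores.items():
    print(f"BLEU-{n}: {score:.4f}")
```

Each BLEU-n here is the geometric mean of the clipped precisions for n-gram orders 1 to n, so the scores cannot increase as n grows. This is why reporting all four gives a fuller picture than BLEU-4 alone: BLEU-1 mostly reflects word choice, while the higher orders reward correct word order.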

Tech stack

Python · PyTorch · CNN · LSTM · BLEU