Guiding Long-Short Term Memory for Image Caption Generation (ICCV 2015)
From Captions to Visual Concepts and Back (CVPR 2015)
Rich Image Captioning in the Wild (CVPR 2016 workshop)
Main work: Built on top of a state-of art framework, we developed a deep vision model that detects a broad range of visual concepts, an entity recognition model that identifies celebrities and landmarks ,and a confidence model for the caption output.
MELM+DMSM
maximum entropy language model
deep multimodal similarity model
Guided Open Vocabulary Image Captioning with Constrained Beam Search (NMNLP2017)
Partially-Supervised Image Captioning (NIPS 2018)