DeepDiary: Automatic Caption Generation for Lifelogging Image Streams
@article{Fan2016DeepDiaryAC,
  title={DeepDiary: Automatic Caption Generation for Lifelogging Image Streams},
  author={Chenyou Fan and David J. Crandall},
  journal={ArXiv},
  year={2016},
  volume={abs/1608.03819}
}
Lifelogging cameras capture everyday life from a first-person perspective, but generate so much data that it is hard for users to browse and organize their image collections effectively. In this paper, we propose to use automatic image captioning algorithms to generate textual representations of these collections. We develop and explore novel techniques based on deep learning to generate captions for both individual images and image streams, using temporal consistency constraints to create…