Example: air traffic controller
Search results with tag "Visual semantic alignments for generating"
Long Short-Term Memory - University of Wisconsin–Madison
pages.cs.wisc.eduKarpathy, Andrej, and Li Fei-Fei. "Deep visual-semantic alignments for generating image descriptions." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015. (Paper introducing image captioning using ConvNet + LSTM)
Deep Visual-Semantic Alignments for Generating Image ...
cs.stanford.eduFigure 2. Overview of our approach. A dataset of images and their sentence descriptions is the input to our model (left). Our model first infers the correspondences (middle, Section3.1) and then learns to generate novel descriptions (right, Section3.2).