Transcription of Abstract - arXiv
Two-Stream Convolutional Networks for Action Recognition in Videos

Karen Simonyan, Andrew Zisserman
Visual Geometry Group, University of Oxford

We investigate architectures of discriminatively trained deep Convolutional Networks (ConvNets) for action recognition in video. The challenge is to capture the complementary information on appearance from still frames and motion between frames. We also aim to generalise the best performing hand-crafted features within a data-driven learning framework.

Our contribution is three-fold. First, we propose a two-stream ConvNet architecture which incorporates spatial and temporal networks. Second, we demonstrate that a ConvNet trained on multi-frame dense optical flow is able to achieve very good performance in spite of limited training data.
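To make the idea of a multi-frame dense optical flow input concrete, here is a minimal sketch of how several consecutive flow fields might be stacked into a single multi-channel array for a temporal network. The function name `stack_optical_flow`, the (H, W, 2) shape of each flow field, and the interleaved channel order are assumptions made for illustration; they are not taken from the text above. Computing the flow itself is out of scope here.

```python
import numpy as np


def stack_optical_flow(flows):
    """Stack L dense flow fields of shape (H, W, 2) into one (H, W, 2L) array.

    Assumed layout (illustrative only): channel 2k holds the horizontal
    displacement of the k-th flow field, channel 2k + 1 the vertical one.
    """
    flows = [np.asarray(f, dtype=np.float32) for f in flows]
    if not flows:
        raise ValueError("need at least one flow field")
    shape = flows[0].shape
    if len(shape) != 3 or shape[2] != 2:
        raise ValueError("each flow field must have shape (H, W, 2)")
    if any(f.shape != shape for f in flows):
        raise ValueError("all flow fields must share the same shape")
    # Concatenating along the last axis yields the channel order
    # [dx_0, dy_0, dx_1, dy_1, ...].
    return np.concatenate(flows, axis=-1)
```

With L flow fields the result has 2L channels, so a temporal network that consumes it would need a first layer accepting 2L input channels instead of the 3 colour channels of a still frame.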
Trajectory stacking. An alternative motion representation, inspired by the trajectory-based descriptors [29], replaces the optical flow, sampled at the …