Transcription of arXiv:1507.05717v1 [cs.CV] 21 Jul 2015
{{id}} {{{paragraph}}}
An End-to-End Trainable Neural Network for Image-based SequenceRecognition and Its Application to Scene Text RecognitionBaoguang Shi, Xiang Bai and Cong YaoSchool of Electronic Information and CommunicationsHuazhong University of Science and Technology, Wuhan, sequence recognition has been a long-standing research topic in computer vision. In this pa-per, we investigate the problem of scene text recognition,which is among the most important and challenging tasksin image-based sequence recognition.
2. The Proposed Network Architecture The network architecture of CRNN, as shown in Fig.1, consists of three components, including the convolutional layers, the recurrent layers, and a transcription layer, from bottom to top. At the bottom of CRNN, the convolutional layers auto-matically extract a feature sequence from each input image.
Domain:
Source:
Link to this page:
Please notify us if you found a problem with this document:
{{id}} {{{paragraph}}}