PDF4PRO ⚡AMP

Modern search engine that looking for books and documents around the web

Example: biology

Convolutional Sequence to Sequence Learning - arXiv

Convolutional Sequence to Sequence LearningJonas GehringMichael AuliDavid GrangierDenis YaratsYann N. DauphinFacebook AI ResearchAbstractThe prevalent approach to Sequence to sequencelearning maps an input Sequence to a variablelength output Sequence via recurrent neural net-works. We introduce an architecture based en-tirely on Convolutional neural to recurrent models, computations over allelements can be fully parallelized during trainingto better exploit the GPU hardware and optimiza-tion is easier since the number of non-linearitiesis fixed and independent of the input length. Ouruse of gated linear units eases gradient propaga-tion and we equip each decoder layer with a sep-arate attention module. We outperform the accu-racy of the deep LSTM setup of Wu et al. (2016)on both WMT 14 English-German and WMT 14 English-French translation at an order of magni-tude faster speed, both on GPU and IntroductionSequence to Sequence Learning has been successful inmany tasks such as machine translation, speech recogni-tion (Sutskever et al.)

an order of magnitude faster speed thanWu et al.(2016) on GPU and CPU hardware ( x4, x5). 2. Recurrent Sequence to Sequence Learning Sequence to sequence modeling has been synonymous with recurrent neural network based encoder-decoder ar-chitectures (Sutskever et al.,2014;Bahdanau et al.,2014). The encoder RNN processes an input sequence x =

Loading..

Tags:

  Order

Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Spam in document Broken preview Other abuse

Transcription of Convolutional Sequence to Sequence Learning - arXiv

Related search queries