Transcription of A arXiv:1409.0473v7 [cs.CL] 19 May 2016
{{id}} {{{paragraph}}}
Published as a conference paper at ICLR 2015 NEURALMACHINETRANSLATIONBYJOINTLYLEARNIN G TOALIGN ANDTRANSLATED zmitry BahdanauJacobs University Bremen, GermanyKyungHyun ChoYoshua Bengio Universit e de Montr ealABSTRACTN eural machine translation is a recently proposed approach to machine transla-tion. Unlike the traditional statistical machine translation, the neural machinetranslation aims at building a single neural network that can be jointly tuned tomaximize the translation performance. The models proposed recently for neu-ral machine translation often belong to a family of encoder decoders and encodea source sentence into a fixed-length vector from which a decoder generates atranslation. In this paper, we conjecture that the use of a fixed-length vector is abottleneck in improving the performance of this basic encoder decoder architec-ture, and propose to extend this by allowing a model to automatically (soft-)searchfor parts of a source sentence that are relevant to predicting a target word, withouthaving to form these parts as a hard segment explicitly.
to align and translate simultaneously. In the Encoder–Decoder framework, an encoder reads the input sentence, a sequence of vectors x = (x 1; ;x T x), into a vector c.2 The most common approach is to use an RNN such that h t = f(x t;h t 1) (1) and c= q(fh 1; ;h T x g); where h t 2Rn is a hidden state at time t, and cis a vector generated from ...
Domain:
Source:
Link to this page:
Please notify us if you found a problem with this document:
{{id}} {{{paragraph}}}