Sequence-to-Sequence with RNNs

Screen Shot 2022-07-21 at 8.29.26 PM.png

Context Vector

Problem - Input sequence bottlenecked through fixed-sized vector

Sequence-to-Sequence with RNNs and Attention

Screen Shot 2022-07-21 at 8.41.14 PM.png

Getting the Context Score for each step

Using Different Context vector in each time step of decoder

Compute (scalar) Alignment Scores

$$ e_{t,i}=f_{att}(s_{t-1},h_i)\quad f_{att}\,is\,MLP $$

$$ c_t = \sum_ia_{t,i}h_i $$