Attention
Models
What are the most important components when you write a 5-paragraph essay?
What are some things you do not want to see in an essay?
What are the most important words this sentence for translation?
Simpson’s paradox is a phenomenon in probability and statistics in which a trend appears in several groups of data but disappears or reverses when the groups are combined.
Attention Models
Attention models modify the Encoder-Decoder model by incorporating what is known as attention in the model.
Attention is defined as incorporating the inputs short-term memory from the encoder model to the decoders inputs.
m408.inqs.info/lectures/13a