Transformer#
Introduzione#
Encoder
+ Positional Encoding
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Decoder
+ Positional Encoding
Masked Multi-Head Attention
Add & Norm
Multi-Head Attention
Add & Norm
Feed Forward
Add & Norm
Linear
Softmax