Question: 32
(Choose 1 answer)
What is a key component of the Transformer architecture, which allows it to model long-range dependencies
in sequential data?
A. LSTM (Long Short-Term Memory)
B. Attention mechanism
C. Max-pooling layer
D. Convolutional layer