All tags
Posts tagged with "nlp"
Transformers
Self-attention over the whole sequence ā the modern default.
CNNs for sequences
Convolutional models that read local windows in parallel.
RNNs (LSTM / GRU)
Recurrent models that carry a hidden state across the sequence.
N-gram language models
The classic statistical baseline ā predict the next token from the previous nā1.