
Machine Learning Research
The Transformation Continues: Technique boosts transformer performance on long sequences.
Transformer networks are gaining popularity as a high-accuracy alternative to recurrent neural networks. But they can run slowly when they’re applied to long sequences.