Nov 1, 2025 · 1 min read

Flash STU: Fast Spectral Transform Units

Work on efficient Spectral Transform Units for long-context sequence modeling.
The goal is to make sequence models faster and more scalable for language modeling, control, dynamical systems, and long-range prediction tasks.