Nov 1, 20251 min read
Flash STU: Fast Spectral Transform Units
Work on efficient Spectral Transform Units for long-context sequence modeling.

The goal is to make sequence models faster and more scalable for language modeling, control, dynamical systems, and long-range prediction tasks