ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models Paper • 2510.21450 • Published 6 days ago