How would the code change if one removed the for loop over the sequence elements? Perhaps it's more efficient to transpose the input vector?

Riccardo Di Sipio

Written by Riccardo Di Sipio

Senior Machine Learning developer at Dayforce. NLP, LLMs, graph neural networks. Formerly physicist at U Toronto, Bologna, CERN LHC/ATLAS.