[HN Gopher] Nystromformer: A Nystrom-Based Algorithm for Approxi...
___________________________________________________________________
Nystromformer: A Nystrom-Based Algorithm for Approximating Self-
Attention
Author : tmfi
Score : 46 points
Date : 2021-02-11 18:42 UTC (4 hours ago)
(HTM) web link (arxiv.org)
(TXT) w3m dump (arxiv.org)
| elcomet wrote:
| Here's a nice video by Yannick Kilcher explaning the
| Nystromformer: https://www.youtube.com/watch?v=m-zrcmRd7E4
|
| The benefits over regular transformers is that it is more
| efficient (does less operations), as the original transformer has
| a quadratic complexity in the number of input tokens.
___________________________________________________________________
(page generated 2021-02-11 23:01 UTC)