missing @kbin.social
Posts 0
Comments 1
Retentive Network: A Successor to Transformer for Large Language Models
If the claims here are true... wow, research and development are moving very quickly.
Retentive Network: A Successor to Transformer for Large Language Models
How does this compare with RWKV, performance- and architecture-wise? Admittedly, I am still reading the paper.