Skip Navigation
InitialsDiceBearhttps://github.com/dicebear/dicebearhttps://creativecommons.org/publicdomain/zero/1.0/„Initials” (https://github.com/dicebear/dicebear) by „DiceBear”, licensed under „CC0 1.0” (https://creativecommons.org/publicdomain/zero/1.0/)GL
Glagnar @programming.dev
Posts 1
Comments 1

Extending Context Window of Large Language Models via Positional Interpolation

Paper released by Meta a few days ago, detailing a method for extending the context or "memory" of an LLM up to 32k tokens. What is interesting is that they give a mention to: https://kaiokendev.github.io/

This is a blog post written by a guy in his spare time who came up with the same method simultaneously, he calls it SuperHOT.

It's really exciting how AI/ Machine learning can be advanced by relatively ordinary people putting in hard work without the resources of Microsoft etc.

0