Microsoft LongNet: One BILLION Tokens LLM — David Shapiro ~ AI (06.07.2023)
We could have AI models in a couple years that hold the entire internet in their context window.
You're viewing a single thread.
That's a really bold claim.
Also not sure how that would be helpful. If every prompt needs to rip through those tokens first, before predicting a response, it'll be stupid slow. Even now with llama.cpp, it's annoying when it pauses to do the context window shuffle thing.
Yeah, long term memory where ai can access only what it needs/wants is the way.
For now, I'd be happy with an AI that had access to and remembered the beginning of our conversation.