Transformers struggle with generalizing tasks beyond pre-training data
arxiv.org/abs/2311.00871
There is a discussion on Hacker News, but feel free to comment here as well.
That and Starscream is being very uncooperative