Think Before You Speak: Training Language Models with Pause Tokens
Think Before You Speak: Training Language Models with Pause Tokens
arxiv.org /abs/2310.02226
There is a discussion on Hacker News, but feel free to comment here as well.
0 comments