
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Vaishnavh Nagarajan, Chen Henry Wu, Charles Ding, Aditi Raghunathan

2025-04-22


Summary

This paper explores new ways to make language models more creative by changing how they predict and generate text, moving beyond picking the next word one at a time.

What's the problem?

Most language models, like the ones behind chatbots, generate text one token at a time, choosing each word based only on what came before. This myopic process limits how original the output can be: without a way to plan ahead or inject diversity early, the model struggles to produce genuinely surprising or interesting ideas.

What's the solution?

The researchers experimented with two kinds of changes: multi-token approaches, where the model learns to produce several tokens at once (so it must commit to a broader plan rather than just the next word), and a technique called hash-conditioning, where a random seed string is prepended to the input so that diversity comes from the seed itself rather than from softmax sampling at every step. Together, these changes let the model be more flexible and creative when coming up with responses or stories.
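The hash-conditioning idea can be illustrated with a small sketch. Here, `toy_greedy_model` is a hypothetical stand-in for a model decoding deterministically (greedily); it is not the paper's actual model. The point is that randomness enters once, through a random seed prefix on the input, instead of through per-token softmax sampling:

```python
import hashlib
import random

# Possible story continuations the toy "model" can emit.
CONTINUATIONS = ["a knight", "a dragon", "a wizard", "a thief"]

def toy_greedy_model(text: str) -> str:
    # Hypothetical stand-in for greedy decoding: a deterministic
    # function of its full input, so the same input always yields
    # the same output (no sampling at any step).
    h = int(hashlib.sha256(text.encode()).hexdigest(), 16)
    return CONTINUATIONS[h % len(CONTINUATIONS)]

def hash_conditioned_generate(prompt: str, seed_len: int = 8) -> str:
    # Hash-conditioning (sketch): prepend a random seed string to the
    # prompt. Decoding stays fully deterministic; output diversity
    # comes entirely from the seed in the input.
    seed = "".join(random.choice("0123456789abcdef") for _ in range(seed_len))
    return toy_greedy_model(seed + "|" + prompt)
```

Calling `hash_conditioned_generate("Once upon a time there was")` repeatedly yields varied continuations even though the decoder itself never samples, which is the separation the technique relies on.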

Why it matters?

This matters because it could help AI produce more imaginative, engaging, and useful content, making it better at creative writing, brainstorming, and other tasks where originality is important.

Abstract

Minimal algorithmic tasks reveal that multi-token approaches and hash-conditioning improve open-ended creativity in language models beyond next-token learning and softmax-based sampling.