Discrete Markov Bridge

Hengli Li, Yuxuan Wang, Song-Chun Zhu, Ying Nian Wu, Zilong Zheng

2025-05-27

Summary

This paper introduces a new method called Discrete Markov Bridge for working with data that comes in separate, countable pieces, like words in text or pixels in images. The method uses two main parts, Matrix Learning and Score Learning, to better understand and generate this kind of data. The authors show that their approach works better than older methods when tested on popular datasets like Text8 and CIFAR-10.

What's the problem?

The main problem is that older methods for modeling discrete data usually use a fixed way to move between data points during training. This limits how well the model can learn the hidden patterns in the data and makes it less flexible and powerful.

What's the solution?

To solve this, the authors created the Discrete Markov Bridge framework. It has two stages: Matrix Learning, which learns how to move from the real data to a hidden, simpler version, and Score Learning, which figures out how to go back from this hidden version to the original data. This two-way process lets the model learn more flexible and useful representations, making it easier to handle complex data.

Why it matters?

This work matters because it makes it possible to model and generate discrete data more accurately and efficiently. By improving how these models learn and represent data, the method can lead to better results in tasks like text and image generation, which are important in many areas of artificial intelligence.

Abstract

A novel framework, Discrete Markov Bridge, is introduced for discrete data modeling with Matrix Learning and Score Learning components, demonstrating superior performance compared to existing methods on Text8 and CIFAR-10 datasets.

View Paper