GLiClass: Generalist Lightweight Model for Sequence Classification Tasks
Ihor Stepanov, Mykhailo Shtopko, Dmytro Vodianytskyi, Oleksandr Lukashov, Alexander Yavorskyi, Mykyta Yaroshenko
2025-08-12
Summary
This paper talks about GLiClass, a new lightweight AI model designed to classify sequences of text efficiently and accurately. It builds on another model called GLiNER and is designed to perform well even when it has never seen examples of the task before (zero-shot) or with very few examples (few-shot). It also adapts a training technique called PPO to handle multi-label text classification when data is limited.
What's the problem?
The problem is that many existing language models used for sequence classification can be slow and require lots of computing power because they process all the text with complicated methods. Also, when there is very little data to train on, especially for tasks where texts can belong to multiple categories at once, it is hard to get accurate results.
What's the solution?
GLiClass solves these issues by using a special architecture inspired by GLiNER that processes labels and input text together in a way that allows rich interaction between them but keeps computation efficient by only needing a single forward pass through the model. The model is trained on synthetic data to generalize well for zero-shot and few-shot scenarios. Additionally, the paper adapts the PPO reinforcement learning method for multi-label tasks to improve performance when there is little data.
Why it matters?
This matters because GLiClass allows faster and more efficient sequence classification without losing accuracy, enabling practical use in real-world applications such as topic classification, sentiment analysis, or as a reranker in systems that combine retrieval with generation. Its ability to work well with scarce training data and in multi-label settings makes it a versatile tool for many AI tasks where resources or data are limited.
Abstract
GLiClass, an adaptation of GLiNER, achieves efficient and accurate sequence classification with zero-shot and few-shot capabilities, and PPO is adapted for multi-label text classification in data-sparse conditions.