Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Vadim Kurochkin, Yaroslav Aksenov, Daniil Laptev, Daniil Gavrilov, Nikita Balagansky

2025-05-30

Summary

This paper introduces KronSAE, a new kind of sparse autoencoder that uses a mathematical technique called Kronecker product decomposition to make training faster and more efficient, along with a special activation function, mAND, that makes the features it learns easier to understand.

What's the problem?

The problem is that sparse autoencoders, which are tools used to break down complex data into simpler, more meaningful parts, are expensive to train, and the features they learn can be hard to interpret, especially when many of those features are correlated with each other.

What's the solution?

The researchers designed KronSAE, which uses Kronecker product decomposition to factor the encoder into smaller pieces so the autoencoder trains faster, and they added mAND, a differentiable binary AND function, to help the model pick out and explain important features more clearly.
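The summary doesn't spell out how the Kronecker factorization saves work, so here is a minimal, hypothetical sketch of the general idea: a large encoder weight matrix is replaced by the Kronecker product of two small factors, which cuts the parameter count and lets the encoding be computed without ever materializing the full matrix. All sizes, variable names, and the `soft_and` placeholder below are illustrative assumptions, not the paper's actual architecture or its mAND definition.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes (not from the paper): 64-dim input, 256 latent features.
d_in, d_latent = 64, 256

# A full encoder weight would have d_latent * d_in = 16384 parameters.
# Kronecker factorization: W = kron(A, B) with two small factors,
# so only A.size + B.size = 256 parameters are actually learned.
A = rng.standard_normal((16, 8))
B = rng.standard_normal((16, 8))
W = np.kron(A, B)          # shape (256, 64), never needed in practice

x = rng.standard_normal(d_in)

# Naive encode: materialize the full Kronecker matrix.
z_naive = W @ x

# Efficient encode via the row-major identity
# kron(A, B) @ x == (A @ X @ B.T).reshape(-1), X = x.reshape(8, 8).
X = x.reshape(A.shape[1], B.shape[1])
z_fast = (A @ X @ B.T).reshape(-1)

assert np.allclose(z_naive, z_fast)

# Placeholder for a differentiable AND of two feature activations;
# the paper's actual mAND function may be defined quite differently.
def soft_and(u, v):
    return np.maximum(u, 0.0) * np.maximum(v, 0.0)
```

The reshaping trick turns one large matrix-vector product into two small matrix multiplications, which is where the training speedup of a Kronecker-structured encoder comes from.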

Why does it matter?

This is important because it means we can train models that are both faster and easier to understand, which helps scientists and engineers figure out what their AI is really learning and makes these tools more useful for real-world problems.

Abstract

KronSAE, a novel architecture using Kronecker product decomposition, enhances efficiency in training Sparse Autoencoders, while mAND, a differentiable binary AND function, improves interpretability and performance.