Decoupling Angles and Strength in Low-rank Adaptation

Massimo Bini, Leander Girrbach, Zeynep Akata

2025-04-01

Decoupling Angles and Strength in Low-rank Adaptation

Summary

This paper talks about improving a technique called Parameter-Efficient Fine-Tuning (PEFT), which helps AI models quickly learn new tasks without needing a lot of computing power.

What's the problem?

Current PEFT methods aren't always reliable and can be sensitive to specific settings, making it difficult to get the best performance.

What's the solution?

The researchers developed a new method called DeLoRA that separates the 'angle' and 'strength' of the learning process, making it more robust and easier to use.

Why it matters?

This work matters because it can make AI models more adaptable and easier to train, leading to better performance in various tasks.

Abstract

Parameter-Efficient FineTuning (PEFT) methods have recently gained significant popularity thanks to the widespread availability of large-scale pretrained models. These methods allow for quick adaptation to downstream tasks with minimal computational cost. However, popular finetuning methods such as LoRA exhibit limited robustness when it comes to hyperparameter choices or extended training regimes, preventing optimal out-of-the-box performance. In contrast, bounded approaches, such as ETHER, provide greater robustness but are limited to extremely low-rank adaptations and fixed-strength transformations, reducing their adaptation expressive power. In this work, we propose Decoupled Low-rank Adaptation (DeLoRA), a novel finetuning method that normalizes and scales learnable low-rank matrices. By bounding the distance of the transformation, DeLoRA effectively decouples the angular learning from the adaptation strength, enhancing robustness without compromising performance. Through evaluations on subject-driven image generation, natural language understanding, and instruction tuning, we show that DeLoRA matches or surpasses performance of competing PEFT methods, while exhibiting stronger robustness. Code is available at https://github.com/ExplainableML/DeLoRA.

View Paper