SP1029 commited on
Commit
fb3d542
·
verified ·
1 Parent(s): 60fd680

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -9,3 +9,15 @@ extra_gated_fields:
9
  I want to use this model for (please provide the reason(s)): text
10
  LoRMA model is free for research use but NOT for commercial use; do you agree if you are provided with the LoRMA model, you will NOT use for any commercial purposes: checkbox
11
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  I want to use this model for (please provide the reason(s)): text
10
  LoRMA model is free for research use but NOT for commercial use; do you agree if you are provided with the LoRMA model, you will NOT use for any commercial purposes: checkbox
11
  ---
12
+ **Title**
13
+
14
+ Low-Rank Multiplicative Adaptation for LLMs
15
+
16
+ **Abstract**
17
+
18
+ Large Language Models have shown remarkable capabilities in the NLP domain.Their effectiveness can mainly be attributed to their ability to adapt to an array of downstream tasks. However, generally, full fine-tuning is a computationally expensive job. To mitigate this, many techniques have been developed that prime efficiency, a prominent one being Low-Rank Adaptation (LoRA). However, LoRA and its variants employ re-parametrized additive updates. In this paper, we propose Low-Rank Multiplicative Adaptation (LoRMA), which shifts the paradigm of additive updates to a richer space of matrix multiplicative transformations. We tackle challenges such as computational complexity and rank bottleneck of matrix multiplication by effectively re-ordering operations and introducing rank inflation strategies. We conduct extensive experiments to demonstrate the effectiveness of our approach in terms of various evaluation metrics.
19
+
20
+ **For more details:**
21
+ - [GitHub Repository](https://github.com/Exploration-Lab/LoRMA)
22
+ - [Summary](https://exploration-lab.github.io/LoRMA/)
23
+ - [Paper](https://arxiv.org/abs/2506.07621)