What rank is used in LoRA fine-tuning?

Nick Matton 0 Reputation points Microsoft Employee
2025-01-11T20:47:25.69+00:00

I previously fine-tuned a few instances of GPT-4 using the standard Azure ML training flow. I have read online that Azure ML uses low-rank approximation (LoRA) to optimize fine-tuning.

I am wondering what rank is used in the LoRA fine-tuning process, so that I can estimate the memory/FLOPs saved by using Azure ML's LoRA fine-tuning instead of running full backpropagation on the model.

Thanks!

Azure Machine Learning

1 answer

Sort by: Most helpful
  1. Saideep Anchuri 935 Reputation points Microsoft Vendor
    2025-01-12T11:36:48.2233333+00:00

    Hi Nick Matton

    Welcome to Microsoft Q&A Forum, thank you for posting your query here!

    The LoRA (Low-Rank Adaptation) fine-tuning process generally employs a rank between 4 and 32. This range maintains effective performance while greatly reducing the number of trainable parameters and the overall memory requirements. With LoRA, you can often attain performance levels close to those achieved with full fine-tuning.

    Kindly refer to the link below: huggingface

    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, do click Accept Answer and Yes for "Was this answer helpful". And, if you have any further queries, do let us know.

    Thank You.
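As a rough illustration of the savings the answer describes: LoRA freezes the original weight matrix W (d_in × d_out) and trains only two low-rank factors A (d_in × r) and B (r × d_out). The sketch below is not Azure-specific and the hidden size is a hypothetical, roughly GPT-scale assumption; it just compares trainable-parameter counts for a single projection layer at the ranks mentioned above.

```python
# Illustrative sketch (not Azure-specific): trainable parameters for
# full fine-tuning of one weight matrix vs. a LoRA adapter of rank r.

def lora_trainable_params(d_in: int, d_out: int, r: int) -> int:
    """LoRA trains two low-rank factors: A (d_in x r) and B (r x d_out)."""
    return d_in * r + r * d_out

def full_trainable_params(d_in: int, d_out: int) -> int:
    """Full fine-tuning updates the entire d_in x d_out matrix."""
    return d_in * d_out

# Hypothetical hidden size, roughly GPT-scale (an assumption, not a
# documented Azure ML value).
d_in = d_out = 12288

for r in (4, 8, 16, 32):
    lora = lora_trainable_params(d_in, d_out, r)
    full = full_trainable_params(d_in, d_out)
    print(f"rank={r:2d}: LoRA {lora:,} params vs full {full:,} "
          f"({100 * lora / full:.3f}% of full)")
```

Even at rank 32, the adapter for this layer trains well under 1% of the parameters that full fine-tuning would update, which is where the memory (optimizer state, gradients) and backward-pass FLOPs savings come from.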

