In PEFT, each specialized model is created by training a small set of parameters—like Low-Rank Adaptation (LoRA) matrices or adapter modules—on top of a frozen base model. The resulting task vectors (the arithmetic difference between the fine-tuned and base weights) encode distinct capabilities. Model merging performs arithmetic operations, such as linear interpolation or task arithmetic, on these vectors to combine their knowledge into one model without catastrophic interference, enabling a single model to perform multiple tasks efficiently.
Primary Use Cases & Applications
Model merging leverages the compact delta weights from PEFT to combine multiple specialized models into a single, more capable system. This enables efficient multi-task learning, improved generalization, and the creation of foundational multi-purpose models.




