Res-Tuning Component

[NeurIPS 2023] Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone

arXiv | Project Page

Res-Tuning is a flexible and efficient tuning method. We decouple the design of tuners from the model architecture for flexible combinations, and further extend it to implement a new memory-saving bypass tuner, which greatly reduces memory consumption and multi-task inference cost.

Currently, Res-Tuning is provided as a pluggable tuner algorithm component in SWIFT, which developers can use directly.

Supported Components

  • [x] Res-Adapter

  • [x] Res-Tuning-Bypass

  • [ ] Res-Prefix

  • [ ] Res-Prompt

Usage

Demo

Initialize Tuner

from swift import ResTuningConfig
config = ResTuningConfig(
    dims=768,
    root_modules=r'.*blocks.0$',
    stem_modules=r'.*blocks\.\d+$',
    target_modules=r'norm',
    tuner_cfg='res_adapter'
)
  • dims: The dimensions of the hidden states.

  • root_modules: The root module to be replaced.

  • stem_modules: The stem modules to be replaced.

  • target_modules: The target module to be replaced.

  • tuner_cfg: The configuration of the tuning module.

Load Model

from swift import Swift
import timm, torch
model = timm.create_model("vit_base_patch16_224", pretrained=False, num_classes=100)
model_tune = Swift.prepare_model(model, config)
print(model_tune.get_trainable_parameters())
print(model(torch.ones(1, 3, 224, 224)).shape)

Citation

@inproceedings{jiang2023restuning,
  title={Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone},
  author={Jiang, Zeyinzi and Mao, Chaojie and Huang, Ziyuan and Ma, Ao and Lv, Yiliang and Shen, Yujun and Zhao, Deli and Zhou, Jingren},
  booktitle={Advances in Neural Information Processing Systems},
  year={2023}
}