Prompt learning

Convolutional Visual Prompt for Robust Visual Perception

Jongmin Lim 2024. 12. 15. 14:55

Abstract

  • Visual prompts는 high-dimension additive vector와 labeled data에 의존
  • 본 논문에서는 Convolution visual prompts(CVP)를 소개

Method

본 논문에서는 convolution으로 생성한 prompt를 통해 deep model을 instruct하는 방법론을 소개한다

$$
x=x+\lambda \text{conv}(x,k)
$$

Convolution prompt의 주요 장점은 patch prompt와 padding prompt와 같은 전통적인 prompt보다 parameter수보다 적다는 것이다.

 

[전체 알고리즘]

 

[Flow of CVP]

 

 

[Low-Rank Visual Prompt]

  • CNN을 이용하여 Prompt를 생성하는대신 Image를 특이 값 분해를 하여 초기 visual prompt를 결정

 

Experiment

4.2 Experimental Results

[Training Cost v.s. Different Kernel Size]

 

[Training Time v.s. Number of Adapt Iteration]

 

[Number of Trainable Parameters]

 


Reference

https://arxiv.org/abs/2303.00198

 

Convolutional Visual Prompt for Robust Visual Perception

Vision models are often vulnerable to out-of-distribution (OOD) samples without adapting. While visual prompts offer a lightweight method of input-space adaptation for large-scale vision models, they rely on a high-dimensional additive vector and labeled d

arxiv.org