Modular Prompt Learning Improves Vision-Language Models
CoRR(2025)
Key words
Vision-language Models,Input Layer,Trainable Parameters,Transformer Layers,Semantic Language,Image Encoder,Text Encoder,Running Time,Test Accuracy,Average Accuracy,ImageNet,Linear Transformation,Baseline Methods,Functional Coupling,Classical Way,Multi-head Self-attention,Image Embedding
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined