Surveillance Video-and-Language Understanding: from Small to Large Multimodal Models
IEEE Transactions on Circuits and Systems for Video Technology (2025)
Keywords
Surveillance, Annotations, Anomaly detection, Visualization, Timing, Large language models, Public transportation, Surveillance video understanding, multimodal learning, dataset annotation, multimodal large language learning