Cross-Modal Distillation for Speaker Recognition.
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11(2023)
Key words
Audio-Visual Speech Recognition,Feature Learning,Speaker Verification,Speaker Diarization,End-to-End Speech Recognition
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined