Chrome Extension
WeChat Mini Program
Use on ChatGLM

Extracting extended vocal units from two neighborhoods in the embedding plane

biorxiv(2022)

Cited 0|Views6
No score
Abstract
Annotating and proofreading data sets of complex natural behaviors are tedious tasks because instances of a given behavior need to be correctly segmented from background noise and must be classified with minimal false positive error rate. Low-dimensional embeddings have proven very useful for this task because they provide a visually appealing overview of a data set in which relevant clusters appear spontaneously. However, low-dimensional embeddings introduce errors because they fail to preserve high dimensional distances; and embeddings represent only objects of fixed dimensionality, which conflicts with natural objects such as vocalizations that have variable dimensions stemming from their variable durations. To mitigate these issues, we introduce a semi-supervised method for simultaneous segmentation and clustering of vocalizations. We define vocal units of a given type in terms of two density-based regions in low-dimensional embedding space, one associated with onsets and the other with offsets. We demonstrate our approach on the task of clustering adult zebra finch vocalizations embedded into the 2d plane with UMAP. We show that two-neighborhood (2N) extraction allows the identification of short and long vocal renditions from continuous data streams without initially committing to a particular segmentation of the data. Also, 2N vocal extraction achieves much lower false positive error rate than approaches based on a single defining region. ### Competing Interest Statement The authors have declared no competing interest.
More
Translated text
Key words
extended vocal units,neighborhoods
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined