Self-supervised contrastive video-speech representation learning for ultrasound
In medical imaging, manual annotations can be expensive to acquire and sometimes infeasible to access, making conventional deep learning-based models difficult to scale. As a result, it would be beneficial if useful representations could be derived from raw data without the need for manual annotatio...
Main Authors: | , , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
Springer
2020
|