Self-supervised contrastive video-speech representation learning for ultrasound
In medical imaging, manual annotations can be expensive to acquire and sometimes infeasible to access, making conventional deep learning-based models difficult to scale. As a result, it would be beneficial if useful representations could be derived from raw data without the need for manual annotatio...
Huvudupphovsmän: | , , , , , |
---|---|
Materialtyp: | Conference item |
Språk: | English |
Publicerad: |
Springer
2020
|