Stav dette: Self-supervised representation learning for ultrasound video