Anfonwch hwn fel neges destun: Object level grouping for video shots