Text this: Semantic‐guided fusion for multiple object tracking and RGB‐T tracking