A Two-Stage Semi-Supervised High Maneuvering Target Trajectory Data Classification Algorithm

Labeled data in insufficient amounts and missing categories are two observable features for high maneuvering target trajectory data. However, the existing research achievements are insufficient for solving these two problems simultaneously during data classification. This study proposed a two-stage...

Full description

Bibliographic Details
Main Authors: Qing Li, Xintai He, Kun Chen, Qicheng Ouyang
Format: Article
Language:English
Published: MDPI AG 2022-10-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/21/10979
Description
Summary:Labeled data in insufficient amounts and missing categories are two observable features for high maneuvering target trajectory data. However, the existing research achievements are insufficient for solving these two problems simultaneously during data classification. This study proposed a two-stage semi-supervised trajectory data classification algorithm. By pre-training the autoencoder and combining it with the Siamese network, a two-stage joint training was formed, which enabled the model to deal with missing categories by clustering and maintaining the classification ability under the missing label categories. The experimental simulation results showed that the performance of this algorithm was better than the classical semi-supervised algorithm label propagation and transferred learning when the amount of various labeled data was as low as 1–5. The two-stage training model also had a good effect on the problem of missing categories. When 75% of the types were missing, the purity could still reach 82%, which was about eight percentage points higher than the directly trained network. When two problems appeared simultaneously, compared with the directly trained network, the performance improved by about three percentage points on average, and the purity was consistently higher than the clustering results. In summary, this algorithm was more tolerant of the problems of labeled datasets, so it was more practical.
ISSN:2076-3417