Summary: | The passenger distribution at an urban rail station reflects that station's characteristics. Rail transit Automatic Fare Collection (AFC) systems continuously record station entries and exits, producing large volumes of passenger trajectory data that support detailed passenger distributions. We first use the AFC data to construct a passenger visit pattern distribution for each station, then measure the similarity between stations with the Wasserstein distance. Unlike other similarity metrics, the Wasserstein distance accounts for how close the values of a quantitative variable are within a one-dimensional distribution, and it can reflect the correlation between dimensions of high-dimensional data. Although this increases the computational cost, the approach remains practical for metro systems: a network has only tens to hundreds of stations, and detailed station modeling can be performed offline. This paper therefore proposes an integrated method for clustering multi-dimensional joint distributions that considers both similarity and correlation. We apply the method to cluster rail transit stations by their passenger visit distributions, which offers insight for passenger flow management and station replanning in urban rail transit.
|
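The pipeline described in the summary (build a distribution per station, compare stations with the Wasserstein distance, then cluster) can be sketched as follows. This is a minimal illustration and not the paper's implementation. It assumes a one-dimensional distribution of entry counts per hour of day, whereas the paper works with multi-dimensional joint distributions. The toy data, the helper names (`station_distance_matrix`, `cluster_stations`, `peak`), and the choice of average-linkage hierarchical clustering are all hypothetical.

```python
import numpy as np
from scipy.stats import wasserstein_distance
from scipy.spatial.distance import squareform
from scipy.cluster.hierarchy import linkage, fcluster

# Assumed support of the 1-D distribution: hour of day (0..23).
HOURS = np.arange(24)


def station_distance_matrix(hist):
    """Pairwise 1-D Wasserstein distances between station histograms.

    hist: (n_stations, 24) array of nonnegative entry counts per hour.
    SciPy normalizes the weights, so raw counts can be passed directly.
    """
    n = hist.shape[0]
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            d = wasserstein_distance(
                HOURS, HOURS, u_weights=hist[i], v_weights=hist[j]
            )
            dist[i, j] = dist[j, i] = d
    return dist


def cluster_stations(hist, n_clusters):
    """Cluster stations from the precomputed distance matrix.

    Average linkage is an assumption made for this sketch only.
    """
    dist = station_distance_matrix(hist)
    tree = linkage(squareform(dist, checks=False), method="average")
    return fcluster(tree, t=n_clusters, criterion="maxclust")


def peak(hour, width=1):
    """Hypothetical toy histogram: flat demand around a single peak hour."""
    counts = np.zeros(24)
    counts[hour - width: hour + width + 1] = 100.0
    return counts


# Two morning-peak stations and two evening-peak stations (made-up data).
toy_hist = np.vstack([peak(8), peak(9), peak(18), peak(17)])
toy_dist = station_distance_matrix(toy_hist)
toy_labels = cluster_stations(toy_hist, n_clusters=2)
```

Because the Wasserstein distance measures how far passenger mass must be moved along the hour axis, the two morning-peak stations end up one hour apart while a morning-peak and an evening-peak station are ten hours apart. A bin-by-bin metric would treat both pairs as equally dissimilar once the histograms stop overlapping. Computing all pairwise distances is quadratic in the number of stations, which stays manageable for the tens to hundreds of stations mentioned in the summary.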