Cluster analysis is a common application of maritime traffic pattern recognition. However, most current methods are unsupervised clustering algorithms that rely solely on the features of the trajectories themselves for clustering, making them susceptible to uneven density distributions. To address this issue, we propose a semisupervised clustering algorithm that utilizes geospatial data to segment trajectories and generate clustering constraints. First, shoreline and anchorage information are employed for trajectory segmentation to enhance the consistency of local trajectory features. Second, cannot-link constraints derived from shoreline data are used to modify the distance between two trajectories, thereby preventing the clustering of trajectories separated by nonnavigable waters into a single cluster. Last, experimental verification is conducted using automatic identification system data, and the results are compared with those of existing algorithms. The results demonstrate more concentrated trajectory segmentation points, easier selection of clustering parameters, and significantly improved evaluation outcomes based on external clustering validity indices.
Read full abstract