The identification and categorization of circulating tumor cells (CTCs) in peripheral blood are imperative for advancing cancer diagnostics and prognostics. The intricacy of various CTCs subtypes, coupled with the difficulty in developing exhaustive datasets, has impeded progress in this specialized domain. To date, no methods have been dedicated exclusively to overcoming the classification challenges of CTCs. To address this deficit, we have developed CTCDet, a large-scale dataset meticulously annotated based on the distinctive pathological characteristics of CTCs, aimed at advancing the application of deep learning techniques in oncological research. Furthermore, we introduce CTCNet, an innovative hybrid architecture that merges the capabilities of CNNs and Transformers to achieve precise classification of CTCs. This architecture features the Parallel Token mixer, which integrates local window self-attention with large-kernel depthwise convolution, enhancing the network's ability to model intricate channel and spatial relationships. Additionally, the Deformable Large Kernel Attention (DLKAttention) module leverages deformable convolution and large-kernel operations to adeptly delineate the nuanced features of CTCs, substantially boosting classification efficacy. Comprehensive evaluations on the CTCDet dataset validate the superior performance of CTCNet, confirming its ability to outperform other general methods in accurate cell classification. Moreover, the generalizability of CTCNet has been established across various datasets, establishing its robustness and applicability. What is more, our proposed method can lead to clinical applications and provide some help in assisting cancer diagnosis and treatment. Code and Data are available at https://github.com/JasonWu404/CTCs_Classification .
Read full abstract