Advanced Persistent Threats (APTs) pose a major cyber threat due to their stealthy, long-term nature and intricate complexity, making them particularly challenging to detect. Provenance graphs map interactions between system entities as directed, heterogeneous networks, offering rich semantic information valuable for threat identification. However, most existing approaches rely on static graph analysis, overlooking the critical dynamics of evolving threats. Dynamic graph analysis offers the potential to capture both temporal and structural insights, but current methods focus on single-perspective learning, failing to fully exploit the inherent relationships and evolving patterns within the data. To address this gap, we propose CGL-AD, a Contextualized Graph Learning APT Detector. CGL-AD leverages temporal graph learning to capture subtle temporal changes and overall structural transformations over time. It then integrates hash-based data stream frequency estimation techniques to identify local topological alterations, subsequently feeding these rich embeddings into a powerful sequence learning model. Experimental results on three widely used datasets: Streamspot, Camflow-apt and Shellshock datasets demonstrate that CGL-AD significantly outperforms existing methods. Specifically, CGL-AD outperforms the best baseline FLASH on these datasets by 1.5%, 3.7%, and 5.6% respectively in terms of Receiver Operating Characteristic -Area Under the Curve (ROC-AUC), effectively revealing hidden APT attack patterns in dynamic provenance graphs.
Read full abstract