Accelerate Literature Icon
Want to do a literature review? Try our new Literature Review workflow

Beyond data dependency: FedPET enables robust federated learning via data-free dual-teacher knowledge distillation

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

Beyond data dependency: FedPET enables robust federated learning via data-free dual-teacher knowledge distillation

Similar Papers
  • Research Article
  • Cite Count Icon 7
  • 10.1016/j.infsof.2024.107510
SeDPGK: Semi-supervised software defect prediction with graph representation learning and knowledge distillation
  • Jun 21, 2024
  • Information and Software Technology
  • Wangshu Liu + 6 more

SeDPGK: Semi-supervised software defect prediction with graph representation learning and knowledge distillation

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 3
  • 10.3390/app14114805
A Lightweight Method for Graph Neural Networks Based on Knowledge Distillation and Graph Contrastive Learning
  • Jun 2, 2024
  • Applied Sciences
  • Yong Wang + 1 more

Graph neural networks (GNNs) are crucial tools for processing non-Euclidean data. However, due to scalability issues caused by the dependency and topology of graph data, deploying GNNs in practical applications is challenging. Some methods aim to address this issue by transferring GNN knowledge to MLPs through knowledge distillation. However, distilled MLPs cannot directly capture graph structure information and rely only on node features, resulting in poor performance and sensitivity to noise. To solve this problem, we propose a lightweight optimization method for GNNs that combines graph contrastive learning and variable-temperature knowledge distillation. First, we use graph contrastive learning to capture graph structural representations, enriching the input information for the MLP. Then, we transfer GNN knowledge to the MLP using variable temperature knowledge distillation. Additionally, we enhance both node content and structural features before inputting them into the MLP, thus improving its performance and stability. Extensive experiments on seven datasets show that the proposed KDGCL model outperforms baseline models in both transductive and inductive settings; in particular, the KDGCL model achieves an average improvement of 1.63% in transductive settings and 0.8% in inductive settings when compared to baseline models. Furthermore, KDGCL maintains parameter efficiency and inference speed, making it competitive in terms of performance.

  • Research Article
  • Cite Count Icon 25
  • 10.1016/j.inffus.2024.102748
Vul-LMGNNs: Fusing language models and online-distilled graph neural networks for code vulnerability detection
  • Oct 21, 2024
  • Information Fusion
  • Ruitong Liu + 6 more

Vul-LMGNNs: Fusing language models and online-distilled graph neural networks for code vulnerability detection

  • Research Article
  • Cite Count Icon 27
  • 10.1016/j.neucom.2024.128761
A survey of graph neural networks and their industrial applications
  • Oct 28, 2024
  • Neurocomputing
  • Haoran Lu + 4 more

A survey of graph neural networks and their industrial applications

  • Research Article
  • Cite Count Icon 11
  • 10.1109/jsen.2025.3543588
Real-Time Diagnosis of Abrupt and Incipient Faults in IMU Using a Lightweight CNN-Transformer Hybrid Model
  • Apr 1, 2025
  • IEEE Sensors Journal
  • Jia Song + 2 more

The fault diagnosis is crucial for improving the reliability and safety of industrial sensors. Diagnosing faults in inertial measurement units (IMUs) is particularly challenging due to the complex nature of abrupt and incipient faults, which require the accurate and rapid diagnosis. This article presents a hybrid model that combines convolutional neural networks (CNNs) and Transformer encoder architectures. The CNN component effectively extracts local fault features, while the Transformer encoder captures long-range dependencies in time-series data, enabling the precise and rapid IMU fault diagnosis. To meet the autonomous and real-time operational demands of IMU fault diagnosis, the knowledge distillation is applied to develop a lightweight version of the model. This optimization facilitates efficient deployment on resource-limited hardware, maintaining the original model’s accuracy and rapid processing speed. The effectiveness of the proposed approach is validated through comprehensive comparisons with other models, demonstrating the superior diagnostic accuracy, low fault diagnosis delay, and suitability for real-time applications.

  • Research Article
  • Cite Count Icon 36
  • 10.1145/3711121
Knowledge Distillation on Graphs: A Survey
  • Mar 5, 2025
  • ACM Computing Surveys
  • Yijun Tian + 4 more

Graph Neural Networks (GNNs) have received significant attention for demonstrating their capability to handle graph data. However, they are difficult to be deployed in resource-limited devices because of model sizes and scalability constraints imposed by the multi-hop data dependency. In addition, real-world graphs usually possess complex structural information and features. Therefore, to improve the applicability of GNNs and fully encode the complicated topological information, Knowledge Distillation on Graphs (KDG) has been introduced to build a smaller but effective model, leading to model compression and performance improvement. Recently, KDG has achieved considerable progress, with many studies proposed. In this survey, we systematically review these works. Specifically, we first introduce the challenges and bases of KDG, then categorize and summarize the existing work of KDG by answering the following three questions: (1) what to distillate, (2) who to whom, and (3) how to distillate. We offer in-depth comparisons and elucidate the strengths and weaknesses of each design. Finally, we share our thoughts on future research directions.

  • Conference Article
  • Cite Count Icon 19
  • 10.1109/ipdps53621.2022.00111
Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA
  • May 1, 2022
  • Hongkuan Zhou + 4 more

Temporal Graph Neural Networks (TGNNs) are powerful models to capture temporal, structural, and contextual information on temporal graphs. The generated temporal node embeddings outperform other methods in many downstream tasks. Real-world applications require high performance inference on real-time streaming dynamic graphs. However, these models usually rely on complex attention mechanisms to capture relationships between temporal neighbors. In addition, maintaining vertex memory suffers from intrinsic temporal data dependency that hinders task-level parallelism, making it inefficient on general-purpose processors. In this work, we present a novel model-architecture co-design for inference in memory-based TGNNs on FPGAs. The key modeling optimizations we propose include a light-weight method to compute attention scores and a related temporal neighbor pruning strategy to further reduce computation and memory accesses. These are holistically coupled with key hardware optimizations that leverage FPGA hardware. We replace the temporal sampler with an on-chip FIFO based hardware sampler and the time encoder with a look-up-table. We train our simplified models using knowledge distillation to ensure similar accuracy vis-á-vis the original model. Taking advantage of the model optimizations, we propose a principled hardware architecture using batching, pipelining, and prefetching techniques to further improve the performance. We also propose a hardware mechanism to ensure the chronological vertex updating without sacrificing the computation parallelism. We evaluate the performance of the proposed hardware accelerator on three real-world datasets. The proposed model reduces the computation complexity by 84% and memory accesses by 67% with less than 0.33% accuracy loss. Compared with CPU/GPU, our FPGA accelerator achieves 16.4/2.3× speedup in latency and 0.27% improvement in accuracy compared with the state-of-the-art inference algorithm. To the best of our knowledge, this is the first work that performs model-architecture co-design on memory-based Temporal Graph Neural Networks.

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant