Source Datasets Research Articles

Network slicing is considered a key enabler for 5G and beyond mobile networks, supporting a variety of new services, including enhanced mobile broadband, ultra-reliable and low-latency communication, and massive connectivity, on the same physical infrastructure. However, this technology increases the susceptibility of networks to cyber threats, particularly Distributed Denial-of-Service (DDoS) attacks. These attacks can degrade service quality by overloading the network function(s) that network slices rely on to operate seamlessly. This calls for an Intrusion Detection System (IDS) as a shield against a wide array of DDoS attacks. In this regard, one promising solution is the use of Deep Learning (DL) models for detecting possible DDoS attacks, an approach that has already made its way into the field given its manifest effectiveness. However, one particular challenge with DL models is that they require large volumes of labeled data for efficient training, which are not readily available in operational networks. A possible workaround is to resort to Transfer Learning (TL) approaches, which apply the knowledge learned from prior training to a target domain with limited labeled data. This paper investigates how Deep Transfer Learning (DTL) based approaches can improve the detection of DDoS attacks in 5G networks by leveraging DL models, such as Bidirectional Long Short-Term Memory (BiLSTM), Convolutional Neural Network (CNN), Residual Network (ResNet), and Inception, as base models. A comprehensive dataset generated in our 5G network slicing testbed, which includes both benign traffic and different types of DDoS attack traffic, serves as the source dataset for DTL. After learning features, patterns, and representations from the source dataset using initial training, we fine-tune base models using a variety of TL processes on a target DDoS attack dataset.
The 5G-NIDD dataset, which contains a small amount of annotated traffic pertaining to several DDoS attacks generated in a real 5G network, is chosen as the target dataset. The results show that the proposed DTL models detect different types of DDoS attacks in the 5G-NIDD dataset more effectively than the same models trained without TL. According to the results, the BiLSTM and Inception models are the top-performing models. BiLSTM shows an improvement of 13.90%, 21.48%, and 12.22% in terms of accuracy, recall, and F1-score, respectively, whereas Inception demonstrates an enhancement of 10.09% in terms of precision, compared to the models that do not adopt TL.
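The pre-train-then-fine-tune workflow described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the tiny one-hidden-layer network, the synthetic data points, and all names (`TinyNet`, `freeze_base`, `source_data`, `target_data`) are assumptions introduced only for illustration, and freezing the base layer is just one of several possible TL processes.

```python
import copy
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class TinyNet:
    """Hypothetical toy model: a hidden 'base' layer plus a logistic 'head'."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = random.Random(seed)
        self.base = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                     for _ in range(n_hidden)]
        self.head = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.bias = 0.0

    def hidden(self, x):
        # Feature extractor: tanh of a linear map (no hidden bias, for brevity).
        return [math.tanh(sum(w * xi for w, xi in zip(row, x)))
                for row in self.base]

    def predict(self, x):
        h = self.hidden(x)
        return sigmoid(sum(w * hi for w, hi in zip(self.head, h)) + self.bias)

    def train(self, data, epochs=50, lr=0.1, freeze_base=False):
        # Plain stochastic gradient descent on the log loss.
        for _ in range(epochs):
            for x, y in data:
                h = self.hidden(x)
                p = sigmoid(sum(w * hi for w, hi in zip(self.head, h)) + self.bias)
                err = p - y  # gradient of the log loss w.r.t. the output logit
                if not freeze_base:
                    # Base weights are updated only during initial training.
                    for j, row in enumerate(self.base):
                        grad = err * self.head[j] * (1.0 - h[j] ** 2)
                        for i in range(len(row)):
                            row[i] -= lr * grad * x[i]
                # The head is always trainable.
                for j in range(len(self.head)):
                    self.head[j] -= lr * err * h[j]
                self.bias -= lr * err


# Synthetic stand-ins (NOT real traffic): label 1 = attack, 0 = benign.
source_data = [([0.0, 0.0], 0), ([0.0, 1.0], 0), ([1.0, 0.0], 1), ([1.0, 1.0], 1)]
target_data = [([0.1, 0.2], 0), ([0.9, 0.8], 1)]  # few labeled target samples

model = TinyNet(n_in=2, n_hidden=3)
model.train(source_data)                      # initial training on the source set
base_before = copy.deepcopy(model.base)
head_before = list(model.head)
model.train(target_data, epochs=20, freeze_base=True)  # fine-tune the head only
```

With `freeze_base=True`, the features learned on the source data are reused unchanged and only the output layer adapts to the small target set. Unfreezing some or all base weights would be another TL variant.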

Scientific names in biodiversity represent one of the oldest identifiers used in science. As a result, a common repetitive task is reconciling a list of scientific names against curated data sources. Reconciliation allows one to determine whether names in a list are spelled correctly, whether they are currently accepted, and what their nomenclatural status is. There are several online and local resources that provide reconciliation services. We share here the potential for interoperability across reconciliation tools. Global Names Verifier (GNverifier), Catalogue of Life, Global Biodiversity Information Facility (GBIF), Taxonomic Name Resolution Service (TNRS), LifeWatch, National Center for Biotechnology Information (NCBI), World Flora Online, Global Biotic Interactions (GloBI), Nomer, Wikidata, and others provide their own tools for name reconciliation. All these tools have their own scope, design decisions, and input and output formats. It is often useful to do reconciliation using several such services, because they often include complementary data. However, with all the idiosyncrasies of services and lack of standardization, it is not an easy task (Islam et al. 2024). It would be great for researchers if all existing and future tools could be standardized. Then moving from one resource to another would be as easy as changing the URL. Implementing elements of Findable, Accessible, Interoperable, and Reusable (FAIR) data management principles would help to create such standards. However, standardizing all existing and future resources to a common interface would be difficult. Some of them have no monetary or programmatic means to modify their code, while others have more urgent priorities. Some resources support a specific research path where adhering to a rigid standard might hinder their innovation. In this paper we suggest achieving interoperability between reconciliation tools by implementing the OpenRefine Reconciliation Service.
OpenRefine is a popular and powerful reconciliation and data cleaning application. It is used by many researchers for data transformation and normalization. Any service that implements the OpenRefine Service can be incorporated into data-management workflows just by providing the service's OpenRefine-compatible URL. Such compatible services can easily be discovered by providing their metadata in the OpenRefine Services Registry. In this paper we discuss our implementation of the OpenRefine Service with the Global Names Verifier (GNverifier) reconciliation tool. GNverifier is developed at the Species File Group as a part of the Global Names Architecture initiative. It offers a powerful, configurable, fast way to reconcile scientific names. GNverifier software aggregates data from more than 100 source datasets. Queries return currently accepted names when provided in a dataset. It allows finding matches for names that historically had several suffixes and can do fuzzy and partial matches. It sorts data by many factors to reliably provide the best available results. With a strong focus on software optimization and a sophisticated matching algorithm, it can process 2000 names a second, making it one of the fastest services available. OpenRefine can use GNverifier directly because it is compatible with the OpenRefine protocol. As shown in Fig. 1, switching between GNverifier and Wikidata reconciliation of scientific names requires only changing the service URL. Implementation of the OpenRefine protocol might solve many standardization problems. Some resources already have it implemented (e.g., Wikidata, GNverifier (Whitmire and Mozzherin 2023), WFO Plant List, IPNI). Many people already use OpenRefine for their other reconciliation needs. For them, the incorporation of name reconciliation would be especially beneficial because it will fit into their existing data-management workflow (Fig. 2). Basic reconciliation by itself is standard by design and a big step forward.
Beyond the basic reconciliation (as seen in Fig. 2) there are more data that researchers are interested in. The Service Protocol allows one to add optional "extended" features. For example, for scientific names, we provide "currently accepted" names, data sources where a name was found, taxonomic classification, etc. (Fig. 3). To make these fields standardized, we would need a recommendation document that describes additional fields and their format. We need interested parties to participate in its creation and agree on its usage. The same would apply to optional input filters, for example for restricting reconciliation to certain data sources or higher taxonomic entities. We think OpenRefine would be a significant step forward for standardization between name-reconciliation tools.
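The idea that switching reconciliation services requires only a URL change can be sketched as follows. This is a minimal illustration that assumes the batch query format of the OpenRefine Reconciliation Service API (a form-encoded `queries` parameter holding a JSON object keyed `q0`, `q1`, ...). The two endpoint URLs are placeholders, not the real GNverifier or Wikidata addresses, and the sample response is invented data used only to show the response shape. No network request is made.

```python
import json
from urllib.parse import urlencode

# Placeholder endpoints (assumptions): substitute the real service URLs.
SERVICE_A = "https://service-a.example.org/reconcile"
SERVICE_B = "https://service-b.example.org/reconcile"


def build_reconcile_request(service_url, names, limit=3):
    """Return (url, form-encoded body) for a batch reconciliation query."""
    queries = {f"q{i}": {"query": name, "limit": limit}
               for i, name in enumerate(names)}
    body = urlencode({"queries": json.dumps(queries)})
    return service_url, body


def best_matches(response_text):
    """Pick the highest-scoring candidate for each query key, or None."""
    response = json.loads(response_text)
    return {
        key: max(payload.get("result", []),
                 key=lambda cand: cand.get("score", 0),
                 default=None)
        for key, payload in response.items()
    }


names = ["Pomatomus saltatrix"]
url_a, body_a = build_reconcile_request(SERVICE_A, names)
url_b, body_b = build_reconcile_request(SERVICE_B, names)

# Invented response in the protocol's shape; ids and scores are hypothetical.
SAMPLE_RESPONSE = json.dumps({
    "q0": {"result": [
        {"id": "hypothetical-id-1", "name": "Pomatomus saltatrix",
         "score": 0.99, "match": True},
        {"id": "hypothetical-id-2", "name": "Pomatomus saltator",
         "score": 0.71, "match": False},
    ]}
})
top = best_matches(SAMPLE_RESPONSE)
```

The request body is identical for both services, so only the URL differs. The extended fields the abstract mentions (accepted names, data sources, classification) are left out here because their format is exactly what the authors say still needs to be agreed upon.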

Articles published on Source Datasets

An Adaptive Transfer Learning Framework for Functional Classification

Cycle-Consistent Adversarial chest X-rays Domain Adaptation for pneumonia diagnosis

Domain adaptation hyperspectral image fusion based on spatial-spectral domain separation

Temporal Protein Complex Identification Based on Dynamic Heterogeneous Protein Information Network Representation Learning.

Attention Retinex Network(A4R-Net) for face detection under low-light environment

DTL-5G: Deep transfer learning-based DDoS attack detection in 5G and beyond networks

Can We Standardize Name Reconciliaton via OpenRefine?

Advancing mRNA subcellular localization prediction with graph neural network and RNA structure.

MetalTrans: A Biological Language Model-Based Approach for Predicting Disease-Associated Mutations in Protein Metal-Binding Sites.

Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition.

Prediction of Super-enhancers Based on Mean-shift Undersampling

AI4LUC: deep learning and automated mask labelling to support land use and land cover mapping in the Cerrado biome

MIGP: Metapath Integrated Graph Prompt Neural Network

Image-level supervision and self-training for transformer-based cross-modality tumor segmentation

Accuracy and transportability of machine learning models for adolescent suicide prediction with longitudinal clinical records

Adaptive centroid prototype-based domain adaptation for fault diagnosis of rotating machinery without source data

Industrial Battery State-of-Health Estimation with Incomplete Limited Data Towards Second-Life Applications

CSP Data: A Data Discovery Web Application of Commercial CSP Plants

Distilling consistent relations for multi-source domain adaptive person re-identification

Transfer-Learning Prediction Model for Low-Cycle Fatigue Life of Bimetallic Steel Bars
