Abstract

Classification and gradation system adopts different security protection schemes for different types of data by implementing classification and gradation management of data, which is an important pretechnical means for data security protection and prevention of data leakage. This paper introduces artificial intelligence classification, machine learning, and other means to learn and train enterprise documents according to the characteristics of enterprise sensitive data. The generated training model can intelligently identify and classify file streams, improving work efficiency and accuracy of classification and gradation. At the same time, the differences, advantages, and disadvantages of K-NN (K-Nearest Neighbors), DT (Decision Tree), and LinearSVC algorithms are compared. The experimental data shows that LinearSVC algorithm is applicable to high-dimensional data, with discrete, sparse data features and large number of features, which is more suitable for classification of sensitive data of enterprises.

Highlights

  • With the advent of Internet plus era, the status of data as a basic strategic resource has become increasingly prominent [1,2,3]

  • According to the international standards of information security [8], the importance of different data is different, and high-value data requires stricter protection mechanisms. erefore, data is used as the security protection target, and the intricate enterprise data assets are divided into various categories and multiple levels according to the classification and gradation method

  • If in a specific type of test set document the term frequency (TF)-IDF value of the word ti is high but that in other categories is very low or even 0, indicating that the word is of greater importance to this type of document and has a strong ability to classify this type of document, it can be e original text

Read more

Summary

Introduction

With the advent of Internet plus era, the status of data as a basic strategic resource has become increasingly prominent [1,2,3]. Classification and Gradation System Architecture and Deployment e classification and gradation management of enterprise information can enable various sensitive data information of the enterprise to be grasped in a timely, efficient, and accurate manner [19] It is an important pretechnical means for large-scale enterprises to protect data security and prevent data leakage. E DLP system continuously captures and analyzes the traffic on the network by placing a monitor at the outlet of the enterprise’s external network and detects sensitive data and important traffic elements [20] through protocols such as SMTP, FTP, and HTTP to prevent the transfer of sensitive data to the outside The DLP system interacts with the classification and gradation system to compare the fingerprint of the outgoing file with the fingerprint database of the classification and gradation system to predict which category the file belongs to

Materials and Methods
Results and Discussion
Application Verification and Summary
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call