Basic Classification Algorithm Research Articles

People of all ages have quickly entered the online world through today's communication technologies such as phones, tablets, computers, and smart devices. As the Internet takes up a larger place in people's lives, social media platforms are diversifying and more users want to take part in them. The growth in the number of social media users brings some negative effects, and the most important of these is cyberbullying. Although cyberbullying can look like everyday dialogue between social media users or groups, it is encountered more often each day as the information, content, and topics shared on social media diversify. As technology develops, a platform that detects bullying with artificial intelligence technologies is needed. One of the biggest difficulties in the text classification problems encountered while developing such platforms is that the artificial intelligence algorithm must be trained with labeled data. In this study, to create the dataset needed for training models that detect cyberbullying, 21 people who actively use social media were selected, including journalists, athletes, scientists, doctors, politicians, comedians, social media personalities, and artists. The public Twitter messages (mentions) of these 21 people were compiled. After repetitive and meaningless messages sent by bot accounts were filtered out of the 10500 compiled tweets, 7706 messages remained in the dataset. The labeling required to use the dataset for training and testing classifiers was carried out by three independent people who had been given preliminary information about cyberbullying (1 = includes cyberbullying, 0 = does not include cyberbullying). The label assigned by the majority of the three annotators was accepted as the final class of each message. The dataset was then preprocessed in accordance with natural language processing principles and made suitable for classification algorithms. The findings obtained with the basic classification algorithms are shared. They indicate that the dataset is suitable for use in the detection and prevention of cyberbullying. In this context, it is predicted that training specially developed and optimized artificial intelligence algorithms on this dataset will greatly increase the success rate of cyberbullying detection.
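The majority-vote labeling step described in this abstract can be illustrated with a minimal Python sketch. The annotation rows and the function name `majority_label` are hypothetical and chosen only for illustration; the abstract does not describe how the authors implemented this step.

```python
from collections import Counter


def majority_label(votes):
    """Return the label chosen by most annotators.

    Assumes an odd number of binary votes so that no tie can occur.
    """
    if len(votes) % 2 == 0:
        raise ValueError("an odd number of votes is needed to avoid ties")
    return Counter(votes).most_common(1)[0][0]


# Hypothetical annotations: one row per message, one value per annotator
# (1 = includes cyberbullying, 0 = does not include cyberbullying).
annotations = [
    (1, 1, 0),
    (0, 0, 0),
    (0, 1, 1),
    (1, 0, 0),
]

final_labels = [majority_label(votes) for votes in annotations]
print(final_labels)  # [1, 0, 1, 0]
```

With three annotators and two classes, one label always receives at least two votes, so the final class of every message is well defined.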

This work addresses the problem of increasing the accuracy of binary classification predictions made with machine learning algorithms. Over the past few decades, systems that combine many machine learning algorithms, known as ensemble models, have received increasing attention in the computational intelligence and machine learning community. This attention is well deserved, as ensemble systems have proven very effective and versatile across a wide range of problem domains and real-world applications. A single algorithm may not make a perfect prediction for a particular dataset, and because every machine learning algorithm has limitations, building a highly accurate model is difficult. Ensembling addresses this problem: building several models and aggregating their results gives a chance to improve overall accuracy. The binary classification information system is built around an ensemble model, which contains a set of unique combinations of basic classifiers that act as algorithmic primitives. The ensemble model can be regarded as a meta-algorithm made up of unique sets of machine learning (ML) classification algorithms. Its task is to find the combination of basic classification algorithms that gives the highest performance, evaluated according to the main ML metrics for classification tasks. Another aspect of the work is the creation of an aggregation mechanism for combining the results of the basic classification algorithms: each unique combination within the ensemble consists of a set of base models (predictors) whose results must be aggregated. In this work, a non-hierarchical clustering method is used to aggregate (average) the predictions of the base models. A distinctive feature of the study is that it computes the correlation coefficients of the base models in each combination. The magnitude of these correlations establishes the relationship between the prediction of each classifier (base model) and the true value, which opens room for further research on improving the ensemble model (meta-algorithm).
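The idea of searching over combinations of base models and relating each model's predictions to the true values can be sketched as follows. This is only an illustration under stated assumptions: the scores, the model names, and all function names are hypothetical, and plain averaging of predicted scores stands in for the non-hierarchical clustering aggregation mentioned in the abstract, whose details are not given here.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y)


def accuracy(y_true, scores, threshold=0.5):
    """Share of samples whose thresholded score matches the true label."""
    preds = [1 if s >= threshold else 0 for s in scores]
    return sum(p == t for p, t in zip(preds, y_true)) / len(y_true)


def average_scores(score_lists):
    """Aggregate base-model scores by simple averaging (assumed stand-in)."""
    return [sum(column) / len(column) for column in zip(*score_lists)]


def best_combination(y_true, base_scores):
    """Try every non-empty combination of base models; keep the most accurate."""
    best = None
    names = sorted(base_scores)
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            averaged = average_scores([base_scores[name] for name in combo])
            acc = accuracy(y_true, averaged)
            if best is None or acc > best[1]:
                best = (combo, acc)
    return best


# Hypothetical true labels and predicted scores from three base models.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
base_scores = {
    "model_a": [0.9, 0.2, 0.6, 0.4, 0.3, 0.6, 0.8, 0.1],
    "model_b": [0.7, 0.4, 0.3, 0.8, 0.2, 0.3, 0.6, 0.6],
    "model_c": [0.6, 0.1, 0.7, 0.7, 0.6, 0.2, 0.4, 0.3],
}

# Correlation of each base model's scores with the true labels.
correlations = {name: pearson(s, y_true) for name, s in base_scores.items()}
best_combo, best_acc = best_combination(y_true, base_scores)
print(correlations)
print(best_combo, best_acc)
```

Because single-model combinations are part of the search, the selected combination is never less accurate on this data than the best individual base model. In practice the search would be scored on held-out data rather than on the data used to fit the models.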

Articles published on Basic Classification Algorithm

Evolutionary simultaneous under and oversampling of instances for dealing with class-imbalance datasets in multilabel problems

Glove-Based Hand Gesture Recognition for Diver Communication.

Creating a New Dataset for the Classification of Cyber Bullying

METHOD OF BUILDING ENSEMBLES OF MODELS FOR DATA CLASSIFICATION BASED ON DECISION CORRELATIONS

Hybrid Graph Neural Network Model Design and Modeling Reasoning for Text Feature Extraction and Recognition

A New Model for Emotions Analysis in Social Network Text Using Ensemble Learning and Deep learning

A novel deep learning model based on convolutional neural networks for employee churn prediction

Bank Deposit Prediction Using Ensemble Learning

The software complex for biochemical indicators monitoring taking into account ecological background of the region

Predicting the revocation of a bank license using machine learning algorithms

IMPT-FRAKEL: A Simple Multi-label Web-server that Only Uses Fingerprints to Identify which Metabolic Pathway Types Compounds can Participate In

Hybrid classification algorithms based on instance filtering

Investigating effects of force and pressure centre signals on stabilogram analysis

IATC-NRAKEL: an efficient multi-label classifier for recognizing anatomical therapeutic chemical classes of drugs.

The Use of Artificial Neural Networks Optimized with Fire Fly Algorithm in Cancer Diagnosis

Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection

Increasing Face Recognition Rates Using Novel Classification Algorithms

Improved classification with allocation method and multiple classifiers

Hierarchical Categorization of Open Source Software by Online Profiles

LVQ-SMOTE – Learning Vector Quantization based Synthetic Minority Over–sampling Technique for biomedical data
