Sort by
Integration and Recommendation System of Profiles based on Professional Social Networks

The aim of our investigation is to personalize bilateral recommendation of job-related proposals based on existing professional social networks. In a context where the points of view of job seekers and employers can be contradictory, our approach consists in trying to bring the both in a best possible matching. To this end, we propose an integration system that gives a minimum of credit to the users’ data in order to facilitate the discovery of relevant proposals based on the users’ behaviors, on the characteristics of the proposals and on possible relationships. The main contribution is the proposal of an architecture for the recommendation of profiles and job offers including social and administrative factors. The particularity of our approach lies in the freedom from the recommendation problem by using metrics proven in the literature for the estimation of similarity rates. We have used these metrics as default values to appropriate data dimensions. It emerges that, the user’s behavior is exclusively responsible for the recommendations. However, the cross-analysis of randomly generated behaviors on real profiles collected on Cameroonian sites dedicated to job offers, shows the influence of the most active users. But, for requests via the search bar (interface with the script respecting the path of our architecture) the central subject remains the user. Our current work is limited by a data set that is not very representative of changing socio-economic conditions.

Open Access
Relevant
Kriging interpolation model: The problem of predicting the number of deaths due to COVID-19 over time in Vietnam

The COVID-19 pandemic can be considered a human disaster, it has claimed the lives of many people. We only know the number of deaths due to COVID-19 through government statistics, but on days when there are no statistics, how do we know whether people died that day or not? This study aims to predict the number of new deaths per day due to COVID 19 in Vietnam on days when observational data is not available and predict the number of deaths in the future. The study used COVID-19 data from the World Health Organization (WHO). A total of 260 days were collected and the author processed and standardized the data. Based on available data, the author uses Kriging interpolation statistical method to build a forecast model. As a result, the author has selected a prediction model suitable for a highly reliable data set, the regression coefficient and correlation coefficient are close to 1, the error between the model’s prediction results compared to data. There are days when the prediction error is almost zero. The study has built a future forecast map of the number of new deaths per day due to COVID-19. The article concludes that applying the Kriging statistical methodis appropriate for COVID-19 data. This research opens up new research directions for related fields such as earthquakes, mining, groundwater, environment, etc.

Open Access
Relevant
The data preprocessing in improving the classification quality of network intrusion detection systems

Stream-based intrusion detection is a growing problem in computer network security environments. Many previous researches have applied machine learning as a method to detect attacks in network intrusion detection systems. However, these methods still have limitations of low accuracy and high false alarm rate. To improve the quality of classification, this paper proposes two solutions in the data preprocessing stage, that is, the solution of feature selection and resampling of the training dataset before they are used for training the classifiers. This is based on the fact that there is a lot of class imbalanced data in the training dataset used for network intrusion detection systems, as well as that there are many features in the dataset that are irrelevant to the classification goal, this reduces the quality of classification and increases the computation time. The data after preprocessing by the proposed algorithms is used to train the classifiers using different machine learning algorithms including: Decision Trees, Naive Bayes, Logistic Regression, Support Vector Machines, k Nearest Neighbor and Artificial Neural Network. The training and testing results on the UNSW-NB15 dataset show that: as with the Reconnaissance attack type, the proposed feature selection solution for F-Measure achieves 96.31%, an increase of 19.64%; the proposed oversampling solution for F-Measure achieves 6.99%, an increase of 3.17% and the proposed undersampling solution for F-Measure achieves 94.65%, an increase of 11.42%.

Open Access
Relevant