Feature Selection Using Submodular Approach for Financial Big Data

Girija Attigeri, M M Manohara Pai, Radhika M Pai

doi:10.3745/jips.04.0149

Abstract

As the world moves towards digitization, data is generated from various sources at an ever faster rate. The resulting volume is enormous and is termed big data. The financial sector is one domain that needs to leverage this data to identify financial risks, fraudulent activities, and so on. Designing predictive models for such financial big data is imperative for maintaining the health of a country's economy. Financial data has many features, such as transaction history, repayment data, purchase data, and investment data. The main problem in predictive modeling is finding the right subset of representative features from which a model can be constructed for a particular task. This paper proposes a correlation-based method that uses submodular optimization to select the optimum number of features, thereby reducing the dimensionality of the data for faster and better prediction. The key proposition is that the optimal feature subset should contain features that correlate highly with the class label but not with each other. Experiments are conducted on loan data to understand the effect of the various subsets on different classification algorithms. The IBM Bluemix Big Data platform is used for experimentation, along with the Spark notebook. The results indicate that the proposed approach achieves considerable accuracy with optimal subsets in significantly less execution time. The algorithm is also compared with existing feature selection and extraction algorithms.
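The abstract does not give the exact objective the authors optimize, so the following is only a minimal sketch of the general idea (high correlation with the class label, low correlation among selected features). It assumes an objective of the form F(S) = sum of |corr(f, y)| over f in S, minus `lam` times the sum of |corr(f, g)| over pairs in S. This function is submodular because the pairwise penalties are nonnegative. The function name `greedy_select`, the trade-off weight `lam`, and the stopping rule are illustrative assumptions, not the paper's algorithm.

```python
import numpy as np


def greedy_select(X, y, k, lam=1.0):
    """Greedily pick up to k feature indices (hypothetical helper, not from the paper).

    Assumed objective: F(S) = sum_{f in S} |corr(f, y)|
                              - lam * sum_{f < g in S} |corr(f, g)|.
    Assumes every column of X is non-constant (otherwise correlations are NaN).
    """
    n_features = X.shape[1]
    # Relevance: absolute Pearson correlation of each feature with the label.
    relevance = np.array(
        [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(n_features)]
    )
    # Redundancy: absolute pairwise correlation between features.
    redundancy = np.abs(np.corrcoef(X, rowvar=False))

    selected = []
    remaining = set(range(n_features))
    # penalty[j] accumulates |corr(j, s)| over the features s selected so far.
    penalty = np.zeros(n_features)

    while remaining and len(selected) < k:
        # Marginal gain of adding j to the current subset.
        gains = {j: relevance[j] - lam * penalty[j] for j in remaining}
        best = max(gains, key=gains.get)
        if gains[best] <= 0:
            break  # no remaining feature improves the objective
        selected.append(best)
        remaining.remove(best)
        penalty += redundancy[:, best]
    return selected
```

Given a feature matrix `X` of shape (samples, features) and a numeric label vector `y`, a call such as `greedy_select(X, y, k=10)` returns the chosen column indices in selection order. Because each marginal gain shrinks as the subset grows, a near-duplicate of an already selected feature is penalized heavily and tends to be skipped.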

More From: Journal of Information Processing Systems

Similar Papers

Data Security Issues and Countermeasure Suggestions for Financial Big data: A Literature Review
Ningbo Chen
Advances in Economics, Management and Political Sciences | VOL. 41
10 Nov 2023

Research on Optimization of Enterprise Financial Management System Based on Big Data Hadoop
Yue Chang ... Juan Wang
11 Dec 2022

Corporate Credit Risk Rating Model Based on Financial Big Data
Mingzhi Tang ... Runzhou Zhao
BCP Business & Management | VOL. 48
24 Jul 2023

Optimization of Quantitative Investment Strategies in the Financial Big Data Environment
Jinhong Wang
Frontiers in Business, Economics and Management | VOL. 12
06 Dec 2023
