Mining Problem Research Articles

Role mining is a technique that is used to derive a role-based authorization policy from an existing policy. Given a set of users U , a set of permissions P and a user-permission authorization relation UPA ⊆ U × P , a role mining algorithm seeks to compute a set of roles R , a user-role authorization relation UA ⊆ U × R and a permission-role authorization relation PA ⊆ R × P , such that the composition of UA and PA is close (in some appropriate sense) to UPA . Role mining is therefore a core problem in the specification of role-based authorization policies. Role mining is known to be hard in general and exact solutions are often impossible to obtain, so there exists an extensive literature on variants of the role mining problem that seek to find approximate solutions and algorithms that use heuristics to find reasonable solutions efficiently. In this paper, we first introduce the Generalized Noise Role Mining problem (GNRM) – a generalization of the MinNoise Role Mining problem – which we believe has considerable practical relevance. In particular, GNRM can produce “security-aware” or “availability-aware” solutions. Extending work of Fomin et al., we show that GNRM is fixed parameter tractable, with parameter r + k , where r is the number of roles in the solution and k is the number of discrepancies between UPA and the relation defined by the composition of UA and PA . We further introduce a bi-objective optimization variant of GNRM, where we wish to minimize both r and k subject to upper bounds $r \le \bar{r} $ and $k\le \bar{k} $ , where $\bar{r} $ and $\bar{k} $ are constants. We show that the Pareto front of this bi-objective optimization problem (BO-GNRM) can be computed in fixed-parameter tractable time with parameter $\bar{r} +\bar{k} $ . From a practical perspective, a solution to BO-GNRM gives security managers the opportunity to identify a mined policy offering the best trade-off between the number of policy discrepancies and the number of roles. We then report the results of our experimental work using the integer programming solver Gurobi to solve instances of BO-GNRM. Our key findings are that (a) we obtained strong support that Gurobi’s performance is fixed-parameter tractable, (b) our results suggest that our techniques may be useful for role mining in practice, based on our experiments in the context of three well-known real-world authorization policies. We observed that, in many cases, our solver is capable of obtaining optimal solutions when the values of either k or r are small.

Read full abstract

Finding dense subgraphs is a core problem in graph mining with many applications in diverse domains. At the same time many real-world networks vary over time, that is, the dataset can be represented as a sequence of graph snapshots. Hence, it is natural to consider the question of finding dense subgraphs in a temporal network that are allowed to vary over time to a certain degree. In this paper, we search for dense subgraphs that have large pairwise Jaccard similarity coefficients. More formally, given a set of graph snapshots and input parameter α\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\alpha$$\\end{document}, we find a collection of dense subgraphs, with pairwise Jaccard index at least α\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\alpha$$\\end{document}, such that the sum of densities of the induced subgraphs is maximized. We prove that this problem is NP-hard and we present a greedy, iterative algorithm which runs in Onk2+m\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$${\\mathcal {O}} \\mathopen {} \\left( nk^2 + m\\right)$$\\end{document} time per single iteration, where k is the length of the graph sequence and n and m denote number of vertices and total number of edges respectively. We also consider an alternative problem where subgraphs with large pairwise Jaccard indices are rewarded. We do this by incorporating the indices directly into the objective function. More formally, given a set of graph snapshots and a weight λ\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\lambda$$\\end{document}, we find a collection of dense subgraphs such that the sum of densities of the induced subgraphs plus the sum of Jaccard indices, weighted by λ\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\lambda$$\\end{document}, is maximized. We prove that this problem is NP-hard. To discover dense subgraphs with good objective value, we present an iterative algorithm which runs in On2k2+mlogn+k3n\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$${\\mathcal {O}} \\mathopen {}\\left( n^2k^2 + m \\log n + k^3 n\\right)$$\\end{document} time per single iteration, and a greedy algorithm which runs in On2k2+mlogn+k3n\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$${\\mathcal {O}} \\mathopen {}\\left( n^2k^2 + m \\log n + k^3 n\\right)$$\\end{document} time. We show experimentally that our algorithms are efficient, they can find ground truth in synthetic datasets and provide good results from real-world datasets. Finally, we present two case studies that show the usefulness of our problem.

Read full abstract

Mining Problem Research Articles

Related Topics

Articles published on Mining Problem

Optimization of frequent item set mining parallelization algorithm based on spark platform

A Nonaxial-Type Swirling Cavitating Nozzle for Exploiting Natural Gas Hydrate

Oligarchy of Power in The Management of C-Mine Resources in Noemuti, North Central Timor Regency

Review of Major Influencing Factors Contributing to Persisting Safety Problems in Coal Mines: Addressing Systemic Challenges

Bi-objective Optimization in Role Mining

A secure Multi-Frequency Computation Protocol in 2-Part Fully Distributed Setting

Production of low-sulphur tailings by hydrocycloning

Research on Gas Extraction Effect of High Gas Mines Based on Stereoscopic Cross Directional Drilling Technology

Experimentation of Heat-Insulating Materials for Surrounding Rocks in Deep Mines and Simulation Study of Temperature Reduction

Geospatial Analysis of the Socioeconomic and Demographic Effects of Historic Coal Mining in the Greater Pittsburgh Region, Pennsylvania, USA

Research on controlled mining of end slope fire-burned area in open-pit mine

Swin-chart: An efficient approach for chart classification

Attention-based fuzzy neural networks designed for early warning of financial crises of listed companies

Enhancing the performance of integer models for addressing the long-term production planning problem in open pit mines by decision variable fixation based on parametric analysis of the final pit limit

Investigating critical metals Ge and Ga in complex sulphide mineral assemblages using LIBS mapping

Jaccard-constrained dense subgraph discovery

Study of Slope Stability of the Mining Wall in an Open-Pit Coal Mine by the Paste Cut-and-Backfill Method

A road network traffic flow data imputation method based on the fusion of spatiotemporal features and adversarial networks

A Secure Parallel Pattern Mining System for Medical Internet of Things.

Об одной новой схеме применения метода конечных элементов в горных науках

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Mining Problem Research Articles

Related Topics

Articles published on Mining Problem

Optimization of frequent item set mining parallelization algorithm based on spark platform

A Nonaxial-Type Swirling Cavitating Nozzle for Exploiting Natural Gas Hydrate

Oligarchy of Power in The Management of C-Mine Resources in Noemuti, North Central Timor Regency

Review of Major Influencing Factors Contributing to Persisting Safety Problems in Coal Mines: Addressing Systemic Challenges

Bi-objective Optimization in Role Mining

A secure Multi-Frequency Computation Protocol in 2-Part Fully Distributed Setting

Production of low-sulphur tailings by hydrocycloning

Research on Gas Extraction Effect of High Gas Mines Based on Stereoscopic Cross Directional Drilling Technology

Experimentation of Heat-Insulating Materials for Surrounding Rocks in Deep Mines and Simulation Study of Temperature Reduction

Geospatial Analysis of the Socioeconomic and Demographic Effects of Historic Coal Mining in the Greater Pittsburgh Region, Pennsylvania, USA

Research on controlled mining of end slope fire-burned area in open-pit mine

Swin-chart: An efficient approach for chart classification

Attention-based fuzzy neural networks designed for early warning of financial crises of listed companies

Enhancing the performance of integer models for addressing the long-term production planning problem in open pit mines by decision variable fixation based on parametric analysis of the final pit limit

Investigating critical metals Ge and Ga in complex sulphide mineral assemblages using LIBS mapping

Jaccard-constrained dense subgraph discovery

Study of Slope Stability of the Mining Wall in an Open-Pit Coal Mine by the Paste Cut-and-Backfill Method

A road network traffic flow data imputation method based on the fusion of spatiotemporal features and adversarial networks

A Secure Parallel Pattern Mining System for Medical Internet of Things.

Об одной новой схеме применения метода конечных элементов в горных науках