Abstract

Next-generation enterprise management systems are beginning to be developed based on the Systems of Engagement (SOE) model. We visualize an SOE as a set of entities. Each entity is modeled by a single parent document with dynamic embedded links (i.e., child documents) that contain multi-modal information about the entity from various networks. Since entities in an SOE are generally queried using keywords, our goal is to efficiently retrieve the top-k entities related to a given keyword-based query by considering the relevance scores of both their parent and child documents. Furthermore, we extend the afore-mentioned problem to incorporate the case where the entities are geo-tagged. The main contributions of this work are three-fold. First, it proposes an efficient bitmap-based approach for quickly identifying the candidate set of entities, whose parent documents contain all queried keywords. A variant of this approach is also proposed to reduce memory consumption by exploiting skews in keyword popularity. Second, it proposes the two-tier HI-tree index, which uses both hashing and inverted indexes, for efficient document relevance score lookups. Third, it proposes an R-tree-based approach to extend the afore-mentioned approaches for the case where the entities are geo-tagged. Fourth, it performs comprehensive experiments with both real and synthetic datasets to demonstrate that our proposed schemes are indeed effective in providing good top-k result recall performance within acceptable query response times.

Highlights

  • Nowadays, enterprises have fully realized the value of data that they have about their customers in Customer Relationship Management (CRM) systems and transactional systems

  • We propose an R-tree-based extension of our proposed schemes for geo-tagged entities, as we shall see in Sect

  • Given a top-k query Q, a parent document is considered to be in the candidate set if it contains at least one instance of each of the queried keywords in Q. (Note that if instances of any of the queried keywords occur in the child documents without occurring in the parent document, the parent document would not be in the candidate set.) the query model uses ‘AND’ semantics

Read more

Summary

Introduction

Enterprises have fully realized the value of data that they have about their customers in Customer Relationship Management (CRM) systems and transactional systems. To gain competitive advantage by knowing more about their customers, enterprises try to incorporate the social data of their customers. This has led to the emergence of. While SORs typically allow only passive one-way transactional communication between the enterprise management system and the stakeholders, SOEs enable two-way communication, thereby allowing stakeholders to engage and collaborate with each other [2]. The work in [18] studied how management consultants can be facilitated toward navigating competing systems of engagement w.r.t. their daily routine operations and their consultancy services for clients. We have motivated SOEs using CRM applications, SOEs have significant commercial applications in important areas such as human resource management and supply chain management

Problem Statement with an Example
Related Work and Differentiation
Our Contributions
Context of the Problem
Example for Relevance Score Computation
Top‐k Query Processing Schemes
A Variant of the Brute‐Force Approach
Illustrative Example for ZCS
Determining the Candidate Set Using Bitmaps
BSA: A Bitmap Array‐Based Technique for Looking Up Document Relevance Scores
HI‐tree: A Dynamic Index for Computing Relevance Scores
Tier 1
Tier 2
Context and Problem Statement
R‐tree‐Based Approach for Geo‐Tagged Entities
Performance Study for the Case of Non‐Geo‐Tagged Entities
Determining the Percentage Factor for ZCS
Effect of Varying the Number of Documents
Effect of Varying the Number of Queried Keywords
Effect of Skew in Keywords Distribution
Performance Study for the Case of Geo‐Tagged
Effect of Varying the Spatial Query Window Size
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call