Citation Graph Research Articles

We describe a novel approach to precise searching in the full content of digital libraries. The Searchbench (for search workbench) is based on sentence-wise syntactic and semantic natural language processing (NLP) of both born-digital and scanned publications in PDF format. The term born-digital means natively digital, i.e. prepared electronically using typesetting systems such as LaTeX, OpenOffice, and the like. In the Searchbench, queries can be formulated as (possibly underspecified) statements consisting of simple subject-predicate-object constructs such as ‘algorithm improves word alignment’. This reduces the number of false hits in large document collections, which occur when the search words happen to appear close to each other but are not semantically related. The method also abstracts from passive voice and predicate synonyms. Moreover, negated statements can be excluded from the search results, and negated antonym predicates again count as synonyms (e.g. not include = exclude).

In the Searchbench, a sentence-semantic search can be combined with search filters for classical full-text search, bibliographic metadata and automatically computed domain terms. Auto-suggest fields facilitate text input. Queries can be bookmarked or emailed. Furthermore, a novel citation browser in the Searchbench allows graphical navigation in citation networks, which have been extracted automatically from metadata and paper texts. The citation browser displays short phrases from citation sentences at the edges of the citation graph and thus allows students and researchers to quickly browse publications and immerse themselves in a new research field. Clicking on a citation edge shows the original citation sentence in context, and optionally also in the original PDF layout.

To showcase the usefulness of our research, we have applied it to a collection of currently approx. 25,000 open access research papers in the field of computational linguistics and language technology, the ACL Anthology (http://aclweb.org/anthology). The Searchbench user interface is a web application that runs in any modern, JavaScript-enabled web browser, including on smartphones and tablet computers. The system is a free and public service at http://aclasb.dfki.de. Because the NLP technology is domain-independent, it could also be applied to newspaper texts, technical documentation, or scientific publications from other disciplines. The aim of this paper is to make the benefits of this new, language-technology-based approach known in library research and related fields.

This article summarises nine peer-reviewed publications from the past three years that have been published at international conferences and workshops in the area of computational linguistics, and tries to present them in an appropriate way to the LIBER audience. The original papers contain more details and are freely available from the author’s homepage[1] or via the Searchbench[2].
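The statement matching described in this abstract can be illustrated with a minimal sketch. It is not the Searchbench implementation: it assumes that an upstream parser has already reduced each sentence to a subject-predicate-object triple with a negation flag (which is where passive voice would be abstracted away), and the `Statement` class, the `SYNONYMS` and `ANTONYMS` tables and the `matches` function are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical lexical tables; a real system would derive them from a lexical resource.
SYNONYMS = {
    "improve": {"improve", "enhance"},
    "include": {"include", "contain"},
    "exclude": {"exclude", "omit"},
}
ANTONYMS = {"include": "exclude", "exclude": "include"}


@dataclass(frozen=True)
class Statement:
    """One subject-predicate-object triple, assumed to come from a parser."""
    subject: str
    predicate: str  # lemmatised, active-voice predicate
    obj: str
    negated: bool = False


def normalize(predicate: str, negated: bool):
    """Rewrite a negated predicate as its positive antonym when one is known."""
    if negated and predicate in ANTONYMS:
        return ANTONYMS[predicate], False
    return predicate, negated


def canonical(predicate: str) -> str:
    """Map a predicate to the head word of its synonym group, if any."""
    for head, group in SYNONYMS.items():
        if predicate in group:
            return head
    return predicate


def matches(stmt: Statement,
            subject: Optional[str] = None,
            predicate: Optional[str] = None,
            obj: Optional[str] = None) -> bool:
    """Match a possibly underspecified query (None means 'any') against a statement."""
    pred, negated = normalize(stmt.predicate, stmt.negated)
    if negated:
        return False  # statements that stay negated are excluded from the hits
    if subject is not None and subject != stmt.subject:
        return False
    if predicate is not None and canonical(predicate) != canonical(pred):
        return False
    if obj is not None and obj != stmt.obj:
        return False
    return True
```

Under these assumptions, the query ‘algorithm improves word alignment’ would match a sentence parsed as (algorithm, enhance, word alignment), would not match its negated counterpart, and a query with the predicate ‘exclude’ would match a sentence parsed as a negated ‘include’.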

For most global software companies with a client base that covers a large number of regulated businesses, regulatory compliance represents a significant challenge. The world of compliance has become increasingly complex due to the overwhelming number of regulations, laws, and standards that are introduced every year. These laws may vary significantly in their scope and applicability depending on the industry sector and the geographical area of the end client. In addition, many of these laws are created by different legislative bodies, resulting in overlapping and sometimes conflicting provisions. To further complicate matters, laws are often based on existing ones, forming a complex set of interdependent rules in which changes made in one place can propagate to affect many other laws, sometimes in an inconsistent manner. There is clearly a need to investigate techniques and tools that can relieve IT solution providers of the complexity of dealing with regulatory compliance. In this paper, we present an approach and a supporting tool that aim to facilitate the analysis of multiple regulations. Our approach is based on the exploration of the citation relationship that links various laws together. The citation relationship is represented by a citation graph that an analyst can use to navigate through the provisions of various interrelated laws, either to uncover overlaps and possible conflicts or simply to understand the content of specific law documents. We also present a tool called CompDSS (Compliance Decision Support System) that supports our approach. Finally, we show the effectiveness of the presented approach by applying it to three regulations, namely SOX, HIPAA, and GLBA.
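The idea of navigating a citation graph of legal provisions can be sketched as follows. This is an illustration only and not the CompDSS tool: the edge list uses invented placeholder identifiers (`RegA:1` and so on) rather than real provisions, and the function names are hypothetical. The sketch assumes that citations between provisions have already been extracted.

```python
from collections import defaultdict, deque

# Hypothetical citation edges (citing provision -> cited provision).
# The identifiers are placeholders, not real legal provisions.
EDGES = [
    ("RegA:1", "RegB:4"),
    ("RegA:2", "RegC:7"),
    ("RegB:4", "RegC:7"),
    ("RegC:7", "RegC:8"),
]


def build_graph(edges):
    """Build an adjacency map from citing provisions to the provisions they cite."""
    graph = defaultdict(set)
    for citing, cited in edges:
        graph[citing].add(cited)
    return graph


def reachable(graph, start):
    """Return every provision reachable from `start` by following citations (BFS)."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for cited in graph.get(node, ()):
            if cited not in seen:
                seen.add(cited)
                queue.append(cited)
    return seen


def regulation(provision):
    """Extract the regulation name from an identifier of the form 'Reg:section'."""
    return provision.split(":", 1)[0]


def shared_targets(graph):
    """Find provisions cited from more than one regulation: candidates for overlap review."""
    citers = defaultdict(set)
    for citing, targets in graph.items():
        for cited in targets:
            citers[cited].add(regulation(citing))
    return {prov: regs for prov, regs in citers.items() if len(regs) > 1}
```

With the placeholder edges above, `reachable(build_graph(EDGES), "RegA:1")` collects the provisions an analyst would have to read to follow `RegA:1` through its citations, and `shared_targets` flags `RegC:7` because it is cited from two different regulations.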

Articles published on Citation Graph

Document Similarity Search Algorithm Based On Hierarchy Model

Diversifying Citation Recommendations

Inheritance Patterns in Citation Networks Reveal Scientific Memes

Patent Query Formulation by Synthesizing Multiple Sources of Relevance Evidence

Exploiting citation networks for large-scale author name disambiguation

GraMi

Reverse top-k search using random walk with restart

Review of the indirect citations paradigm: theory and practice of the assessment of papers, authors and journals

The effect of citation analysis on query expansion for patent retrieval

Full‐text citation analysis: A new method to enhance scholarly networks

Fast recommendation on bibliographic networks with sparse-matrix ordering and partitioning

The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries

Problems and Methods of Automatically Constructing a Citation Graph from a Collection of Scientific Documents

Geometric graph properties of the spatial preferred attachment model

Universality of performance indicators based on citation and reference counts

Citation genetic genealogy: a novel insight for citation analysis in scientific literature

Mining citation information from CiteSeer data

F-Value: measuring an article’s scientific impact

An approach based on citation analysis to support effective handling of regulatory compliance
