Parsing Speed Research Articles

The rapid development of information technology has made the amount of information in massive texts far exceed human intuitive cognition, and dependency parsing can effectively deal with information overload. In the background of domain specialization, the migration and application of syntactic treebanks and the speed improvement in syntactic analysis models become the key to the efficiency of syntactic analysis. To realize domain migration of syntactic tree library and improve the speed of text parsing, this paper proposes a novel approach-the Double-Array Trie and Multi-threading (DAT-MT) accelerated graph fusion dependency parsing model. It effectively combines the specialized syntactic features from small-scale professional field corpus with the generalized syntactic features from large-scale news corpus, which improves the accuracy of syntactic relation recognition. Aiming at the problem of high space and time complexity brought by the graph fusion model, the DAT-MT method is proposed. It realizes the rapid mapping of massive Chinese character features to the model's prior parameters and the parallel processing of calculation, thereby improving the parsing speed. The experimental results show that the unlabeled attachment score (UAS) and the labeled attachment score (LAS) of the model are improved by 13.34% and 14.82% compared with the model with only the professional field corpus and improved by 3.14% and 3.40% compared with the model only with news corpus; both indicators are better than DDParser and LTP 4 methods based on deep learning. Additionally, the method in this paper achieves a speedup of about 3.7 times compared to the method with a red-black tree index and a single thread. Efficient and accurate syntactic analysis methods will benefit the real-time processing of massive texts in professional fields, such as multi-dimensional semantic correlation, professional feature extraction, and domain knowledge graph construction.

Read full abstract

The almost existing commercial HL7 interface engines apply the string array method which is run in the main memory to HL7 message parsing process. But, if the HL7 message is big, this method will be possible to cause the computer system to raise critical and fatal problems because a long string array can carry a too heavy load to the main memory and the processor. Therefore, the image and the multi-media data which are needed for the modern medical records could be limited to be included into a HL7 message because the size is usually too big in comparison with the main body of a HL7 message and in result, it make the size of the HL7 message expanded. The purpose of this study is to suggest a new HL7 interface algorithm which can solve this problem by the method of the 'Streaming Algorithm'. This new method for HL7 message parsing apply the character-stream object which process character by character between the main memory and hard disk device with the consequence that the processing load on main memory could be alleviated. The main functions of this new engine are generating, parsing, validating, browsing, sending, and receiving of message. And also, this can parse and generate XML-formated HL7 message. This engine had been practically tested in the Discharge Summary Information Exchange System between Kyungpook National University Hospital and Chonnam National University Hospital for the purpose of proofing its usability for a month. Overall, the preliminary results of this test is considered as good, but it is pointed out that some improvement is needed relating to the speed of parsing which was predicted because this engine partly used the memory of hard disk device instead of the main memory. (Journal of Korean Society of Medical Informatics 10-1,17-33, 2004)

Read full abstract

Parsing Speed Research Articles

Articles published on Parsing Speed

DAT-MT Accelerated Graph Fusion Dependency Parsing Model for Small Samples in Professional Fields.

RabbitFX: Efficient Framework for FASTA/Q File Parsing on Modern Multi-Core Platforms.

Performance comparison of chosen JSON parsers and a parser that employs a different reading method

Dependency Grammar Induction with a Neural Variational Transition-Based Parser

Deep Semantic Role Labeling With Self-Attention

Fast and Efficient XML Data Access for Next-Generation Mass Spectrometry.

A representational system of idiomatic constructions: For the building of computational resources

한국어 의존 관계 분석과 자질 집합 분할을 이용한 기계학습의 성능 개선

A Practical GLR Parser Generator for Software Reverse Engineering

Distributed and Parallel Big Textual Data Parsing for Social Sensor Network

Eye movements reset visual perception

Development a New HL7 Interface Engine for Large-size Messages which Include Image Data based on Tree Structure and Streaming Algorithm

Efficient controlling of parsing-stack operation for LR parsers

Even faster lr parsing

Very fast LR parsing

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Parsing Speed Research Articles

Articles published on Parsing Speed

DAT-MT Accelerated Graph Fusion Dependency Parsing Model for Small Samples in Professional Fields.

RabbitFX: Efficient Framework for FASTA/Q File Parsing on Modern Multi-Core Platforms.

Performance comparison of chosen JSON parsers and a parser that employs a different reading method

Dependency Grammar Induction with a Neural Variational Transition-Based Parser

Deep Semantic Role Labeling With Self-Attention

Fast and Efficient XML Data Access for Next-Generation Mass Spectrometry.

A representational system of idiomatic constructions: For the building of computational resources

한국어 의존 관계 분석과 자질 집합 분할을 이용한 기계학습의 성능 개선

A Practical GLR Parser Generator for Software Reverse Engineering

Distributed and Parallel Big Textual Data Parsing for Social Sensor Network

Eye movements reset visual perception

Development a New HL7 Interface Engine for Large-size Messages which Include Image Data based on Tree Structure and Streaming Algorithm

Efficient controlling of parsing-stack operation for LR parsers

Even faster lr parsing

Very fast LR parsing