Abstract

In this paper, we describe the development of a new Mizar tokenizer and parser as a component of a search system that works on the Mizar Mathematical Library. The existing Mizar tokenizer and parser can handle only a complete article written in the Mizar language, whereas the newly developed program can also process a snippet of a Mizar article. In particular, because a snippet can be handled without specifying the vocabulary section of the environment part, user input effort is expected to be greatly reduced.

Highlights

  • The author is developing a new information retrieval system that works on the Mizar Mathematical Library (MML) [1]

  • To address the drawbacks of MML Query, the search system under development extracts features from the input data and compares them with those of the theorems and definitions registered in the MML

  • The Mizar language has a context-sensitive grammar, and the set of valid symbols is determined by the Mizar articles listed in the vocabulary section of the environment part
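The dependence of tokenization on the vocabulary section can be illustrated with a minimal sketch. It is not the author's implementation: the vocabulary names (`VOC_A`, `VOC_B`), their symbols, and the functions `symbols_for` and `tokenize` are hypothetical, and the longest-match rule used here is a simplifying assumption. The sketch only shows that the same text can split into different tokens depending on which vocabularies are in scope.

```python
import re

# Hypothetical vocabulary tables; real MML vocabularies are not reproduced here.
VOCABULARIES = {
    "VOC_A": {"+"},
    "VOC_B": {"++"},
}

# Fallback pattern for identifiers and numerals (an assumption for this sketch).
WORD = re.compile(r"[A-Za-z0-9_']+")


def symbols_for(vocabulary_names, vocabularies=VOCABULARIES):
    """Collect the symbols made valid by the listed vocabularies."""
    symbols = set()
    for name in vocabulary_names:
        symbols |= vocabularies[name]
    return symbols


def tokenize(text, symbols):
    """Split text into tokens, preferring the longest known symbol at each position."""
    ordered = sorted(symbols, key=len, reverse=True)
    tokens = []
    i = 0
    while i < len(text):
        if text[i].isspace():
            i += 1
            continue
        # Try known symbols first, longest first.
        token = next((s for s in ordered if text.startswith(s, i)), None)
        if token is None:
            # Otherwise take a run of word characters, or a single character.
            m = WORD.match(text, i)
            token = m.group(0) if m else text[i]
        tokens.append(token)
        i += len(token)
    return tokens


if __name__ == "__main__":
    snippet = "x ++ y"
    print(tokenize(snippet, symbols_for(["VOC_A"])))
    print(tokenize(snippet, symbols_for(["VOC_A", "VOC_B"])))
```

With only `VOC_A` in scope, the two plus signs are read as separate symbols; adding `VOC_B` makes `++` a single token. A tokenizer that must work on snippets without a declared vocabulary section has to resolve this kind of ambiguity by other means.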


Summary

MOTIVATION

The author is developing a new information retrieval system that works on the Mizar Mathematical Library (MML) [1]. We describe a tokenizer and parser for the Mizar language, developed as a component of our search system.

MML Query
Developing search system
Necessity of a new tokenizer and parser
REQUIREMENTS OF NEW TOKENIZER AND PARSER
Tokenizer
Parser
Input and output
Tokenizer specification
Functionality
Parser specification
Choice of programming languages
Performance
Display of parsing results
Type checking
Performance improvement
