Abstract

This paper discusses the formation of math grammar rules for LATEX math equations. These rules are used to generate Abstract Syntax Tree (AST) which extracts structural information from mathematical expressions given in LATEX format. Later AST is used to generate XML structure of mathematical expressions that make mathematical expressions machine-readable in heterogeneous environments. A rule-based algorithm is also proposed that converts LATEX math expressions into Content MathML (CMML), which produces semantic enrichment in web documents. The rules for writing LATEX math equations are formulated and implemented as LATEX Math Grammar (LMG), which are used for generating AST. Further, AST is converted into XML structure which is used to generate CMML encoding. Initially, the conversion algorithm is tested on 20 equations used in an NTCIR-12 math competition, then the algorithm is tested on NTCIR-12 Wikipedia-MathIR and ArXiv data sets. The results show that our algorithm is capable of converting LATEX complex equations into CMML extensively as compared to the existing ones as well as its time efficiency is better than contemporary systems.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.