XML Format Research Articles

The article deals with the issue of using data serializers for the implementation of projects related to the processing of large volumes of data, as well as the support of high-speed data transmission in distributed systems. It is shown that in this context, the choice of the most effective serialization mechanism is critical for ensuring the performance and scalability of applications. The purpose of this work is to study the effectiveness of data serializers of the C# programming language by developing a software product for testing serializers using objects of different size and type. A review of scientific research on the use of various data serialization formats: XML, JSON, BSON, MessagePack, Smile, Protocol Buffers, Flat Buffers, Apache Thrift was conducted. It was concluded that XML and JSON formats are the most popular today, and their comparative analysis was performed. The expediency of using the JSON serialization format is substantiated, which is due to its safety compared to the binary format, its smaller size compared to the XML format, as well as the support of most software development tools. The .NET framework is chosen, which provides standard tools for JSON serialization of the C# programming language, namely: System.Runtime.Serialize.Json and System.Text.Json, which are supplied by default. The most popular software solutions for serializing C# objects are analyzed, the feasibility of testing such serializers as Jil, Json.NET, Utf8Json, SpanJson and standard serializers is shown in order to identify the advantages and disadvantages of their use for the implementation of specific tasks and projects. The C# BenchmarkDotNet programming language library was chosen to create the tester program. It is noted that this framework of the .NET platform allows you to convert methods into tests and create performance testing thanks to a powerful statistical mechanism. A class diagram and a component diagram of the developed software are given. A study of 5 data serializers was conducted, which included the execution of 7 experiments on serialization of objects with different types of data. The consumption of time and working memory during serialization of small and large objects was analyzed; objects containing one-dimensional, two-dimensional and three-dimensional arrays of natural numbers, an object with a complex chain of class inheritance, as well as an object containing a dictionary. The results of experimental studies showed the dependence of the effectiveness of serializers on the type and volume of data to be serialized. It is concluded that there is no one-size-fits-all serializer that will perform best in all cases. Recommendations for the use of various serializers are provided, taking into account the requirements of a specific project

The traditional way of publishing in PDF makes it difficult to retrospectively convert the legacy literature into data. This presentation will discuss pre-publication tagging as an alternative solution for publishing FAIR (Findable, Accessible, Interoperable, Resuable) biodiversity data. The Metotaxa-Metostem workflow Тhe MetoTaxa project aims to create a new digital production chain for the European Journal of Taxonomy, which enables the pre-publication semantic structuring of text, automatic tagging and semantic enrichment (annotation). The system is based on a single-source publishing model, where the development of an XML file enables technical editors to automatically enrich text and produce multiple digital outputs. This makes it possible to structure generic or domain-specific sections of articles (e.g., Introduction; Material and methods; Taxon names or Мaterial examined). Thanks to the GoldenGate API developed by Plazi, the Text Encoding Intiative (TEI) XML source file is automatically annotated with JATS TaxPub tags: taxon names are labeled and each authorship can be checked via Catalogue of Life, each element of the material examined is parsed thanks to the preformatting of the text (Chester et al. 2019). Also, each bibliographic reference is parsed into Journal Article Tag Suite (JATS) elements (author names, title, journal, etc.), which automatically links references to their in-text citations. Pre-publication tagging will be carried out by the technical editors and then checked by the authors before publication, and will be sent to databases such as Global Biodiversity Information Facility (GBIF) or Biodiversity Literature Repository (BLR) as soon as the article is published. We will also briefly present MetoStem, which offers a technical solution for the digital transformation of monographs, and particularly floras. The tools and methods developed by this project will enable advanced publication of interoperable structured text and data. ARPHA Publishing Platform Launched in 2010 by Pensoft, ARPHA (Penev et al. 2010) is the first ever scholarly publishing platform to support pre-publication semantic tags and enhancements to entities (e.g., taxon treatments, taxon names, sequences) in the JATS TaxPub XML format developed by Plazi, which are then embedded into the HTML version of the article. Having proved advantageous for biodiversity scientists, Pensoft’s pre-publication tagging workflow has since been adopted by over 30 biodiversity journals hosted on ARPHA. The second development stage of ARPHA was marked by the launch of ARPHA Writing Tool (AWT)*1 and Biodiversity Data Journal in 2013. AWT supports import of Darwin Core structured data from GBIF, Barcode of Life Data Systems (BOLD) and Integrated Digitized Biocollections (iDigBio) directly into manuscripts. These are also exported automatically as published material citations to GBIF. AWT also provides several other unique tools encompassed within the ARPHA-BioDiv toolbox (Penev et al. 2017). Currently, AWT is being redeveloped into a standalone, freely accessible installation (AWT 2.0), based on a micro-service architecture. It enables new semantic enhancements during the authoring process, which can be confirmed by the authors before manuscript submission. Such enhancements include the in-text citations context by CiTO ontology; automated tagging of taxon names and linking to their identifiers in authoritative sources; annotator tool; nanopublication module; automated search and import of references; treatment citation module; export/import to/from JATS TaxPub; and internal communication tool for contributors.

XML Format Research Articles

Related Topics

Articles published on XML Format

EVE-X: Software to Identify Novel Viral Insertions in Wild-Caught Arthropod Hosts From Next-Generation Short Read Data.

Eye Injury Incidence in Germany from 2008 to 2022: An Analysis of Hospital Quality Reports.

OPTIMIZATION OF MANAGEMENT DECISIONS IN TAX RELATIONS: NEW TECHNOLOGIES AND STANDARDS IN ACCOUNTING PROCESSES

Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema

Web System to Support the Teaching of an Undergraduate Distributed Systems Course

Analyzing ECG signals in professional football players using machine learning techniques

Data resource profile of an online database system for forensic mental health services

Performance research of C# programming language data serializers using the developed software product for testing

Implementing the subsystem of research findings control in RNPLS&T’s Single Open Information Archive (SOIA)

Creating JATS XML from DOCX

Abstract 17676: An Electrocardiographic Deep Learning Model is Informative for the Prediction of Atrioventricular Block After Transcatheter Aortic Valve Replacement

OCR Pipeline for Transforming Parliamentary Debates into Linked Data: Case ParliamentSampo - Parliament of Finland on the Semantic Web

Open RT Structures: A Solution for TG-263 Accessibility

Metadata Scraping Using Programmable Customized Search Engine

Building an Electronic Medical Record System Exchanged in FHIR Format and Its Visual Presentation.

Describing Inscriptions of Ancient Italy. The ItAnt Project and Its Information Encoding Process

Pre-Publication Data Linking in Taxonomy and Biodiversity: The ARPHA and Metotaxa-Metostem Publishing Systems

LinguaPhylo: A probabilistic model specification language for reproducible phylogenetic analyses.

Enabling efficient business process mining using flatten sequential structure model

Representing the Sung Poetry of Ottoman Art Music in a Critical Digital Edition in TEI XML

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

XML Format Research Articles

Related Topics

Articles published on XML Format

EVE-X: Software to Identify Novel Viral Insertions in Wild-Caught Arthropod Hosts From Next-Generation Short Read Data.

Eye Injury Incidence in Germany from 2008 to 2022: An Analysis of Hospital Quality Reports.

OPTIMIZATION OF MANAGEMENT DECISIONS IN TAX RELATIONS: NEW TECHNOLOGIES AND STANDARDS IN ACCOUNTING PROCESSES

Embedding Structure into HTML for More Precise Retrieval of Information, A Novel XML Schema

Web System to Support the Teaching of an Undergraduate Distributed Systems Course

Analyzing ECG signals in professional football players using machine learning techniques

Data resource profile of an online database system for forensic mental health services

Performance research of C# programming language data serializers using the developed software product for testing

Implementing the subsystem of research findings control in RNPLS&amp;T’s Single Open Information Archive (SOIA)

Creating JATS XML from DOCX

Abstract 17676: An Electrocardiographic Deep Learning Model is Informative for the Prediction of Atrioventricular Block After Transcatheter Aortic Valve Replacement

OCR Pipeline for Transforming Parliamentary Debates into Linked Data: Case ParliamentSampo - Parliament of Finland on the Semantic Web

Open RT Structures: A Solution for TG-263 Accessibility

Metadata Scraping Using Programmable Customized Search Engine

Building an Electronic Medical Record System Exchanged in FHIR Format and Its Visual Presentation.

Describing Inscriptions of Ancient Italy. The ItAnt Project and Its Information Encoding Process

Pre-Publication Data Linking in Taxonomy and Biodiversity: The ARPHA and Metotaxa-Metostem Publishing Systems

LinguaPhylo: A probabilistic model specification language for reproducible phylogenetic analyses.

Enabling efficient business process mining using flatten sequential structure model

Representing the Sung Poetry of Ottoman Art Music in a Critical Digital Edition in TEI XML

Implementing the subsystem of research findings control in RNPLS&T’s Single Open Information Archive (SOIA)