PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan

A R Kinjo,R Yamashita,H Nakamura

doi:10.1093/database/baq021

Abstract

This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations) and are then served to PDBj users via the World Wide Web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in a way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into a corresponding relational table that is named after the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform a simple keyword search or use 'Advanced Search', which can specify various conditions on the entries. More experienced users can query the database using SQL statements, which can be constructed in a uniform manner. Thus, PDBj Mine achieves a combination of the flexibility of XML documents and the robustness of the RDB.

Database URL: http://www.pdbj.org/
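The idea of storing the data for each XPath in a table named after that XPath can be illustrated with a minimal sketch. This is not the PDBj Mine loader: the toy XML document, the `shred` and `load` helpers, the column names (`doc_id`, `ordinal`, `value`) and the use of SQLite are all assumptions made for illustration only.

```python
import sqlite3
import xml.etree.ElementTree as ET
from collections import defaultdict

# Toy document loosely shaped like PDBML; element names are illustrative only.
TOY_XML = """
<datablock datablockName="1ABC">
  <entityCategory>
    <entity id="1"><type>polymer</type></entity>
    <entity id="2"><type>water</type></entity>
  </entityCategory>
</datablock>
"""


def shred(xml_text):
    """Return {xpath: [(doc_id, ordinal, text), ...]} for every leaf element."""
    root = ET.fromstring(xml_text)
    doc_id = root.get("datablockName")
    rows = defaultdict(list)

    def walk(elem, path):
        children = list(elem)
        if not children:
            # Leaf element: record its text under the path that leads to it.
            ordinal = len(rows[path]) + 1
            rows[path].append((doc_id, ordinal, (elem.text or "").strip()))
        for child in children:
            walk(child, f"{path}/{child.tag}")

    walk(root, f"/{root.tag}")
    return rows


def load(conn, rows):
    """Create one table per XPath (hypothetical layout) and insert its rows."""
    for xpath, records in rows.items():
        # Double quotes let the XPath, slashes included, serve as the table name.
        conn.execute(
            f'CREATE TABLE IF NOT EXISTS "{xpath}" '
            "(doc_id TEXT, ordinal INTEGER, value TEXT)"
        )
        conn.executemany(f'INSERT INTO "{xpath}" VALUES (?, ?, ?)', records)


conn = sqlite3.connect(":memory:")
load(conn, shred(TOY_XML))

types = [
    row[0]
    for row in conn.execute(
        'SELECT value FROM "/datablock/entityCategory/entity/type" ORDER BY ordinal'
    )
]
print(types)
```

Because the table name is the path itself, a reader who knows where a value sits in the XML hierarchy also knows which table to query.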

Highlights

All protein structural data must be deposited to the worldwide Protein Data Bank (1) if they are to be published in scientific journals
Based on PDBMLplus, we have recently developed Protein Data Bank Japan (PDBj) Mine, a relational database (RDB) and its web interface to the PDBMLplus data
Although it is an RDB, PDBj Mine preserves the structure of PDBMLplus so that a user who is familiar with PDBML or PDBMLplus can construct SQL queries based on the hierarchical structure of XML

Summary

Introduction

All protein structural data must be deposited to the worldwide Protein Data Bank (wwPDB) (1) if they are to be published in scientific journals. The deposited data are saved in three different formats [PDB, mmCIF (2) and PDBML (3)] and are published at the FTP sites of the wwPDB members, which include RCSB PDB (USA), PDBe (Europe) and PDBj (Japan), together with BMRB (Biological Magnetic Resonance Bank). We describe the design and implementation of PDBj Mine. Although it is an RDB, PDBj Mine preserves the structure of PDBMLplus (an XML format), so that a user who is familiar with PDBML or PDBMLplus can construct SQL queries based on the hierarchical structure of XML. The basic structure of PDBj Mine and the sample queries described in this article will help the user to construct more advanced queries against PDBj to retrieve more useful information on protein structures.
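As a rough illustration of how such a query might look, the sketch below joins a summary table with a table named after an XPath. The table names, column names (`pdbid`, `title`, `resolution`, `id`, `type`) and all data values are hypothetical and chosen for this example; they should not be read as the actual PDBj Mine schema. SQLite is used here only so that the example is self-contained.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical schema: one summary table plus one table named after an XPath.
conn.executescript("""
CREATE TABLE brief_summary (pdbid TEXT PRIMARY KEY, title TEXT, resolution REAL);
CREATE TABLE "/datablock/entityCategory/entity" (pdbid TEXT, id TEXT, type TEXT);
INSERT INTO brief_summary VALUES
  ('1abc', 'toy kinase', 1.8),
  ('2xyz', 'toy protease', 3.2);
INSERT INTO "/datablock/entityCategory/entity" VALUES
  ('1abc', '1', 'polymer'),
  ('1abc', '2', 'water'),
  ('2xyz', '1', 'polymer');
""")

# Count entities per entry, restricted to entries below a resolution cutoff.
QUERY = """
SELECT s.pdbid, COUNT(*) AS n_entities
FROM brief_summary AS s
JOIN "/datablock/entityCategory/entity" AS e ON e.pdbid = s.pdbid
WHERE s.resolution < ?
GROUP BY s.pdbid
ORDER BY s.pdbid
"""

result = conn.execute(QUERY, (2.0,)).fetchall()
print(result)
```

The same pattern (filter on the summary table, then join to the path-named table holding the detail) applies to any other path, which is what makes queries of this kind uniform to construct.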

Journal: Database	Publication Date: Aug 25, 2010
Citations: 21	License type: cc-by
