Complete Query Research Articles

We have compared commonly used sequence comparison algorithms, scoring matrices, and gap penalties using a method that identifies statistically significant differences in performance. Search sensitivity with either the Smith-Waterman algorithm or FASTA is significantly improved by using modern scoring matrices, such as BLOSUM45-55, and optimized gap penalties instead of the conventional PAM250 matrix. More dramatic improvement can be obtained by scaling similarity scores by the logarithm of the length of the library sequence (In()-scaling). With the best modern scoring matrix (BLOSUM55 or JO93) and optimal gap penalties (-12 for the first residue in the gap and -2 for additional residues), Smith-Waterman and FASTA performed significantly better than BLASTP. With In()-scaling and optimal scoring matrices (BLOSUM45 or Gonnet92) and gap penalties (-12, -1), the rigorous Smith-Waterman algorithm performs better than either BLASTP and FASTA, although with the Gonnet92 matrix the difference with FASTA was not significant. Ln()-scaling performed better than normalization based on other simple functions of library sequence length. Ln()-scaling also performed better than scores based on normalized variance, but the differences were not statistically significant for the BLOSUM50 and Gonnet92 matrices. Optimal scoring matrices and gap penalties are reported for Smith-Waterman and FASTA, using conventional or In()-scaled similarity scores. Searches with no penalty for gap extension, or no penalty for gap opening, or an infinite penalty for gaps performed significantly worse than the best methods. Differences in performance between FASTA and Smith-Waterman were not significant when partial query sequences were used. However, the best performance with complete query sequences was obtained with the Smith-Waterman algorithm and In()-scaling.

Read full abstract

In database systems the end user interacts with the database at the external schema level. At this level the user sees only the logical structure of the database that is relevant to his/her work. Both the relational and the entity-relationship model have proponents arguing that one data model is superior to the other when used in the end user environment. However, a literature review indicated that these arguments have not been based on empirical results from a systematic inquiry. The study reported here examined this issue through a controlled experiment using query writing as the task. Our basic assumption was that if one data model was superior to the other, then the superiority of the model would be reflected in the user's query writing performance. In addition, this superiority would be demonstrated on both simple and complex tasks. Query writing performance was measured by three variables: number of syntax errors, number of semantic errors, and amount of time to complete queries. The results indicated that subjects using the relational model made fewer syntax errors, but required more time to complete a query. No significant differences in the number of semantic errors were found between the two data models. Based on these results, neither the relational nor the entity-relational data model was clearly superior when used as the interface between a database system and the end user. As expected, the more complex tasks caused more syntax and semantic errors, and required more time to complete.

Read full abstract

Complete Query Research Articles

Related Topics

Articles published on Complete Query

Identification of novel homologues of three low molecular weight subunits of the mitochondrial bc1 complex.

G-Log: a graph-based query language

Comparison of methods for searching protein sequence databases

QBI

A sound and complete query evaluation for implicit predicate which is a semantic descriptor of unknown values

No IFs, ANDs, or ORs: A study of database querying

Pattern match reduction for Relational Production Language in the USL MMDBS

The effects of relational and entity-relationship data models on query performance of end users

An average-case analysis of MAT and inverted file

A sound and complete query evaluation algorithm for relational databases with null values

A sound and sometimes complete query evaluation algorithm for relational databases with null values

Specification of a query language by the attribute method

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Complete Query Research Articles

Related Topics

Articles published on Complete Query

Identification of novel homologues of three low molecular weight subunits of the mitochondrial bc1 complex.

G-Log: a graph-based query language

Comparison of methods for searching protein sequence databases

QBI

A sound and complete query evaluation for implicit predicate which is a semantic descriptor of unknown values

No IFs, ANDs, or ORs: A study of database querying

Pattern match reduction for Relational Production Language in the USL MMDBS

The effects of relational and entity-relationship data models on query performance of end users

An average-case analysis of MAT and inverted file

A sound and complete query evaluation algorithm for relational databases with null values

A sound and sometimes complete query evaluation algorithm for relational databases with null values

Specification of a query language by the attribute method