Abstract
BackgroundAllergy is a form of hypersensitivity to normally innocuous substances, such as dust, pollen, foods or drugs. Allergens are small antigens that commonly provoke an IgE antibody response. There are two types of bioinformatics-based allergen prediction. The first approach follows FAO/WHO Codex alimentarius guidelines and searches for sequence similarity. The second approach is based on identifying conserved allergenicity-related linear motifs. Both approaches assume that allergenicity is a linearly coded property. In the present study, we applied ACC pre-processing to sets of known allergens, developing alignment-independent models for allergen recognition based on the main chemical properties of amino acid sequences.ResultsA set of 684 food, 1,156 inhalant and 555 toxin allergens was collected from several databases. A set of non-allergens from the same species were selected to mirror the allergen set. The amino acids in the protein sequences were described by three z-descriptors (z1, z2 and z3) and by auto- and cross-covariance (ACC) transformation were converted into uniform vectors. Each protein was presented as a vector of 45 variables. Five machine learning methods for classification were applied in the study to derive models for allergen prediction. The methods were: discriminant analysis by partial least squares (DA-PLS), logistic regression (LR), decision tree (DT), naïve Bayes (NB) and k nearest neighbours (kNN). The best performing model was derived by kNN at k = 3. It was optimized, cross-validated and implemented in a server named AllerTOP, freely accessible at http://www.pharmfac.net/allertop. AllerTOP also predicts the most probable route of exposure. In comparison to other servers for allergen prediction, AllerTOP outperforms them with 94% sensitivity.ConclusionsAllerTOP is the first alignment-free server for in silico prediction of allergens based on the main physicochemical properties of proteins. Significantly, as well allergenicity AllerTOP is able to predict the route of allergen exposure: food, inhalant or toxin.
Highlights
Allergy is a form of hypersensitivity to normally innocuous substances, such as dust, pollen, foods or drugs
Alignment-free presentation of the protein sequences A set of 684 food, 1,156 inhalant and 555 toxin allergens was collected from several databases as described in Methods
The amino acids in the protein sequences were described by three z-descriptors (z1, z2 and z3)
Summary
Allergy is a form of hypersensitivity to normally innocuous substances, such as dust, pollen, foods or drugs. The second approach is based on identifying conserved allergenicity-related linear motifs. Allergens are small antigens that commonly provoke an IgE antibody response Such antigens normally enter the body at very low doses by diffusion across mucosal surfaces, triggering a Th2 response [2]. The allergen-specific Th2 cells drive allergen-specific B cells to produce IgE, which binds to the high-affinity surface receptor FcεRI, present on mast cells, basophils, and activated eosinophils. On activation, these cells release stored mediators, which in turn give rise to inflammation and tissue damage causing a variety of symptoms. Food allergens rarely cause respiratory reactions and inhalant allergens seldom affect the gut [3]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.