Baseline Kernel Research Articles

Graph kernels are powerful tools to bridge the gap between machine learning and data encoded as graphs. Most graph kernels are based on the decomposition of graphs into a set of patterns. The similarity between two graphs is then deduced from the similarity between corresponding patterns. Kernels based on linear patterns constitute a good trade-off between accuracy and computational complexity. In this work, we propose a thorough investigation and comparison of graph kernels based on different linear patterns, namely walks and paths. First, all these kernels are explored in detail, including their mathematical foundations, the structures of their patterns, and their computational complexity. After that, experiments are performed on various benchmark datasets exhibiting different types of graphs, including labeled and unlabeled graphs, graphs with different numbers of vertices, graphs with different average vertex degrees, and linear and non-linear graphs. Finally, for regression and classification tasks, the accuracy and computational complexity of these kernels are compared and analyzed in light of baseline kernels based on non-linear patterns. Suggestions are proposed for choosing kernels according to the type of graph dataset. This work leads to a clear comparison of the strengths and weaknesses of these kernels. An open-source Python library containing an implementation of all discussed kernels is publicly available on GitHub, promoting and facilitating the use of graph kernels in machine learning problems.
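To make the idea of comparing graphs through linear patterns concrete, here is a minimal sketch of a walk-based kernel. It is not taken from the library mentioned in the abstract; the function name `common_walk_kernel`, its parameters, and the two toy graphs are assumptions chosen for illustration. The sketch counts pairs of walks (one in each graph) that visit the same sequence of vertex labels, up to a maximum length, and down-weights longer walks with a decay factor.

```python
from itertools import product


def common_walk_kernel(adj1, labels1, adj2, labels2, max_length=3, decay=0.5):
    """Weighted count of label-matching walk pairs of length 0..max_length.

    adj1/adj2 map each vertex to a list of neighbours (undirected graphs);
    labels1/labels2 map each vertex to its label. All names are hypothetical.
    """
    # Vertices of the direct product graph: pairs of vertices with equal labels.
    pairs = [(u, v) for u, v in product(adj1, adj2) if labels1[u] == labels2[v]]

    # counts[(u, v)] = number of common walks of the current length ending at (u, v).
    counts = {pair: 1 for pair in pairs}
    total = float(sum(counts.values()))  # walks of length 0 (single vertices)

    for length in range(1, max_length + 1):
        new_counts = {}
        for u, v in pairs:
            # Extend every common walk that ends at a matching pair of neighbours.
            new_counts[(u, v)] = sum(
                counts.get((x, y), 0) for x in adj1[u] for y in adj2[v]
            )
        counts = new_counts
        total += decay ** length * sum(counts.values())
    return total


# Toy graphs (assumed for illustration): a labeled path C-O-C and a single edge C-O.
path_adj = {0: [1], 1: [0, 2], 2: [1]}
path_labels = {0: "C", 1: "O", 2: "C"}
edge_adj = {0: [1], 1: [0]}
edge_labels = {0: "C", 1: "O"}

similarity = common_walk_kernel(path_adj, path_labels, edge_adj, edge_labels)
```

Because walks may revisit vertices, this count can be computed by simple dynamic programming. Path-based kernels forbid repeated vertices, which changes both the patterns counted and the cost of counting them.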


Array comparative genomic hybridization (arrayCGH) is widely used to measure DNA copy numbers in cancer research. ArrayCGH data report log-ratio intensities of thousands of probes sampled along the chromosomes. Typically, the locations and lengths of the probes vary across experiments. This discrepancy in probe selection poses a challenge for integrated classification or analysis across multiple arrayCGH datasets. We propose an alignment-based framework to integrate arrayCGH samples generated from different probe sets. The framework seeks an optimal alignment between the probe series of one arrayCGH sample and the probe series of another, with the aim of finding the maximum possible overlap of DNA copy number variations between the two measured chromosomes. A probe alignment kernel is introduced for integrative patient sample classification, and a multiple probe alignment (MPA) algorithm is introduced for identifying common regions with copy number aberrations. The probe alignment kernel and the MPA algorithm were applied to integrate three bladder cancer datasets as well as artificial datasets. In the experiments, by integrating arrayCGH samples from multiple datasets, the probe alignment kernel used with support vector machines significantly improved patient sample classification accuracy over other baseline kernels. The experiments also demonstrated that the MPA algorithm can find common DNA aberrations that cannot be identified with the standard interpolation method. Furthermore, the MPA algorithm identified many known bladder cancer DNA aberrations containing four known bladder cancer genes, three of which cannot be detected by interpolation. http://www.cs.umn.edu/compbio/ProbeAlign.

Articles published on Baseline Kernel

Graph kernels based on linear patterns: Theoretical and experimental comparisons

Robust One-Class Kernel Spectral Regression.

Sequential Inference Methods for Non-Homogeneous Poisson Processes With State-Space Prior

Development of element-by-element kernel algorithms in unstructured finite-element solvers for many-core wide-SIMD CPUs: Application to earthquake simulation

Spatial resolution compensation by adjusting the reconstruction kernels for iterative reconstruction images of computed tomography

Graphkernels: R and Python packages for graph comparison.

Sparse kernel minimum squared error using Householder transformation and Givens rotation

Learning deep kernels in the space of dot product polynomials

Probabilistic Kernels for Improved Text-to-Speech Alignment in Long Audio Tracks

Substructure counting graph kernels for machine learning from RDF data

Substructure Counting Graph Kernels for Machine Learning from RDF Data

2D scale-adaptive tracking based on projective geometry

Integrative classification and analysis of multiple arrayCGH datasets with probe alignment

Locality kernels for sequential data and their applications to parse ranking
