Systematic Benchmarks Research Articles

BackgroundThe phenotypes of cancer cells are driven in part by somatic structural variants. Structural variants can initiate tumors, enhance their aggressiveness, and provide unique therapeutic opportunities. Whole-genome sequencing of tumors can allow exhaustive identification of the specific structural variants present in an individual cancer, facilitating both clinical diagnostics and the discovery of novel mutagenic mechanisms. A plethora of somatic structural variant detection algorithms have been created to enable these discoveries; however, there are no systematic benchmarks of them. Rigorous performance evaluation of somatic structural variant detection methods has been challenged by the lack of gold standards, extensive resource requirements, and difficulties arising from the need to share personal genomic information.ResultsTo facilitate structural variant detection algorithm evaluations, we create a robust simulation framework for somatic structural variants by extending the BAMSurgeon algorithm. We then organize and enable a crowdsourced benchmarking within the ICGC-TCGA DREAM Somatic Mutation Calling Challenge (SMC-DNA). We report here the results of structural variant benchmarking on three different tumors, comprising 204 submissions from 15 teams. In addition to ranking methods, we identify characteristic error profiles of individual algorithms and general trends across them. Surprisingly, we find that ensembles of analysis pipelines do not always outperform the best individual method, indicating a need for new ways to aggregate somatic structural variant detection approaches.ConclusionsThe synthetic tumors and somatic structural variant detection leaderboards remain available as a community benchmarking resource, and BAMSurgeon is available at https://github.com/adamewing/bamsurgeon.

Read full abstract

The combination of density functional theory and multireference configuration interaction (DFT/MRCI) is a well-established semi-empirical method suitable for computing spectral properties of large molecular systems. To this day, three different Hamiltonians and various parameter set combinations exist. These DFT/MRCI variants are well tried and tested when it comes to electronic excitations of organic molecules. For transition metal complexes, systematic benchmarks against experimental data are missing, however. Here we present an assessment of the DFT/MRCI variants and of time-dependent, linear-response density functional theory (TDDFT) for a diverse set of ligand-centered, metal-to-ligand charge transfer, metal-centered, and ligand-to-metal charge transfer (LMCT) excitations on 21 3d and 4d complexes comprising 10 small inorganic and 11 larger metalorganic compounds with closed-shell ground states. In the course of this assessment, we realized that the excitation energies of transition metal complexes can be very sensitive with respect to the details of the damping function that scales off-diagonal matrix elements. This scaling is required in DFT/MRCI to avoid double counting of dynamic electron correlation. These insights lead to a new Hamiltonian, denoted R2018, with improved performance on transition metal compounds, while the results for organic molecules are nearly unaffected by the modified damping function. Two parameter sets were optimized for this Hamiltonian: One set is to be used in conjunction with the standard configuration selection threshold of 1.0 E h and a second set is for use with a selection threshold of 0.8 E h which leads to shorter wave function expansions. The R2018 Hamiltonian in standard parameterization achieves root-mean-square errors (RMSEs) of merely 0.15 eV for the metalorganic complexes, followed by 0.20 eV for the original DFT/MRCI ansatz, and 0.25 eV for the redesigned DFT/MRCI approach. In comparison, TDDFT gives a much larger RMSE of 0.46 eV for metalorganic complexes. None of the DFT/MRCI variants yields convincing results for small oxides and fluorides which exhibit LMCT transitions. Here, TDDFT performs better. If the oxides and fluorides are excluded from the inorganic test set, satisfactory agreement can be achieved, with RMSE values between 0.26 eV and 0.30 eV for DFT/MRCI and 0.34 eV for TDDFT. The performance of the original and the new DFT/MRCI Hamiltonians deteriorates only slightly, when a tighter selection threshold is chosen, thus enabling the computation of reliable spectral properties even for large metalorganic complexes.

Read full abstract

Systematic Benchmarks Research Articles

Articles published on Systematic Benchmarks

Rapid, Accurate, Ranking of Protein-Ligand Binding Affinities with VM2, the Second-Generation Mining Minima Method.

Modeling Multi-Step Organic Reactions: Can Density Functional Theory Deliver Misleading Chemistry?

Spatial-linked alignment tool (SLAT) for aligning heterogenous slices

Evaluation of computational phage detection tools for metagenomic datasets.

Quantum simulation with just-in-time compilation

Neutron-deuteron scattering cross sections with chiral NN interactions using wave-packet continuum discretization

Towards scalable online machine learning collaborations with OpenML

Shifting towards Proactive OHS Risk Management in Romanian Organizations: Systematic Benchmarks

Stochastic model predictive control for central HVAC plants

A Quadratic Pair Atomic Resolution of the Identity Based SOS-AO-MP2 Algorithm Using Slater Type Orbitals

Combining accurate tumor genome simulation with crowdsourcing to benchmark somatic structural variant detection

On the performance of DFT/MRCI Hamiltonians for electronic excitations in transition metal complexes: The role of the damping function.

Personal Cloud Storage Benchmarks and Comparison

Complexity analysis of simulations with analytic bond-order potentials

A systematic benchmark of the ab initio Bethe-Salpeter equation approach for low-lying optical excitations of small organic molecules.

Effects of vibrational averaging on coupled cluster calculations of spin–spin coupling constants for hydrocarbons

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Systematic Benchmarks Research Articles

Articles published on Systematic Benchmarks

Rapid, Accurate, Ranking of Protein-Ligand Binding Affinities with VM2, the Second-Generation Mining Minima Method.

Modeling Multi-Step Organic Reactions: Can Density Functional Theory Deliver Misleading Chemistry?

Spatial-linked alignment tool (SLAT) for aligning heterogenous slices

Evaluation of computational phage detection tools for metagenomic datasets.

Quantum simulation with just-in-time compilation

Neutron-deuteron scattering cross sections with chiral NN interactions using wave-packet continuum discretization

Towards scalable online machine learning collaborations with OpenML

Shifting towards Proactive OHS Risk Management in Romanian Organizations: Systematic Benchmarks

Stochastic model predictive control for central HVAC plants

A Quadratic Pair Atomic Resolution of the Identity Based SOS-AO-MP2 Algorithm Using Slater Type Orbitals

Combining accurate tumor genome simulation with crowdsourcing to benchmark somatic structural variant detection

On the performance of DFT/MRCI Hamiltonians for electronic excitations in transition metal complexes: The role of the damping function.

Personal Cloud Storage Benchmarks and Comparison

Complexity analysis of simulations with analytic bond-order potentials

A systematic benchmark of the ab initio Bethe-Salpeter equation approach for low-lying optical excitations of small organic molecules.

Effects of vibrational averaging on coupled cluster calculations of spin–spin coupling constants for hydrocarbons