Depth-first Research Articles

Molecular search is important in chemistry, biology, and informatics for identifying molecular structures within large data sets, improving knowledge discovery and innovation, and making chemical data FAIR (findable, accessible, interoperable, reusable). Search algorithms for polymers are significantly less developed than those for small molecules because polymer search relies on searching by polymer name, which can be challenging because polymer naming is overly broad (i.e., polyethylene), complicated for complex chemical structures, and often does not correspond to official IUPAC conventions. Chemical structure search in polymers is limited to substructures, such as monomers, without awareness of connectivity or topology. This work introduces a novel query language and graph traversal search algorithm for polymers that provides the first search method able to fully capture all of the chemical structures present in polymers. The BigSMARTS query language, an extension of the small-molecule SMARTS language, allows users to write queries that localize monomer and functional group searches to different parts of the polymer, like the middle block of a triblock, the side chain of a graft, and the backbone of a repeat unit. The substructure search algorithm is based on the traversal of graph representations of the generating functions for the stochastic graphs of polymers. Operationally, the algorithm first identifies cycles representing the monomers and then the end groups and finally performs a depth-first search to match entire subgraphs. To validate the algorithm, hundreds of queries were searched against hundreds of target chemistries and topologies from the literature, with approximately 440,000 query-target pairs. This tool provides a detailed algorithm that can be implemented in search engines to provide search results with full matching of the monomer connectivity and polymer topology.

Read full abstract

The aim of sequential pattern mining (SPM) is to discover potentially useful information from a given sequence. Although various SPM methods have been investigated, most of these focus on mining all of the patterns. However, users sometimes want to mine patterns with the same specific prefix pattern, called co-occurrence pattern. Since sequential rule mining can make better use of the results of SPM, and obtain better recommendation performance, this paper addresses the issue of maximal co-occurrence nonoverlapping sequential rule (MCoR) mining and proposes the MCoR-Miner algorithm. To improve the efficiency of support calculation, MCoR-Miner employs depth-first search and backtracking strategies equipped with an indexing mechanism to avoid the use of sequential searching. To obviate useless support calculations for some sequences, MCoR-Miner adopts a filtering strategy to prune the sequences without the prefix pattern. To reduce the number of candidate patterns, MCoR-Miner applies the frequent item and binomial enumeration tree strategies. To avoid searching for the maximal rules through brute force, MCoR-Miner uses a screening strategy. To validate the performance of MCoR-Miner, eleven competitive algorithms were conducted on eight sequences. Our experimental results showed that MCoR-Miner outperformed other competitive algorithms, and yielded better recommendation performance than frequent co-occurrence pattern mining. All algorithms and datasets can be downloaded from <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/wuc567/Pattern-Mining/tree/master/MCoR-Miner</uri> .

Read full abstract

Depth-first Research Articles

Related Topics

Articles published on Depth-first

Multi-Path Routing Algorithm Based on Deep Reinforcement Learning for SDN

Enhancing Privacy in Graph Algorithms: Data-Oblivious Approaches to DFS and Dijkstra's Algorithm

Battery-powered automated guided vehicles scheduling problem in automated container terminals for minimizing energy consumption

Voxel-based variable width continuous spiral path planning for 3D printing

BigSMARTS: A Topologically Aware Query Language and Substructure Search Algorithm for Polymer Chemical Structures.

Finding strong components using depth-first search

An optimal pruned traversal tree-based fast minimum cut solver in dense graph

Artificial Intelligence-Based Chatbot to Support Public Health Services in Indonesia

A few words about maps

Research on the Co-simulation System of Subway Control Circuits

Cooperative Merging Strategy in Mixed Traffic Based on Optimal Final-State Phase Diagram With Flexible Highway Merging Points

Community Detection On Multi-layer Graph using Intra-layer and Inter-layer Linkage Graphs (CDMIILG)

Competitive network restructuring with spatially loyal customers. A bilevel facility delocation problem

Offshore Electrical-Oil Production Coupling System Reliability Analysis

MCoR-Miner: Maximal Co-Occurrence Nonoverlapping Sequential Rule Mining

Parallelizing Depth-First Search for Pathway Finding: A Comprehensive Investigation

Unauthorized Access Detection for Network Device Firmware WEB Pages

Deep Reinforcement Learning for Intelligent Penetration Testing Path Design

Key deviation source diagnosis of complex thin-walled structures based on complex networks and weighted transfer entropy

Techniques for accelerating branch-and-bound algorithms dedicated to sparse optimization

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Depth-first Research Articles

Related Topics

Articles published on Depth-first

Multi-Path Routing Algorithm Based on Deep Reinforcement Learning for SDN

Enhancing Privacy in Graph Algorithms: Data-Oblivious Approaches to DFS and Dijkstra's Algorithm

Battery-powered automated guided vehicles scheduling problem in automated container terminals for minimizing energy consumption

Voxel-based variable width continuous spiral path planning for 3D printing

BigSMARTS: A Topologically Aware Query Language and Substructure Search Algorithm for Polymer Chemical Structures.

Finding strong components using depth-first search

An optimal pruned traversal tree-based fast minimum cut solver in dense graph

Artificial Intelligence-Based Chatbot to Support Public Health Services in Indonesia

A few words about maps

Research on the Co-simulation System of Subway Control Circuits

Cooperative Merging Strategy in Mixed Traffic Based on Optimal Final-State Phase Diagram With Flexible Highway Merging Points

Community Detection On Multi-layer Graph using Intra-layer and Inter-layer Linkage Graphs (CDMIILG)

Competitive network restructuring with spatially loyal customers. A bilevel facility delocation problem

Offshore Electrical-Oil Production Coupling System Reliability Analysis

MCoR-Miner: Maximal Co-Occurrence Nonoverlapping Sequential Rule Mining

Parallelizing Depth-First Search for Pathway Finding: A Comprehensive Investigation

Unauthorized Access Detection for Network Device Firmware WEB Pages

Deep Reinforcement Learning for Intelligent Penetration Testing Path Design

Key deviation source diagnosis of complex thin-walled structures based on complex networks and weighted transfer entropy

Techniques for accelerating branch-and-bound algorithms dedicated to sparse optimization