Convolutional neural networks (CNNs) have been successfully applied in many computer vision applications [1], especially in image classification tasks, where most architectures have been designed manually. With the aid of skip connections and dense connections, models are becoming deeper and the filters of each layer are getting wider in order to tackle the challenge of large-scale datasets. However, convolutional layers in such large-scale models become inefficient due to redundant channels in the input feature maps. In this paper, we aim to automatically optimize the topology of the DenseNet by removing unnecessary convolutional kernels. To achieve this, we present a training pipeline that generates the network structure using a genetic algorithm. We first propose two encoding methods that represent the structure of the model as a fixed-length binary string. A three-step evolutionary process consisting of selection, crossover, and mutation is then applied to optimize the structure. We also present a pretrained weight inheritance method that greatly reduces the total time consumption of the genetic process. Experimental results demonstrate that our proposed model achieves accuracy comparable to state-of-the-art models across a wide range of image recognition and classification datasets, whilst significantly reducing the number of parameters.
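The selection, crossover, and mutation steps over fixed-length binary strings can be sketched as follows. This is a minimal illustration, not the paper's implementation: the genome length, population size, and fitness function are hypothetical stand-ins (a real fitness would train and evaluate the decoded DenseNet variant, trading accuracy against parameter count).

```python
import random

GENOME_LEN = 12   # assumed: one bit per candidate kernel/connection
POP_SIZE = 8      # assumed population size
GENERATIONS = 20
MUT_RATE = 0.05   # per-bit mutation probability

def fitness(genome):
    # Hypothetical proxy: in the paper this would be validation accuracy
    # of the decoded network, penalized by the number of active kernels.
    active = sum(genome)
    accuracy_proxy = 1.0 - abs(active - GENOME_LEN // 2) / GENOME_LEN
    return accuracy_proxy - 0.01 * active

def select(pop):
    # Tournament selection: keep the fitter of two random individuals.
    a, b = random.sample(pop, 2)
    return a if fitness(a) >= fitness(b) else b

def crossover(p1, p2):
    # Single-point crossover on the fixed-length binary string.
    point = random.randrange(1, GENOME_LEN)
    return p1[:point] + p2[point:]

def mutate(genome):
    # Flip each bit independently with a small probability.
    return [bit ^ 1 if random.random() < MUT_RATE else bit
            for bit in genome]

def evolve():
    pop = [[random.randint(0, 1) for _ in range(GENOME_LEN)]
           for _ in range(POP_SIZE)]
    for _ in range(GENERATIONS):
        pop = [mutate(crossover(select(pop), select(pop)))
               for _ in range(POP_SIZE)]
    return max(pop, key=fitness)

best = evolve()
```

Because every genome has the same length, crossover and mutation always yield valid structures, which is the practical benefit of the fixed-length encoding described above.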