Similarity Search Research Articles

ABSTRACT Low surface brightness substructures around galaxies, known as tidal features, are a valuable tool in the detection of past or ongoing galaxy mergers, and their properties can answer questions about the progenitor galaxies involved in the interactions. The assembly of current tidal feature samples is primarily achieved using visual classification, making it difficult to construct large samples and draw accurate and statistically robust conclusions about the galaxy evolution process. With upcoming large optical imaging surveys such as the Vera C. Rubin Observatory’s Legacy Survey of Space and Time, predicted to observe billions of galaxies, it is imperative that we refine our methods of detecting and classifying samples of merging galaxies. This paper presents promising results from a self-supervised machine learning model, trained on data from the Ultradeep layer of the Hyper Suprime-Cam Subaru Strategic Program optical imaging survey, designed to automate the detection of tidal features. We find that self-supervised models are capable of detecting tidal features, and that our model outperforms previous automated tidal feature detection methods, including a fully supervised model. An earlier method applied to real galaxy images achieved 76 per cent completeness for 22 per cent contamination, while our model achieves considerably higher (96 per cent) completeness for the same level of contamination. We emphasize a number of advantages of self-supervised models over fully supervised models including maintaining excellent performance when using only 50 labelled examples for training, and the ability to perform similarity searches using a single example of a galaxy with tidal features.

Read full abstract

AbstractClassifying logo images is a challenging task as they contain elements such as text or shapes that can represent anything from known objects to abstract shapes. While the current state of the art for logo classification addresses the problem as a multi‐class task focusing on a single characteristic, logos can have several simultaneous labels, such as different colours. This work proposes a method that allows visually similar logos to be classified and searched from a set of data according to their shape, colour, commercial sector, semantics, general characteristics, or a combination of features selected by the user. Unlike previous approaches, the proposal employs a series of multi‐label deep neural networks specialized in specific attributes and combines the obtained features to perform the similarity search. To delve into the classification system, different existing logo topologies are compared and some of their problems are analysed, such as the incomplete labelling that trademark registration databases usually contain. The proposal is evaluated considering 76,000 logos (seven times more than previous approaches) from the European Union Trademarks dataset, which is organized hierarchically using the Vienna ontology. Overall, experimentation attains reliable quantitative and qualitative results, reducing the normalized average rank error of the state‐of‐the‐art from 0.040 to 0.018 for the Trademark Image Retrieval task. Finally, given that the semantics of logos can often be subjective, graphic design students and professionals were surveyed. Results show that the proposed methodology provides better labelling than a human expert operator, improving the label ranking average precision from 0.53 to 0.68.

Read full abstract

Similarity Search Research Articles

Articles published on Similarity Search

Graph Contrastive Multi-view Learning: A Pre-training Framework for Graph Classification

Identifying genes within pathways in unannotated genomes with PaGeSearch.

Systematic analysis of jellyfish galaxy candidates in Fornax, Antlia, and Hydra from the S-PLUS survey: a self-supervised visual identification aid

Detecting galaxy tidal features using self-supervised representation learning

Poaceascoma zborayi sp. nov. and Agrorhizomyces patris gen. et spec. nov. – Two novel dark septate endophytes colonizing wheat (Triticum aestivum) roots from a cropland in Hungary

Radio Galaxy Zoo: Leveraging latent space representations from variational autoencoder

P‐59: Deep Learning‐based Defect Map Classification and Similarity Search in Display Manufacturing

GTS: GPU-based Tree Index for Fast Similarity Search

Multi‐label logo recognition and retrieval based on weighted fusion of neural features

CASTpFold: Computed Atlas of Surface Topography of the universe of protein Folds.

Anti-inflammatory action of new hybrid N-acyl-[1,2]dithiolo-[3,4-c]quinoline-1-thione

Screening of natural epigenetic modifiers for managing glycemic memory and diabetic nephropathy

Music Information Retrieval using Deep Learning Techniques

Crossing Linguistic Barriers: Authorship Attribution in Sinhala Texts

Prison Nurseries: A Review of Maternal and Infant Rooming in Outcomes for Incarcerated Mothers

DIDS: Double Indices and Double Summarizations for Fast Similarity Search

Safety evaluation of the food enzyme cellobiose phosphorylase from the genetically modified Escherichia coli strain LE1B109-pPB130.

Few-shot learning for similarity search in 12-lead ECG with deep Siamese networks

From Text to Recommendations: How Vector Databases are Revolutionizing Personalized Content Delivery

Повышение эффективности методов подбора персонала на основе глубоких нейронных сетей

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Similarity Search Research Articles

Articles published on Similarity Search

Graph Contrastive Multi-view Learning: A Pre-training Framework for Graph Classification

Identifying genes within pathways in unannotated genomes with PaGeSearch.

Systematic analysis of jellyfish galaxy candidates in Fornax, Antlia, and Hydra from the S-PLUS survey: a self-supervised visual identification aid

Detecting galaxy tidal features using self-supervised representation learning

Poaceascoma zborayi sp. nov. and Agrorhizomyces patris gen. et spec. nov. – Two novel dark septate endophytes colonizing wheat (Triticum aestivum) roots from a cropland in Hungary

Radio Galaxy Zoo: Leveraging latent space representations from variational autoencoder

P‐59: Deep Learning‐based Defect Map Classification and Similarity Search in Display Manufacturing

GTS: GPU-based Tree Index for Fast Similarity Search

Multi‐label logo recognition and retrieval based on weighted fusion of neural features

CASTpFold: Computed Atlas of Surface Topography of the universe of protein Folds.

Anti-inflammatory action of new hybrid N-acyl-[1,2]dithiolo-[3,4-c]quinoline-1-thione

Screening of natural epigenetic modifiers for managing glycemic memory and diabetic nephropathy

Music Information Retrieval using Deep Learning Techniques

Crossing Linguistic Barriers: Authorship Attribution in Sinhala Texts

Prison Nurseries: A Review of Maternal and Infant Rooming in Outcomes for Incarcerated Mothers

DIDS: Double Indices and Double Summarizations for Fast Similarity Search

Safety evaluation of the food enzyme cellobiose phosphorylase from the genetically modified Escherichia coli strain LE1B109-pPB130.

Few-shot learning for similarity search in 12-lead ECG with deep Siamese networks

From Text to Recommendations: How Vector Databases are Revolutionizing Personalized Content Delivery

Повышение эффективности методов подбора персонала на основе глубоких нейронных сетей