Abstract

In big data era, the special data with rare characteristics may be of great significations. However, it is very difficult to automatically search these samples from the massive and high-dimensional datasets and systematically evaluate them. The DoPS, our previous work [2], provided a search method of rare spectra with double-peaked profiles from massive and high-dimensional data of LAMOST survey. The identification of the results is mainly depended on visually inspection by astronomers. In this paper, as a follow-up study, a new lattice structure named SVM-Lattice is designed based on SVM(Support Vector Machine) and FCL(Formal Concept Lattice) and particularly applied in the recognition and evaluation of rare spectra with double-peaked profiles. First, each node in the SVM-Lattice structure contains two components: the intents are defined by the support vectors trained by the spectral samples with the specific characteristics, and the relevant extents are all the positive samples classified by the support vectors. The hyperplanes can be extracted from every lattice node and used as classifiers to search targets by categories. A generalization and specialization relationship is expressed between the layers, and higher layers indicate higher confidence of targets. Then, including a SVM-Lattice building algorithm, a pruning algorithm based on association rules, and an evaluation algorithm, the supporting algorithms are provided and analysed. Finally, for the recognition and evaluation of spectra with double-peaked profiles, several data sets from LAMOST survey are used as experimental dataset. The results exhibit good consistency with traditional methods, more detailed and accurate evaluations of classification results, and higher searching efficiency than other similar methods.

Highlights

  • In the context of massive and high-dimensional data, the research regarding special data with particular characteristics is becoming increasingly difficult [2]

  • Motivation 3: A classification method DoPS [1] based on the SVM is a useful classifier, which serves as a hyperplane trained by the known samples with double-peaked profiles

  • SUMMARY In this paper, we propose a novel method, SVM-Lattice, which is based on the DoPS and formal concept lattice, to perform a systematic evaluation of the classification results for special data with rare characteristics

Read more

Summary

INTRODUCTION

In the context of massive and high-dimensional data, the research regarding special data with particular characteristics is becoming increasingly difficult [2]. The classification model is built using the support vectors trained from the labelled templates as thresholds It is suitable for recognition of the double-peaked profiles. We propose a recognition and evaluation method named SVM-Lattice for classification of data with rare characteristics, based on. The proposed method addresses the problem of classification when detecting special data with rare characteristics. The double-peaked profiles search algorithm is proposed by using the support vectors trained from the labelled samples as thresholds. Motivation 3: A classification method DoPS [1] based on the SVM is a useful classifier, which serves as a hyperplane trained by the known samples with double-peaked profiles. The proposed algorithms and techniques are integrated in SVM-Lattice, which is tested on the recognition and evaluation of double-profile spectra using several datasets from the LAMOST survey

ROADMAP The remaining content in this paper is arranged as follows
RELATED WORKS
DoPS METHOD
Findings
EVALUATION RESULT

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.