Abstract
In big data era, the special data with rare characteristics may be of great significations. However, it is very difficult to automatically search these samples from the massive and high-dimensional datasets and systematically evaluate them. The DoPS, our previous work [2], provided a search method of rare spectra with double-peaked profiles from massive and high-dimensional data of LAMOST survey. The identification of the results is mainly depended on visually inspection by astronomers. In this paper, as a follow-up study, a new lattice structure named SVM-Lattice is designed based on SVM(Support Vector Machine) and FCL(Formal Concept Lattice) and particularly applied in the recognition and evaluation of rare spectra with double-peaked profiles. First, each node in the SVM-Lattice structure contains two components: the intents are defined by the support vectors trained by the spectral samples with the specific characteristics, and the relevant extents are all the positive samples classified by the support vectors. The hyperplanes can be extracted from every lattice node and used as classifiers to search targets by categories. A generalization and specialization relationship is expressed between the layers, and higher layers indicate higher confidence of targets. Then, including a SVM-Lattice building algorithm, a pruning algorithm based on association rules, and an evaluation algorithm, the supporting algorithms are provided and analysed. Finally, for the recognition and evaluation of spectra with double-peaked profiles, several data sets from LAMOST survey are used as experimental dataset. The results exhibit good consistency with traditional methods, more detailed and accurate evaluations of classification results, and higher searching efficiency than other similar methods.
Highlights
In the context of massive and high-dimensional data, the research regarding special data with particular characteristics is becoming increasingly difficult [2]
Motivation 3: A classification method DoPS [1] based on the SVM is a useful classifier, which serves as a hyperplane trained by the known samples with double-peaked profiles
SUMMARY In this paper, we propose a novel method, SVM-Lattice, which is based on the DoPS and formal concept lattice, to perform a systematic evaluation of the classification results for special data with rare characteristics
Summary
In the context of massive and high-dimensional data, the research regarding special data with particular characteristics is becoming increasingly difficult [2]. The classification model is built using the support vectors trained from the labelled templates as thresholds It is suitable for recognition of the double-peaked profiles. We propose a recognition and evaluation method named SVM-Lattice for classification of data with rare characteristics, based on. The proposed method addresses the problem of classification when detecting special data with rare characteristics. The double-peaked profiles search algorithm is proposed by using the support vectors trained from the labelled samples as thresholds. Motivation 3: A classification method DoPS [1] based on the SVM is a useful classifier, which serves as a hyperplane trained by the known samples with double-peaked profiles. The proposed algorithms and techniques are integrated in SVM-Lattice, which is tested on the recognition and evaluation of double-profile spectra using several datasets from the LAMOST survey
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.