Abstract
Cross-matching is an indispensable operation in the data preparation, analysis, and research processes of multi-band astronomy and time-domain astronomy. Multi-catalog time-series data reconstruction is an important part of time-domain astronomy. In the large-scale distributed reconstruction process, boundary problems have always affected the accuracy of time-series data. To optimize these boundary problems and improve data precision, this paper proposes a new hybrid astronomical data indexing method called Translated Transformation based HEALPix Dual Index (TT-HEALPix). Under the reasonable Healpix division level, by translation transformation, the two indexes before and after the transformation form a unique pseudo-hybrid index strategy, which not only retains the advantages of the hybrid index scheme suitable for large-scale parallel computing, but also compensates for its shortage of high omission at the block boundary position. Based on TT-HEALPix, this paper completes the multi-catalog time-series reconstruction process on the Spark platform and compares it with the HEALPix+HTM hybrid indexing strategy. The experiments demonstrate that TT-HEALPix has significant advantages over the traditional HEALPix+HTM hybrid indexing method in terms of data accuracy and cross-matching efficiency. At level 9 of the Healpix index, TT-HEALPix achieves a 6%–19% improvement in cross-matching efficiency in a distributed environment compared to HEALPix+HTM. In terms of data accuracy, for the AST3-II dataset at level 9, TT-HEALPix has 62.2% accuracy improvement over HEALPix and 45.5% improvement over HEALPix+HTM. In conclusion, the proposed novel indexing strategy, TT-HEALPix, is better suited to the efficiency and accuracy requirements of cross-match.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Publications of the Astronomical Society of the Pacific
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.