Abstract

XML documents have both structural and semantic information, bringing data integration and deep utilization based on XML more precise description and versatile expression. But in the meanwhile traditional NLP and DM methods can't be applied directly. Feature dimension reduction and general similarity of XML based on tensor analysis are discussed. Considering the correlation between XML's structure and content, a tensor based model for describing XML documents and an MMI method to XML's dimension reduction is presented. Since structure and content are not independent with each other, a tensor based algorithm to calculate general similarity from a non-linear angle is designed to show their relationships and effects to its performance.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.