Abstract

For select domains and datasets, duplicates may be, in part or in whole, instances of cheating. We may specifically observe this for Sony's PlayStation Network (PSN) that services the world's most popular gaming platform. The key to cheat detection in like domains is the ability to perform temporal deduplication. Temporal data is increasingly prevalent and is not well suited to traditional similarity and distance-based deduplication techniques. We strengthen the well-established Adaptive Sorted Neighborhood Method (ASNM) with an approach for temporal data domains ( <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\text{ASNM}+\text{LCS}$</tex> ) that applies ASNM, infers attribute metadata, and further detects duplicates through inference of temporal ordering requirements using Longest Common Subsequence (LCS) for records of a shared type. Using LCS, we split each record's temporal sequence into constrained and unconstrained sequences. We flag suspicious (errant) records that are non-adherent to the inferred constrained order and we flag a record as a duplicate if its unconstrained order, of sufficient length, matches that of another record. ASNM and <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\text{ASNM}+\text{LCS}$</tex> were evaluated against a labeled dataset of 22,794 records from PSN trophy data where duplication may be indicative of cheating. <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\text{ASNM}+\text{LCS}$</tex> F1 scores outperformed ASNM at every similarity threshold with at least 32% improvement. ASNM's best performance was an F1 of. 708 at the 0.99 threshold; <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\text{ASNM}+\text{LCS}$</tex> yielded an F1 of. 938. The significant performance improvement costs little overhead as <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\text{ASNM}+\text{LCS}$</tex> averaged only 3.79% additional runtime.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.