Using web corpus statistics for program analysis

Chun-Hung Hsiao,Michael Cafarella,Satish Narayanasamy

doi:10.1145/2660193.2660226

Using web corpus statistics for program analysis

Chun-Hung Hsiao, Michael Cafarella + Show 1 more

Open Access

https://doi.org/10.1145/2660193.2660226

Copy DOI

Publication Date: Oct 15, 2014

Citations: 63

Affiliation: University of Michigan–Ann Arbor

#Program Analysis Tools #Bug Finding + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Several program analysis tools - such as plagiarism detection and bug finding - rely on knowing a piece of code's relative semantic importance. For example, a plagiarism detector should not bother reporting two programs that have an identical simple loop counter test, but should report programs that share more distinctive code. Traditional program analysis techniques (e.g., finding data and control dependencies) are useful, but do not say how surprising or common a line of code is. Natural language processing researchers have encountered a similar problem and addressed it using an n-gram model of text frequency, derived from statistics computed over text corpora.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.