Abstract

AbstractAt CLEF 2009 the University of Hildesheim submitted experiments for the new Intellectual Property Track. We focused on the main task of this track that aims at finding prior art for a specified patent. Our experiments were split up into one official German run as well as different additional runs using English and German terms. The submitted run was based on a simple baseline approach including stopword elimination, stemming and simple term queries. Furthermore, we investigated the significance of the International Patent Classification (IPC). During the experiments, different parts of a patent were used to construct the queries. In a first stage, only title and claims were included. In contrast, for the post runs we generated a more complex boolean query, which combined terms of the title, claims, description and the IPC classes. The results made clear that using the IPC codes can particularly increase the recall of a patent retrieval system.KeywordsMean Average PrecisionTest CollectionPatent DocumentPatent NumberBoolean QueryThese keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.