A manually annotated corpus of pharmaceutical patents

Márton Kiss, Ágoston Nagy, Veronika Vincze, Attila Almási, Zoltán Alexin, János Csirik

Research output: Conference contribution

2 Citations (Scopus)

Abstract

The language of patent claims differs from ordinary language to a great extent, which results in the fact that tools especially adapted to patent language are needed in patent processing. In order to evaluate these tools, manually annotated patent corpora are necessary. Thus, we constructed a corpus of English language pharmaceutical patents belonging to the class A61K, on which several layers of manual annotation (such as named entities, keys, NucleusNPs, quantitative expressions, heads and complements, perdurants) were carried out and on which tools for patent processing can be evaluated.

Original languageEnglish
Title of host publicationText, Speech and Dialogue - 15th International Conference, TSD 2012, Proceedings
Pages135-142
Number of pages8
DOIs
Publication statusPublished - szept. 3 2012
Event15th International Conference on Text, Speech and Dialogue, TSD 2012 - Brno, Czech Republic
Duration: szept. 3 2012szept. 7 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7499 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other15th International Conference on Text, Speech and Dialogue, TSD 2012
CountryCzech Republic
CityBrno
Period9/3/129/7/12

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'A manually annotated corpus of pharmaceutical patents'. Together they form a unique fingerprint.

  • Cite this

    Kiss, M., Nagy, Á., Vincze, V., Almási, A., Alexin, Z., & Csirik, J. (2012). A manually annotated corpus of pharmaceutical patents. In Text, Speech and Dialogue - 15th International Conference, TSD 2012, Proceedings (pp. 135-142). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7499 LNAI). https://doi.org/10.1007/978-3-642-32790-2_16