The Szeged Treebank

Dóra Csendes, János Csirik, Tibor Gyimóthy, András Kocsor

Research output: Conference contribution

30 Citations (Scopus)

Abstract

The major aim of the Szeged Treebank project was to create a high-quality database of syntactic structures for Hungarian that can serve as a gold standard for further research in linguistics and computational language processing. The treebank currently contains full syntactic parses of about 82,000 sentences, produced by accurate manual annotation. This paper describes the linguistic theory as well as the actual method used in the annotation process. In addition, the application of the treebank to the training of automated syntactic parsers is presented.

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages: 123-131
Number of pages: 9
Publication status: Published - Dec. 1, 2005
Event: 8th International Conference on Text, Speech and Dialogue, TSD 2005 - Karlovy Vary, Czech Republic
Duration: Sept. 12, 2005 to Sept. 15, 2005

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 3658 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 8th International Conference on Text, Speech and Dialogue, TSD 2005
Country: Czech Republic
City: Karlovy Vary
Period: 9/12/05 to 9/15/05

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

  • Cite this

    Csendes, D., Csirik, J., Gyimóthy, T., & Kocsor, A. (2005). The Szeged Treebank. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 123-131). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3658 LNAI).