Experiments with a hierarchical text categorizer

Domonkos Tikk, György Biró, Jae Dong Yang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

HITEC is a hierarchical text categorizer tool based on the UFEX (Universal Feature Extractor) algorithm. This paper presents experiments on the effectiveness of HITEC on several natural languages (English, German) and on various kinds of text corpora. The results show that HITEC outperforms its known competitors on the investigated corpora, and that its performance is independent of the language processed. The time and storage requirements of HITEC are moderate, so it can be run on an average PC.

Original language: English
Title of host publication: 2004 IEEE International Conference on Fuzzy Systems - Proceedings
Pages: 1191-1196
Number of pages: 6
DOIs: https://doi.org/10.1109/FUZZY.2004.1375582
Publication status: Published - Dec 1 2004
Event: 2004 IEEE International Conference on Fuzzy Systems - Proceedings - Budapest, Hungary
Duration: Jul 25 2004 to Jul 29 2004

Publication series

Name: IEEE International Conference on Fuzzy Systems
Volume: 2
ISSN (Print): 1098-7584

Other

Other: 2004 IEEE International Conference on Fuzzy Systems - Proceedings
Country: Hungary
City: Budapest
Period: 7/25/04 to 7/29/04

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Artificial Intelligence
  • Applied Mathematics

Cite this

Tikk, D., Biró, G., & Yang, J. D. (2004). Experiments with a hierarchical text categorizer. In 2004 IEEE International Conference on Fuzzy Systems - Proceedings (pp. 1191-1196). (IEEE International Conference on Fuzzy Systems; Vol. 2). https://doi.org/10.1109/FUZZY.2004.1375582