Experiments with a hierarchical text categorizer

D. Tikk, György Biró, Jae Dong Yang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

HITEC is a hierarchical text categorizer tool based on the UFEX (Universal Feature Extractor) algorithm. This paper presents experiments on the effectiveness of HITEC on several natural languages (English, German) and with various kinds of text corpora. The results show that HITEC outperforms its known competitors on the investigated corpora, and that its performance is independent of the language processed. The time and storage requirements of HITEC are modest, so it can be run on an average PC.

Original language: English
Title of host publication: IEEE International Conference on Fuzzy Systems
Pages: 1191-1196
Number of pages: 6
Volume: 2
DOI: https://doi.org/10.1109/FUZZY.2004.1375582
Publication status: Published - 2004
Event: 2004 IEEE International Conference on Fuzzy Systems - Proceedings - Budapest, Hungary
Duration: Jul 25 2004 → Jul 29 2004

Other

Other: 2004 IEEE International Conference on Fuzzy Systems - Proceedings
Country: Hungary
City: Budapest
Period: 7/25/04 → 7/29/04

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality
  • Chemical Health and Safety

Cite this

Tikk, D., Biró, G., & Yang, J. D. (2004). Experiments with a hierarchical text categorizer. In IEEE International Conference on Fuzzy Systems (Vol. 2, pp. 1191-1196) https://doi.org/10.1109/FUZZY.2004.1375582

@inproceedings{75eeb7bc395440579a9de8b51239e9d5,
title = "Experiments with a hierarchical text categorizer",
abstract = "HITEC is a hierarchical text categorizer tool based on the UFEX (Universal Feature Extractor) algorithm. This paper presents experiments on the effectiveness of HITEC on several natural languages (English, German) and with various kinds of text corpora. The results show that HITEC outperforms its known competitors on the investigated corpora, and that its performance is independent of the language processed. The time and storage requirements of HITEC are modest, so it can be run on an average PC.",
author = "D. Tikk and Gy{\"o}rgy Bir{\'o} and Yang, {Jae Dong}",
year = "2004",
doi = "10.1109/FUZZY.2004.1375582",
language = "English",
isbn = "0780383532",
volume = "2",
pages = "1191--1196",
booktitle = "IEEE International Conference on Fuzzy Systems",

}

TY - GEN

T1 - Experiments with a hierarchical text categorizer

AU - Tikk, D.

AU - Biró, György

AU - Yang, Jae Dong

PY - 2004

Y1 - 2004

AB - HITEC is a hierarchical text categorizer tool based on the UFEX (Universal Feature Extractor) algorithm. This paper presents experiments on the effectiveness of HITEC on several natural languages (English, German) and with various kinds of text corpora. The results show that HITEC outperforms its known competitors on the investigated corpora, and that its performance is independent of the language processed. The time and storage requirements of HITEC are modest, so it can be run on an average PC.

UR - http://www.scopus.com/inward/record.url?scp=11144295767&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=11144295767&partnerID=8YFLogxK

U2 - 10.1109/FUZZY.2004.1375582

DO - 10.1109/FUZZY.2004.1375582

M3 - Conference contribution

AN - SCOPUS:11144295767

SN - 0780383532

VL - 2

SP - 1191

EP - 1196

BT - IEEE International Conference on Fuzzy Systems

ER -