A document classification algorithm using the fuzzy set theory and hierarchical structure of document

Seok Woo Han, Hye Jue Eun, Yong Sung Kim, László T. Kóczy

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

In present, Information retrieval systems which are simply expressed with combination between keywords and phrase search according to the direct keyword matching method to get the information which users need. But Web documents retrieval systems serve too many documents because of term ambiguity. Also it often happens that words with several meanings occur in a document, but in a rather different context from that expected by the querying person. So the user should need extra time and effort to get more close documents. To overcome these problems, in this paper we propose an information retrieval system based on the content, which connects documents according to the degree of semantic link which it express fuzzy value by fuzzy function. Also we propose an algorithm which it produce the hierarchical structure using the degree of concepts and contents among documents. As result, we are able to select and to provide user-interested documents.

Original languageEnglish
Pages (from-to)122-133
Number of pages12
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3043
Publication statusPublished - Dec 1 2004

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'A document classification algorithm using the fuzzy set theory and hierarchical structure of document'. Together they form a unique fingerprint.

  • Cite this