Past events ICDM´2001


IBaI Institute | Home | Past events | ICDM´2001 | Abstracts

Program past events


1st Industrial Conference on Data Mining ICDM´2001
24. 07. - 25. 07. 2001 Leipzig/Germany

Technology of Text Mining

Prof. Ari Visa, University of Tampere/Finnland

A large amount of information is stored in databases, in intranets or in Internet. This information is organised in documents or in text documents. The difference depends on the fact if pictures, tables, figures, and formulas are included or not. The common problem is to find the desired piece of information from these sources. The problem is not a new one. Traditionally the problem has been considered under the title of information retrieval, this means the science how to find a book in the library. Traditionally the problem has been solved either by classifying and accessing documents by Dewey Decimal Classification system or by giving a number of characteristic keywords. The problem is that nowadays there are lots of unclassified documents in company databases and in intranet or in Internet.

First one should define some terms. Text filtering means an information seeking process in which documents are selected from a dynamic text stream. Text mining is a process of analysing text to extract information from it for particular purposes. Text categorisation means the process of clustering similar documents from a large document set. All these terms have a certain degree of overlapping.

Text mining, also know as document information mining, text data mining, or knowledge discovery in textual databases is an merging technology for analysing large collections of unstructured documents for the purposes of extracting interesting and non-trivial patterns or knowledge. Typical subproblems that have been solved are language identification, feature selection/extraction, clustering, natural language processing, summarisation, categorisation, search, indexing, and visualisation. These subproblems are discussed in detail and the most common approaches are given.

Finally some examples of current uses of text mining are given and some potential application areas are mentioned.

Print

 

mission | ICDM´2009 | past events | publications | tutorial days | contact

Advances in Data Mining

Special Issue
Appeared: 2006

Advances in Data Mining

IBaI Publishing
ISSN: 1865-6781

Advances in Data Mining

Springer Verlag
ISBN: 978-3540707172

Data Mining and Multimedia Data

Springer Verlag
ISBN: 3-540-00317-7

DecisionMaster

DecisionMaster

Logfile-Analysetool NetLog

Logfile-Analysetool NetLog