Volume : 1, Issue : 3, August - 2012

A Text Mining Framework: Promises And Challenges

Sangeeta Mahesh Borde, Bareen Kayyum Shaikh

Abstract :

Text mining is a burgeoning new field that attempts to glean meaningful information from natural language text. It may be loosely characterized as the process of analyzing text to extract information that is useful for particular purposes. Compared with the kind of data stored in databases, text is unstructured, amorphous, and difficult to deal with algorithmically. Nevertheless, in modern culture, text is the most common vehicle for the formal exchange of information. The field of text mining usually deals with texts whose function is the communication of factual information or opinions, and the motivation for trying to extract information from such text automatically is compelling—even if success is only partial. Text mining, also known as knowledge discovery from text, and document information mining, refers to the process of extracting interesting patterns from very large text corpus for the purposes of discovering knowledge. Text mining is an interdisciplinary field involving information retrieval, text understanding, information extraction, clustering, categorization, visualization, database technology, machine learning, and data mining. Regarded by many as the next wave of knowledge discovery, text mining has a very high commercial value. This talk presents a general framework for text mining, consisting of two stages: text refining that transforms unstructured text documents into an intermediate form; and knowledge distillation that deduces patterns or knowledge from the intermediate form. In conclusion, we highlight the upcoming challenges of text mining and the opportunities it offers

Keywords :


Cite This Article:

Sangeeta Mahesh Borde, Bareen Kayyum Shaikh A Text Mining Framework: Promises And Challenges Global Journal For Research Analysis, Vol: 1, Issue: 3 August 2012


Article No. : 1


Number of Downloads : 1


References :