Witten et al. 2005 (†524)Witten, I.H., K. J. Don, M. Dewsnip, and V. Tablan. “Text mining in a digital library.” Journal of Digital Libraries 4:1 (August 2005).
- text mining (p. 10): Text mining is a burgeoning new field that attempts to glean meaningful information from natural-language text. It may be loosely characterized as the process of analyzing text to extract information that is useful for particular purposes. It most commonly targets text whose function is communication of factual information or opinions, and the motivation for trying to extract information from such text automatically is compelling – even if success is only partial. "Text mining" (sometimes called "text data mining") defies tight definitions but encompasses a wide range of activities: text summarization, document retrieval; document clustering; text categorization; language identification; authorship ascription; identifying phrases, phrase structures, and key phrases; extracting "entities" such as names, dates, and abbreviations; locating acronyms and their definitions; filling predefined templates with extracted information; and even learning rules from such templates. (†833)