Dow Using "Text Mining" Technology To Discover Knowledge

23-Aug-2001

The Dow Chemical Company Research and Development function is using cutting edge technical capabilities in its information technology laboratory to locate data from hundreds of thousands of documents on the Internet and from other sources as well. The technology is known as "text mining". The goal of the research is to discover knowledge and patterns of information that are non-recognizable, and non-retrievable using traditional data base management or search engine tools. Dow first began exploring opportunities in text mining during 1996. Dow is now working with ClearForest Technology in New York City for their latest text mining capability.

Text mining allows Dow to more fully explore complex relationships among the contents of document in textual databases. It also provides a visual interface for documentation. "The analogy we like to use is panning for gold," said Randy Collard of Dow R&D. "If you think of the Internet as a stream, you do not need or want everything that is in that stream, just the gold nuggets. Text mining allows you to find those nuggets of information very effectively. Plus it makes it possible to discover relationships that are not obvious."

Finding The Unexpected

One advantage of text mining is finding both expected and unexpected or hidden relationships. Using this capability Dow can search for new customers, technologies, business partners or marketing trends being revealed in ways previously unavailable.

How Text Mining Works

Text mining should not be confused with traditional Internet search engine tools or database management capabilities. Text mining occurs after a traditional search for documents is completed, in whatever format is used whether full text, abstracts, or indexed terms. Text mining allows for exploration of complex relationships among documents. There is a visual interface, so researchers can actually see what and where significant patterns exist.

ClearForest Technology

ClearForest Technology is a current tool of choice for Dow. While other systems offer different capabilities, there is no single tool available yet that can accommodate the needs of a company the size of Dow. An organized database is a prerequisite for information management tools to be used to their utmost. The best text mining tools are the ones that can extract previously unknown knowledge from unorganized servers. Dow is on the cutting edge in the use of this emerging field.

Other news from the department science

Most read news

More news from our other portals

Discover the latest developments in battery technology!