Information Extraction From Chemical Patents

The development of new chemicals or pharmaceuticals is preceded by an indepth analysis of published patents in this field. This information retrieval is a costly and time inefficient step when done by a human reader, yet it is mandatory for potential success of an investment. The goal of the resear...

Full description

Bibliographic Details
Main Authors: Sandra Bergmann, Mathilde Romberg
Format: Article
Language:English
Published: AGH University of Science and Technology Press 2012-01-01
Series:Computer Science
Online Access:http://journals.agh.edu.pl/csci/article/download/28/21
Description
Summary:The development of new chemicals or pharmaceuticals is preceded by an indepth analysis of published patents in this field. This information retrieval is a costly and time inefficient step when done by a human reader, yet it is mandatory for potential success of an investment. The goal of the research project UIMA-HPC is to automate and hence speed-up the process of knowledge mining about patents. Multi-threaded analysis engines, developed according to UIMA (Unstructured Information Management Architecture) standards, process texts and images in thousands of documents in parallel. UNICORE (UNiform Interface to COmputing Resources) workflow control structures make it possible to dynamically allocate resources for every given task to gain best cpu-time/realtime ratios in an HPC environment.
ISSN:1508-2806