Automatically learning gazetteers from the deep web.

Wrapper induction faces a dilemma: To reach web scale, it requires automatically generated examples, but to produce accurate results, these examples must have the quality of human annotations. We resolve this conflict with AMBER, a system for fully automated data extraction from result pages. In con...

תיאור מלא

מידע ביבליוגרפי
Main Authors: Furche, T, Grasso, G, Orsi, G, Schallhart, C, Wang, C
מחברים אחרים: Mille, A
פורמט: Journal article
שפה:English
יצא לאור: ACM 2012