Automatically learning gazetteers from the deep web.
Wrapper induction faces a dilemma: To reach web scale, it requires automatically generated examples, but to produce accurate results, these examples must have the quality of human annotations. We resolve this conflict with AMBER, a system for fully automated data extraction from result pages. In con...
Autors principals: | , , , , |
---|---|
Altres autors: | |
Format: | Journal article |
Idioma: | English |
Publicat: |
ACM
2012
|