OPAL: a passe-partout for web forms
Web forms are the interfaces of the deep web. Though modern web browsers provide facilities to assist in form filling, this assistance is limited to prior form fillings or keyword matching. Automatic form understanding enables a broad range of applications, including crawlers, meta-search engines, and usability and accessibility support for enhanced web browsing.
Main Authors: | Guo, X; Kranzdorf, J; Furche, T; Grasso, G; Orsi, G; Schallhart, C |
---|---|
Format: | Conference item |
Published: | 2012 |
_version_ | 1797090846205542400 |
---|---|
author | Guo, X Kranzdorf, J Furche, T Grasso, G Orsi, G Schallhart, C |
author_facet | Guo, X Kranzdorf, J Furche, T Grasso, G Orsi, G Schallhart, C |
author_sort | Guo, X |
collection | OXFORD |
description | Web forms are the interfaces of the deep web. Though modern web browsers provide facilities to assist in form filling, this assistance is limited to prior form fillings or keyword matching. Automatic form understanding enables a broad range of applications, including crawlers, meta-search engines, and usability and accessibility support for enhanced web browsing. In this demonstration, we use a novel form understanding approach, OPAL, to assist in form filling even for complex, previously unknown forms. OPAL associates form labels with fields by analyzing structural properties in the HTML encoding and visual features of the page rendering. OPAL interprets this labeling and classifies the fields according to a given domain ontology. The combination of these two properties allows OPAL to deal effectively with many forms outside the grasp of existing form filling techniques. In the UK real estate domain, OPAL achieves more than 99 percent accuracy in form understanding. |
first_indexed | 2024-03-07T03:24:36Z |
format | Conference item |
id | oxford-uuid:b89b223f-b2af-4cf7-bb1b-7d9c8bfcad43 |
institution | University of Oxford |
last_indexed | 2024-03-07T03:24:36Z |
publishDate | 2012 |
record_format | dspace |
spelling | oxford-uuid:b89b223f-b2af-4cf7-bb1b-7d9c8bfcad432022-03-27T04:57:02ZOPAL: a passe−partout for web formsConference itemhttp://purl.org/coar/resource_type/c_5794uuid:b89b223f-b2af-4cf7-bb1b-7d9c8bfcad43Department of Computer Science2012Guo, XKranzdorf, JFurche, TGrasso, GOrsi, GSchallhart, CWeb forms are the interfaces of the deep web. Though modern web browsers provide facilities to assist in form filling, this assistance is limited to prior form fillings or keyword matching. Automatic form understanding enables a broad range of applications, including crawlers, meta-search engines, and usability and accessibility support for enhanced web browsing. In this demonstration, we use a novel form understanding approach, OPAL, to assist in form filling even for complex, previously unknown forms. OPAL associates form labels to fields by analyzing structural properties in the HTML encoding and visual features of the page rendering. OPAL interprets this labeling and classifies the fields according to a given domain ontology. The combination of these two properties, allows OPAL to deal effectively with many forms outside of the grasp of existing form filling techniques. In the UK real estate domain, OPAL achieves more than 99 percent accuracy in form understanding. |
spellingShingle | Guo, X Kranzdorf, J Furche, T Grasso, G Orsi, G Schallhart, C OPAL: a passe−partout for web forms |
title | OPAL: a passe−partout for web forms |
title_full | OPAL: a passe−partout for web forms |
title_fullStr | OPAL: a passe−partout for web forms |
title_full_unstemmed | OPAL: a passe−partout for web forms |
title_short | OPAL: a passe−partout for web forms |
title_sort | opal a passe partout for web forms |
work_keys_str_mv | AT guox opalapassepartoutforwebforms AT kranzdorfj opalapassepartoutforwebforms AT furchet opalapassepartoutforwebforms AT grassog opalapassepartoutforwebforms AT orsig opalapassepartoutforwebforms AT schallhartc opalapassepartoutforwebforms |