OPAL: a passe−partout for web forms

Web forms are the interfaces of the deep web. Though modern web browsers provide facilities to assist in form filling, this assistance is limited to prior form fillings or keyword matching. Automatic form understanding enables a broad range of applications, including crawlers, meta-search engines, a...

Full description

Bibliographic Details
Main Authors: Guo, X, Kranzdorf, J, Furche, T, Grasso, G, Orsi, G, Schallhart, C
Format: Conference item
Published: 2012
_version_ 1797090846205542400
author Guo, X
Kranzdorf, J
Furche, T
Grasso, G
Orsi, G
Schallhart, C
author_facet Guo, X
Kranzdorf, J
Furche, T
Grasso, G
Orsi, G
Schallhart, C
author_sort Guo, X
collection OXFORD
description Web forms are the interfaces of the deep web. Though modern web browsers provide facilities to assist in form filling, this assistance is limited to prior form fillings or keyword matching. Automatic form understanding enables a broad range of applications, including crawlers, meta-search engines, and usability and accessibility support for enhanced web browsing. In this demonstration, we use a novel form understanding approach, OPAL, to assist in form filling even for complex, previously unknown forms. OPAL associates form labels to fields by analyzing structural properties in the HTML encoding and visual features of the page rendering. OPAL interprets this labeling and classifies the fields according to a given domain ontology. The combination of these two properties, allows OPAL to deal effectively with many forms outside of the grasp of existing form filling techniques. In the UK real estate domain, OPAL achieves more than 99 percent accuracy in form understanding.
first_indexed 2024-03-07T03:24:36Z
format Conference item
id oxford-uuid:b89b223f-b2af-4cf7-bb1b-7d9c8bfcad43
institution University of Oxford
last_indexed 2024-03-07T03:24:36Z
publishDate 2012
record_format dspace
spelling oxford-uuid:b89b223f-b2af-4cf7-bb1b-7d9c8bfcad432022-03-27T04:57:02ZOPAL: a passe−partout for web formsConference itemhttp://purl.org/coar/resource_type/c_5794uuid:b89b223f-b2af-4cf7-bb1b-7d9c8bfcad43Department of Computer Science2012Guo, XKranzdorf, JFurche, TGrasso, GOrsi, GSchallhart, CWeb forms are the interfaces of the deep web. Though modern web browsers provide facilities to assist in form filling, this assistance is limited to prior form fillings or keyword matching. Automatic form understanding enables a broad range of applications, including crawlers, meta-search engines, and usability and accessibility support for enhanced web browsing. In this demonstration, we use a novel form understanding approach, OPAL, to assist in form filling even for complex, previously unknown forms. OPAL associates form labels to fields by analyzing structural properties in the HTML encoding and visual features of the page rendering. OPAL interprets this labeling and classifies the fields according to a given domain ontology. The combination of these two properties, allows OPAL to deal effectively with many forms outside of the grasp of existing form filling techniques. In the UK real estate domain, OPAL achieves more than 99 percent accuracy in form understanding.
spellingShingle Guo, X
Kranzdorf, J
Furche, T
Grasso, G
Orsi, G
Schallhart, C
OPAL: a passe−partout for web forms
title OPAL: a passe−partout for web forms
title_full OPAL: a passe−partout for web forms
title_fullStr OPAL: a passe−partout for web forms
title_full_unstemmed OPAL: a passe−partout for web forms
title_short OPAL: a passe−partout for web forms
title_sort opal a passe partout for web forms
work_keys_str_mv AT guox opalapassepartoutforwebforms
AT kranzdorfj opalapassepartoutforwebforms
AT furchet opalapassepartoutforwebforms
AT grassog opalapassepartoutforwebforms
AT orsig opalapassepartoutforwebforms
AT schallhartc opalapassepartoutforwebforms