Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding

Most of existing studies on parsing natural language (NL) for constructing structured query language (SQL) do not consider the complex structure of database schema and the gap between NL and SQL query. In this paper, we propose a schema-aware neural network with decomposing architecture, namely HSRN...

Full description

Bibliographic Details
Main Authors: Meina Song, Zecheng Zhan, Haihong E.
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8777144/
_version_ 1819133513622880256
author Meina Song
Zecheng Zhan
Haihong E.
author_facet Meina Song
Zecheng Zhan
Haihong E.
author_sort Meina Song
collection DOAJ
description Most of existing studies on parsing natural language (NL) for constructing structured query language (SQL) do not consider the complex structure of database schema and the gap between NL and SQL query. In this paper, we propose a schema-aware neural network with decomposing architecture, namely HSRNet, which aims to address the complex and cross-domain Text-to-SQL generation task. The HSRNet models the relationship of the database schema with a hierarchical schema graph and employs a graph network to encode the information into sentence representation. Instead of end-to-end generation, the HSRNet decomposes the generation process into three phases. Given an input question and schema, we first choose the column candidates and generate the sketch grammar of the SQL query. Then, a detail completion module fills the details based on the column candidates and the corresponding sketch. We demonstrate the effectiveness of our hierarchical schema representation by incorporating the information into different baselines. We further show that the decomposing architecture significantly improves the performance of our model. Evaluation of Spider benchmark shows that the hierarchical schema representation and decomposing architecture improves our parser result by 14.5% and 4.3% respectively.
first_indexed 2024-12-22T09:48:30Z
format Article
id doaj.art-351a7c83480b4810a3870e1e06b8d063
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-22T09:48:30Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-351a7c83480b4810a3870e1e06b8d0632022-12-21T18:30:28ZengIEEEIEEE Access2169-35362019-01-01710370610371510.1109/ACCESS.2019.29314648777144Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing DecodingMeina Song0Zecheng Zhan1https://orcid.org/0000-0003-0685-7825Haihong E.2School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Computer Science, Beijing University of Posts and Telecommunications, Beijing, ChinaSch. of Comput. Sci., Beijing Univ. of Posts & Telecommun., Beijing, ChinaMost of existing studies on parsing natural language (NL) for constructing structured query language (SQL) do not consider the complex structure of database schema and the gap between NL and SQL query. In this paper, we propose a schema-aware neural network with decomposing architecture, namely HSRNet, which aims to address the complex and cross-domain Text-to-SQL generation task. The HSRNet models the relationship of the database schema with a hierarchical schema graph and employs a graph network to encode the information into sentence representation. Instead of end-to-end generation, the HSRNet decomposes the generation process into three phases. Given an input question and schema, we first choose the column candidates and generate the sketch grammar of the SQL query. Then, a detail completion module fills the details based on the column candidates and the corresponding sketch. We demonstrate the effectiveness of our hierarchical schema representation by incorporating the information into different baselines. We further show that the decomposing architecture significantly improves the performance of our model. Evaluation of Spider benchmark shows that the hierarchical schema representation and decomposing architecture improves our parser result by 14.5% and 4.3% respectively.https://ieeexplore.ieee.org/document/8777144/Semantic parsingSQL generationdeep learningneural networkgraph encodernatural language process
spellingShingle Meina Song
Zecheng Zhan
Haihong E.
Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding
IEEE Access
Semantic parsing
SQL generation
deep learning
neural network
graph encoder
natural language process
title Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding
title_full Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding
title_fullStr Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding
title_full_unstemmed Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding
title_short Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding
title_sort hierarchical schema representation for text to sql parsing with decomposing decoding
topic Semantic parsing
SQL generation
deep learning
neural network
graph encoder
natural language process
url https://ieeexplore.ieee.org/document/8777144/
work_keys_str_mv AT meinasong hierarchicalschemarepresentationfortexttosqlparsingwithdecomposingdecoding
AT zechengzhan hierarchicalschemarepresentationfortexttosqlparsingwithdecomposingdecoding
AT haihonge hierarchicalschemarepresentationfortexttosqlparsingwithdecomposingdecoding