Typology emerges from simplicity in representations and learning

We derive well-understood and well-studied subregular classes of formal languages purely from the computational perspective of algorithmic learning problems. We parameterise the learning problem along dimensions of representation and inference strategy. Of special interest are those classes of langu...

Full description

Bibliographic Details
Main Authors:	Dakotah Jay Lambert, Jonathan Rawski, Jeffrey Heinz
Format:	Article
Language:	English
Published:	Institute of Computer Science, Polish Academy of Sciences 2021-08-01
Series:	Journal of Language Modelling
Subjects:	model theory subregularity grammatical inference formal language theory phonology learning complexity
Online Access:	https://jlm.ipipan.waw.pl/index.php/JLM/article/view/262

_version_	1818283243741380608
author	Dakotah Jay Lambert Jonathan Rawski Jeffrey Heinz
author_facet	Dakotah Jay Lambert Jonathan Rawski Jeffrey Heinz
author_sort	Dakotah Jay Lambert
collection	DOAJ
description	We derive well-understood and well-studied subregular classes of formal languages purely from the computational perspective of algorithmic learning problems. We parameterise the learning problem along dimensions of representation and inference strategy. Of special interest are those classes of languages whose learning algorithms are necessarily not prohibitively expensive in space and time, since learners are often exposed to adverse conditions and sparse data. Learned natural language patterns are expected to be most like the patterns in these classes, an expectation supported by previous typological and linguistic research in phonology. A second result is that the learning algorithms presented here are completely agnostic to choice of linguistic representation. In the case of the subregular classes, the results fall out from traditional model-theoretic treatments of words and strings. The same learning algorithms, however, can be applied to model-theoretic treatments of other linguistic representations such as syntactic trees or autosegmental graphs, which opens a useful direction for future research.
first_indexed	2024-12-13T00:33:49Z
format	Article
id	doaj.art-ea011a024c4b484d89a2ebfcce81a572
institution	Directory Open Access Journal
issn	2299-856X 2299-8470
language	English
last_indexed	2024-12-13T00:33:49Z
publishDate	2021-08-01
publisher	Institute of Computer Science, Polish Academy of Sciences
record_format	Article
series	Journal of Language Modelling
spelling	doaj.art-ea011a024c4b484d89a2ebfcce81a5722022-12-22T00:05:15ZengInstitute of Computer Science, Polish Academy of SciencesJournal of Language Modelling2299-856X2299-84702021-08-0191Typology emerges from simplicity in representations and learningDakotah Jay Lambert0Jonathan Rawski1Jeffrey Heinz2Department of Linguistics Institute for Advanced Computational Science Stony Brook UniversityDepartment of Linguistics, San José State UniversityDepartment of Linguistics Institute for Advanced Computational Science Stony Brook UniversityWe derive well-understood and well-studied subregular classes of formal languages purely from the computational perspective of algorithmic learning problems. We parameterise the learning problem along dimensions of representation and inference strategy. Of special interest are those classes of languages whose learning algorithms are necessarily not prohibitively expensive in space and time, since learners are often exposed to adverse conditions and sparse data. Learned natural language patterns are expected to be most like the patterns in these classes, an expectation supported by previous typological and linguistic research in phonology. A second result is that the learning algorithms presented here are completely agnostic to choice of linguistic representation. In the case of the subregular classes, the results fall out from traditional model-theoretic treatments of words and strings. The same learning algorithms, however, can be applied to model-theoretic treatments of other linguistic representations such as syntactic trees or autosegmental graphs, which opens a useful direction for future research.https://jlm.ipipan.waw.pl/index.php/JLM/article/view/262model theorysubregularitygrammatical inferenceformal language theoryphonologylearning complexity
spellingShingle	Dakotah Jay Lambert Jonathan Rawski Jeffrey Heinz Typology emerges from simplicity in representations and learning Journal of Language Modelling model theory subregularity grammatical inference formal language theory phonology learning complexity
title	Typology emerges from simplicity in representations and learning
title_full	Typology emerges from simplicity in representations and learning
title_fullStr	Typology emerges from simplicity in representations and learning
title_full_unstemmed	Typology emerges from simplicity in representations and learning
title_short	Typology emerges from simplicity in representations and learning
title_sort	typology emerges from simplicity in representations and learning
topic	model theory subregularity grammatical inference formal language theory phonology learning complexity
url	https://jlm.ipipan.waw.pl/index.php/JLM/article/view/262
work_keys_str_mv	AT dakotahjaylambert typologyemergesfromsimplicityinrepresentationsandlearning AT jonathanrawski typologyemergesfromsimplicityinrepresentationsandlearning AT jeffreyheinz typologyemergesfromsimplicityinrepresentationsandlearning

Typology emerges from simplicity in representations and learning

Similar Items