‟Deep lexicography” – Fad or Opportunity?

In recent years, we are witnessing staggering improvements in various semantic data processing tasks due to the developments in the area of deep learning, ranging from image and video processing to speech processing, and natural language understanding. In this paper, we discuss the opportunities and...

Full description

Bibliographic Details
Main Author: Nikola Ljubešić
Format: Article
Language:Croatian
Published: Institut za hrvatski jezik i jezikoslovlje 2020-01-01
Series:Rasprave Instituta za Hrvatski Jezik i Jezikoslovlje
Subjects:
Online Access:https://hrcak.srce.hr/file/356606
Description
Summary:In recent years, we are witnessing staggering improvements in various semantic data processing tasks due to the developments in the area of deep learning, ranging from image and video processing to speech processing, and natural language understanding. In this paper, we discuss the opportunities and challenges that these developments pose for the area of electronic lexicography. We primarily focus on the concept of representation learning of the basic elements of language, namely words, and the applicability of these word representations to lexicography. We first discuss well-known approaches to learning static representations of words, the so-called word embeddings, and their usage in lexicography-related tasks such as semantic shift detection, and cross-lingual prediction of lexical features such as concreteness and imageability. We wrap up the paper with the most recent developments in the area of word representation learning in form of learning dynamic, context-aware representations of words, showcasing some dynamic word embedding examples, and discussing improvements on lexicography-relevant tasks of word sense disambiguation and word sense induction.
ISSN:1331-6745
1849-0379