In-context learning learns label relationships but is not conventional learning

The predictions of Large Language Models (LLMs) on downstream tasks often improve significantly when including examples of the input–label relationship in the context. However, there is currently no consensus about how this in-context learning (ICL) ability of LLMs works. For example, while Xie et a...

সম্পূর্ণ বিবরণ

গ্রন্থ-পঞ্জীর বিবরন
প্রধান লেখক: Kossen, J, Gal, Y, Rainforth, T
বিন্যাস: Conference item
ভাষা:English
প্রকাশিত: OpenReview 2024
_version_ 1826317520592699392
author Kossen, J
Gal, Y
Rainforth, T
author_facet Kossen, J
Gal, Y
Rainforth, T
author_sort Kossen, J
collection OXFORD
description The predictions of Large Language Models (LLMs) on downstream tasks often improve significantly when including examples of the input–label relationship in the context. However, there is currently no consensus about how this in-context learning (ICL) ability of LLMs works. For example, while Xie et al. (2022) liken ICL to a general-purpose learning algorithm, Min et al. (2022b) argue ICL does not even learn label relationships from in-context examples. In this paper, we provide novel insights into how ICL leverages label information, revealing both capabilities and limitations. To ensure we obtain a comprehensive picture of ICL behavior, we study probabilistic aspects of ICL predictions and thoroughly examine the dynamics of ICL as more examples are provided. Our experiments show that ICL predictions almost always depend on in-context labels and that ICL can learn truly novel tasks in-context. However, we also find that ICL struggles to fully overcome prediction preferences acquired from pre-training data and, further, that ICL does not consider all in-context information equally.
first_indexed 2025-03-11T16:55:12Z
format Conference item
id oxford-uuid:71d42f2d-d351-4ab4-acc4-b3a1bdf749ff
institution University of Oxford
language English
last_indexed 2025-03-11T16:55:12Z
publishDate 2024
publisher OpenReview
record_format dspace
spelling oxford-uuid:71d42f2d-d351-4ab4-acc4-b3a1bdf749ff2025-02-20T12:21:09ZIn-context learning learns label relationships but is not conventional learningConference itemhttp://purl.org/coar/resource_type/c_5794uuid:71d42f2d-d351-4ab4-acc4-b3a1bdf749ffEnglishSymplectic Elements OpenReview2024Kossen, JGal, YRainforth, TThe predictions of Large Language Models (LLMs) on downstream tasks often improve significantly when including examples of the input–label relationship in the context. However, there is currently no consensus about how this in-context learning (ICL) ability of LLMs works. For example, while Xie et al. (2022) liken ICL to a general-purpose learning algorithm, Min et al. (2022b) argue ICL does not even learn label relationships from in-context examples. In this paper, we provide novel insights into how ICL leverages label information, revealing both capabilities and limitations. To ensure we obtain a comprehensive picture of ICL behavior, we study probabilistic aspects of ICL predictions and thoroughly examine the dynamics of ICL as more examples are provided. Our experiments show that ICL predictions almost always depend on in-context labels and that ICL can learn truly novel tasks in-context. However, we also find that ICL struggles to fully overcome prediction preferences acquired from pre-training data and, further, that ICL does not consider all in-context information equally.
spellingShingle Kossen, J
Gal, Y
Rainforth, T
In-context learning learns label relationships but is not conventional learning
title In-context learning learns label relationships but is not conventional learning
title_full In-context learning learns label relationships but is not conventional learning
title_fullStr In-context learning learns label relationships but is not conventional learning
title_full_unstemmed In-context learning learns label relationships but is not conventional learning
title_short In-context learning learns label relationships but is not conventional learning
title_sort in context learning learns label relationships but is not conventional learning
work_keys_str_mv AT kossenj incontextlearninglearnslabelrelationshipsbutisnotconventionallearning
AT galy incontextlearninglearnslabelrelationshipsbutisnotconventionallearning
AT rainfortht incontextlearninglearnslabelrelationshipsbutisnotconventionallearning