Pointwise V-usable information (PVI) excels in many #NLProc tasks. But fine-tuning #LLMs with it is very time-consuming. Is in-context PVI the necessary next step? Yes! Check out our empirical analysis accepted at #EMNLP2023 and this 🧵 (1/7)
Pointwise V-usable information (PVI) is a recently proposed metric for measuring the hardness of individual instances. It is estimated by fine-tuning supervised models. (2/🧵)
#EMNLP2023
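For readers new to the metric, here is a sketch of the original definition (notation as I recall it from Ethayarajh et al., 2022, so treat the exact symbols as an assumption): PVI compares a model fine-tuned on the real inputs with one fine-tuned on null inputs.

```latex
% PVI of an instance (x, y): the extra bits of information about
% the gold label y that the input x provides under model family V.
% g is fine-tuned with the inputs, g' on null (empty) inputs.
\mathrm{PVI}(x \to y) = -\log_2 g'[\varnothing](y) + \log_2 g[x](y)
```

A large positive PVI means the input makes the gold label much easier to predict; a negative PVI means the input points away from it.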
In our paper, we show that in-context PVI exhibits characteristics similar to the original PVI while being far more time-efficient: it requires only a few exemplars and no fine-tuning. (3/🧵)
#EMNLP2023
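A minimal sketch of the idea (names and numbers are illustrative, not our actual code): in-context PVI replaces the two fine-tuned models of the original PVI with one frozen LLM prompted twice, once with the input and once with the input withheld.

```python
import math

# Hypothetical few-shot exemplars for a sentiment task.
EXEMPLARS = [("great movie!", "positive"), ("dull and slow.", "negative")]

def build_prompt(x: str) -> str:
    """Few-shot prompt; an empty x plays the role of the null input."""
    shots = "\n".join(f"Review: {a}\nLabel: {b}" for a, b in EXEMPLARS)
    return f"{shots}\nReview: {x}\nLabel:"

def llm_label_prob(prompt: str) -> float:
    """Stand-in for a real LLM call returning the probability the
    model assigns to the gold label given `prompt`. Hard-coded here
    so the sketch runs without a model."""
    return 0.8 if "Review: \nLabel:" not in prompt else 0.5

def in_context_pvi(x: str) -> float:
    # Same formula as the original PVI, but both terms come from
    # one frozen model: no fine-tuning, just two prompts.
    p_with = llm_label_prob(build_prompt(x))
    p_without = llm_label_prob(build_prompt(""))  # input withheld
    return math.log2(p_with) - math.log2(p_without)

print(round(in_context_pvi("great acting, superb plot"), 2))  # 0.68
```

With the toy probabilities above, the input contributes about 0.68 bits of information about the gold label.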
Our findings show lower prediction accuracy for instances with low in-context PVI (see the 🟦 box) and a higher average in-context PVI for correct predictions than for incorrect ones (see the 🟥 box). This matches what we see in the original PVI estimates. (4/🧵)
#EMNLP2023
Major insight: in-context PVI estimates are more consistent across similar models (e.g., models with similar architectures or similar training data). (5/🧵)
#EMNLP2023
We also show that, as with the original PVI, the in-context PVI threshold at which instances start being predicted incorrectly is similar across datasets. However, in-context PVI estimates from smaller models are much noisier than those from larger models. (6/🧵)
Moreover, in-context PVI estimates can be used to identify mislabeled instances, a very practical feature that demonstrates the reliability of in-context PVI. (7/🧵)
#EMNLP2023
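One way to picture the mislabel-detection use case (a toy sketch with made-up scores and an assumed cutoff, not our experimental setup): very negative PVI means the input argues against the recorded label, so the lowest-PVI instances are natural candidates for a label audit.

```python
# Each tuple: (input, recorded label, in-context PVI estimate).
# The values are illustrative only.
instances = [
    ("i loved it", "negative", -2.1),        # suspiciously negative PVI
    ("total waste of time", "negative", 1.4),
    ("an instant classic", "positive", 1.9),
]

THRESHOLD = 0.0  # assumed cutoff; a real audit would tune this
suspects = [(x, y) for x, y, pvi in instances if pvi < THRESHOLD]
print(suspects)  # [('i loved it', 'negative')]
```

Flagged instances would then be sent for human re-annotation.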