
Test suites assess natural language processing models' performance on specific functionalities: cases of interest involving model robustness, fairness, or particular linguistic capabilities. This paper introduces specification instructions: text descriptions specifying fine-grained, task-specific behaviors. For each functionality in a suite, we generate an instruction that describes it. We combine the specification instructions to create specification-augmented prompts, which we feed to language models pre-trained on natural instruction data. We conduct experiments to measure how optimizing for some functionalities may negatively impact functionalities not covered by the specification set. Our analyses across four tasks and models of diverse sizes and families show that smaller models struggle to follow specification instructions. However, larger models (roughly 3B parameters and above) can benefit from specifications and, surprisingly, even generalize certain desirable behaviors across functionalities.
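To make the prompting setup concrete, below is a minimal sketch of how a specification-augmented prompt might be assembled from per-functionality instructions. The functionality names, instruction wording, and prompt layout are illustrative assumptions; the paper's exact templates may differ.

```python
# Hypothetical sketch: building a specification-augmented prompt by
# prepending one instruction per test-suite functionality to the task prompt.
# Functionality names and instruction texts below are made up for illustration.

functionality_specs = {
    "negation": "If the input contains a negation, the predicted sentiment "
                "must reflect the negated meaning.",
    "temporal": "If the input contrasts a past and a present opinion, base "
                "the prediction on the present opinion.",
}

def build_prompt(task_instruction: str, input_text: str) -> str:
    """Combine the task instruction, all specification instructions, and the input."""
    spec_block = "\n".join(f"- {spec}" for spec in functionality_specs.values())
    return (
        f"{task_instruction}\n"
        f"Follow these behavioral specifications:\n{spec_block}\n\n"
        f"Input: {input_text}\n"
        f"Answer:"
    )

# Example usage with a sentiment-classification task prompt.
print(build_prompt(
    "Classify the sentiment of the input as positive or negative.",
    "I used to hate this laptop, but now I don't.",
))
```

The resulting prompt would then be passed to an instruction-tuned language model; in the paper's setup, coverage of a functionality by the specification set is what is varied to measure cross-functionality effects.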
36 pages, 8 figures. Accepted at EMNLP 2024 Findings
FOS: Computer and information sciences, Computation and Language (cs.CL), 102019 Machine Learning, 602011 Computational Linguistics
