Recent research has explored methods for updating and modifying factual knowledge in large language models, often focusing on specific multi-layer perceptron blocks. This study expands on that work by examining the effectiveness of existing knowledge-editing methods across languages and delving into the role of attention mechanisms in the editing process. Drawing on these insights, we propose Mass-Editing Memory with Attention in Transformers (MEMAT), a method that achieves significant improvements in all metrics while requiring minimal parameter modifications. MEMAT delivers a remarkable 10% increase in magnitude metrics, benefits languages not included in the training data, and demonstrates a high degree of portability. Our code and data are available at https://github.com/dtamayo-nlp/MEMAT.
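As illustrative context (not taken from the paper), the MLP-focused editors the abstract alludes to, such as ROME and MEMIT, treat an MLP projection matrix as a linear key-value memory and store a new fact via a closed-form weight update. The minimal sketch below shows the basic rank-one version of such an edit; all names (`W`, `k`, `v_target`) are hypothetical placeholders, and real editors solve a regularized, batched version of this problem rather than a single-fact update.

```python
# Minimal sketch of a ROME/MEMIT-style rank-one "key-value" weight edit.
# This is NOT the authors' MEMAT implementation; it only illustrates the
# MLP-editing idea that MEMAT builds on. All dimensions and names are made up.
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out = 16, 32

# W stands in for an MLP down-projection treated as a linear associative
# memory: it maps a "key" (subject representation) to a "value" (fact encoding).
W = rng.normal(size=(d_out, d_in))

k = rng.normal(size=d_in)          # key vector for the fact's subject
v_target = rng.normal(size=d_out)  # value vector encoding the edited fact

# Rank-one update: the minimum-Frobenius-norm change to W that enforces
# W_edited @ k == v_target.
residual = v_target - W @ k
W_edited = W + np.outer(residual, k) / (k @ k)

assert np.allclose(W_edited @ k, v_target)  # the edited fact is now stored
```

Per the abstract, MEMAT's contribution is to augment this family of edits with information from attention mechanisms; the sketch above covers only the shared MLP-editing backbone.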
FOS: Computer and information sciences, Training data, Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Recent research, Computational linguistics, Parameter modification, Language model, Artificial Intelligence (cs.AI), Factual knowledge, Multilayer perceptrons, Distribution transformers, UPC subject areas::Computer science::Artificial intelligence::Natural language, Computation and Language (cs.CL), Attention mechanisms, Cross-lingual
| Indicator | Description | Value |
| --- | --- | --- |
| Selected citations | Citations derived from selected sources; an alternative to the "Influence" indicator, which reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 |
| Popularity | The "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| Influence | The overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| Impulse | The initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| Views | Usage count provided by UsageCounts. | 34 |
| Downloads | Usage count provided by UsageCounts. | 5 |
