
Dense retrieval systems rank passages by similarity to a query, but multi-hop questions require passages linked through reasoning chains rather than surface resemblance. We introduce Association-Augmented Retrieval (AAR), a lightweight transductive method that trains a small MLP (4.2M parameters) on passage co-occurrence annotations to learn associative relationships in embedding space. On HotpotQA, AAR improves Recall@5 by +8.6 points without evaluation-set tuning, with +28.5 points on hard questions where dense retrieval fails. On MuSiQue, AAR achieves +10.1 points. An inductive variant shows no significant improvement, indicating corpus-specific co-occurrence learning. Ablations confirm that association and similarity produce opposite effects: training on similar but non-associated pairs degrades retrieval, while only genuine co-occurrence structure yields gains. The method trains in two minutes, adds 3.7ms per query, and requires no LLM-based indexing. Code and results are available at https://github.com/EridosAI/AAR.
associative memory, passage association, retrieval-augmented generation, dense passage retrieval, contrastive learning, multi-hop retrieval, embedding space, reranking, transductive learning, multi-hop question answering
associative memory, passage association, retrieval-augmented generation, dense passage retrieval, contrastive learning, multi-hop retrieval, embedding space, reranking, transductive learning, multi-hop question answering
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
