Where dual-numbers forward-mode automatic differentiation (AD) pairs each scalar value with its tangent derivative, dual-numbers /reverse-mode/ AD attempts to achieve reverse AD using a similarly simple idea: by pairing each scalar value with a backpropagator function. Its correctness and efficiency on higher-order input languages have been analysed by Brunel, Mazza and Pagani, but this analysis was on a custom operational semantics for which it is unclear whether it can be implemented efficiently. We take inspiration from their use of /linear factoring/ to optimise dual-numbers reverse-mode AD to an algorithm that has the correct complexity and enjoys an efficient implementation in a standard functional language with resource-linear types, such as Haskell. Aside from the linear factoring ingredient, our optimisation steps consist of well-known ideas from the functional programming community. Furthermore, we observe a connection with classical imperative taping-based reverse AD, as well as Kmett's 'ad' Haskell library, recently analysed by Krawiec et al. We demonstrate the practical use of our technique by providing a performant implementation that differentiates most of Haskell98.

Related Organizations

Utrecht University
Netherlands

Keywords

FOS: Computer and information sciences, Computer Science - Programming Languages, source transformation, functional programming, automatic differentiation, Programming Languages (cs.PL)

7 Research products, page 1 of 1

Parallel dual-numbers reverse AD
2025IsAmongTopNSimilarDocuments
A unified representation of spatial displacements
1984IsAmongTopNSimilarDocuments
Kinematic and Dynamic Modelling of Serial Robotic Manipulators Using Dual Number Algebra
2012IsAmongTopNSimilarDocuments
Displacement Analysis of Spatial Five-Link Mechanisms Using (3×3) Matrices With Dual-Number Elements
1969IsAmongTopNSimilarDocuments
Efficient Dual-Numbers Reverse AD via Well-Known Program Transformations
2023IsAmongTopNSimilarDocuments
Artifact for Efficient Dual-Numbers Reverse AD via Well-Known Program Transformations
2022IsAmongTopNSimilarDocuments
ad-dualrev-th software on GitHub
IsRelatedTo

1the

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average