
arXiv: 1611.05545
Stochastic gradient descent in continuous time (SGDCT) provides a computationally efficient method for the statistical learning of continuous-time models, which are widely used in science, engineering, and finance. The SGDCT algorithm follows a (noisy) descent direction along a continuous stream of data. SGDCT performs an online parameter update in continuous time, with the parameter updates $\theta_t$ satisfying a stochastic differential equation. We prove that $\lim_{t \rightarrow \infty} \nabla \bar g(\theta_t) = 0$ where $\bar g$ is a natural objective function for the estimation of the continuous-time dynamics. The convergence proof leverages ergodicity by using an appropriate Poisson equation to help describe the evolution of the parameters for large times. SGDCT can also be used to solve continuous-time optimization problems, such as American options. For certain continuous-time problems, SGDCT has some promising advantages compared to a traditional stochastic gradient descent algorithm. As an example application, SGDCT is combined with a deep neural network to price high-dimensional American options (up to 100 dimensions).
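The abstract's core idea, a parameter estimate $\theta_t$ that follows a noisy descent direction driven by a continuous data stream, can be illustrated with a simple discretization. Below is a minimal sketch, assuming a one-dimensional Ornstein–Uhlenbeck model and an Euler–Maruyama discretization of an SGDCT-style update; the model, learning-rate schedule, and step sizes are illustrative choices and not the paper's exact construction.

```python
import numpy as np

# Minimal sketch (illustrative assumptions, not the paper's exact setup):
# learn the mean-reversion rate theta of a one-dimensional
# Ornstein-Uhlenbeck process
#     dX_t = theta_true * (mu - X_t) dt + sigma dW_t
# from a single continuous stream of observations, using an
# Euler-Maruyama discretization of an SGDCT-style update
#     dtheta_t = alpha_t * grad_theta f(X_t; theta_t) * (dX_t - f(X_t; theta_t) dt).

np.random.seed(0)

theta_true, mu, sigma = 2.0, 1.0, 0.5   # data-generating parameters (assumed)
dt, n_steps = 2e-3, 500_000             # time step and horizon (t up to 1000)

def drift(x, theta):
    """Parametric drift f(x; theta) of the model being fit."""
    return theta * (mu - x)

def drift_grad_theta(x, theta):
    """Gradient of the drift with respect to the parameter theta."""
    return mu - x

x = 0.0       # current state of the observed process X_t
theta = 0.0   # online parameter estimate theta_t

for k in range(n_steps):
    t = k * dt
    alpha = 1.0 / (1.0 + 0.01 * t)                 # decaying learning rate alpha_t
    dW = np.sqrt(dt) * np.random.randn()
    dx = drift(x, theta_true) * dt + sigma * dW    # observed increment dX_t

    # Noisy descent step: the residual dX_t - f(X_t; theta_t) dt plays the
    # role of the stochastic gradient signal in continuous time.
    theta += alpha * drift_grad_theta(x, theta) * (dx - drift(x, theta) * dt)
    x += dx

print(f"online estimate of theta after t = {n_steps * dt:.0f}: {theta:.3f} "
      f"(true value {theta_true})")
```

Because the Ornstein–Uhlenbeck process is ergodic and the learning rate decays, the online estimate settles near the true mean-reversion rate, mirroring the abstract's limit statement $\nabla \bar g(\theta_t) \rightarrow 0$ in a simple parametric case.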
FOS: Computer and information sciences, Probability (math.PR), Learning and adaptive systems in artificial intelligence, deep learning, Mathematics - Statistics Theory, Machine Learning (stat.ML), Martingales with continuous parameter, Statistics Theory (math.ST), stochastic differential equations, Stochastic ordinary differential equations (aspects of stochastic analysis), Neural nets and related approaches to inference from stochastic processes, statistical learning, machine learning, Derivative securities (option pricing, hedging, etc.), Statistics - Machine Learning, Optimization and Control (math.OC), stochastic gradient descent, FOS: Mathematics, American options, Mathematics - Optimization and Control, Mathematics - Probability
| indicator | description | value |
| --- | --- | --- |
| selected citations | Derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 36 |
| popularity | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% |
| influence | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% |
| impulse | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
