
arXiv: 1712.04612
Learning customer preferences from an observed behaviour is an important topic in the marketing literature. Structural models typically model forward-looking customers or firms as utility-maximizing agents whose utility is estimated using methods of Stochastic Optimal Control. We suggest an alternative approach to study dynamic consumer demand, based on Inverse Reinforcement Learning (IRL). We develop a version of the Maximum Entropy IRL that leads to a highly tractable model formulation that amounts to low-dimensional convex optimization in the search for optimal model parameters. Using simulations of consumer demand, we show that observational noise for identical customers can be easily confused with an apparent consumer heterogeneity.
18 pages, 5 figures
FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computational Finance (q-fin.CP), Systems and Control (eess.SY), Electrical Engineering and Systems Science - Systems and Control, Machine Learning (cs.LG), Computational Engineering, Finance, and Science (cs.CE), FOS: Economics and business, Quantitative Finance - Computational Finance, Artificial Intelligence (cs.AI), FOS: Electrical engineering, electronic engineering, information engineering, Computer Science - Computational Engineering, Finance, and Science
FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computational Finance (q-fin.CP), Systems and Control (eess.SY), Electrical Engineering and Systems Science - Systems and Control, Machine Learning (cs.LG), Computational Engineering, Finance, and Science (cs.CE), FOS: Economics and business, Quantitative Finance - Computational Finance, Artificial Intelligence (cs.AI), FOS: Electrical engineering, electronic engineering, information engineering, Computer Science - Computational Engineering, Finance, and Science
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
