
This paper presents and discusses forms of estimation by regularized regression and model selection using the LASSO method - Least Absolute Shrinkage and Selection Operator. LASSO is recognized as one of the main supervised learning methods applied to high-dimensional econometrics, allowing work with large volumes of data and multiple correlated controls. Conceptual issues related to the consequences of high dimensionality in modern econometrics and the principle of sparsity, which underpins regularization procedures, are addressed. The study examines the main post-double selection and post-regularization models, including variations applied to instrumental variable models. A brief description of the lassopack routine package, its syntaxes, and examples of HD, HDS (High-Dimension Sparse), and IV-HDS models, with combinations involving fixed effects estimators, is also presented. Finally, the potential application of the approach in research focused on air transport is discussed, with emphasis on an empirical study on the operational efficiency of airlines and aircraft fuel consumption.
Article in Portuguese
FOS: Computer and information sciences, General Economics (econ.GN), Economics, Methodology, Systems and Control (eess.SY), General Economics, Machine Learning (cs.LG), Machine Learning, Methodology (stat.ME), FOS: Economics and business, Applications, FOS: Electrical engineering, electronic engineering, information engineering, Applications (stat.AP), Systems and Control
FOS: Computer and information sciences, General Economics (econ.GN), Economics, Methodology, Systems and Control (eess.SY), General Economics, Machine Learning (cs.LG), Machine Learning, Methodology (stat.ME), FOS: Economics and business, Applications, FOS: Electrical engineering, electronic engineering, information engineering, Applications (stat.AP), Systems and Control
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
