
Summary Statistical analyses of biological problems in life sciences often lead to high-dimensional linear models. To solve the corresponding system of equations, penalisation approaches are often the methods of choice. They are especially useful in case of multicollinearity which appears if the number of explanatory variables exceeds the number of observations or for some biological reason. Then, the model goodness of fit is penalised by some suitable function of interest. Prominent examples are the lasso, group lasso and sparse-group lasso. Here, we offer a fast and numerically cheap implementation of these operators via proximal gradient descent. The grid search for the penalty parameter is realised by warm starts. The step size between consecutive iterations is determined with backtracking line search. Finally, the package produces complete regularisation paths. Availability and implementation seagull is an R package that is freely available on the Comprehensive R Archive Network ( CRAN ; https://CRAN.R-project.org/package=seagull ; vignette included). The source code is available on https://github.com/jklosa/seagull . Contact wittenburg@fbn-dummerstorf.de
Optimization, QH301-705.5, R package, Computer applications to medicine. Medical informatics, R858-859.7, Machine Learning, High-dimensional data, Machine learning, Linear Models, Humans, Biology (General), Software, Algorithms
Optimization, QH301-705.5, R package, Computer applications to medicine. Medical informatics, R858-859.7, Machine Learning, High-dimensional data, Machine learning, Linear Models, Humans, Biology (General), Software, Algorithms
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 25 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
