Downloads provided by UsageCounts
Hugging Face (HF) has established itself as a crucial platform for the development and sharing of machine learning (ML) models. This repository mining study, which delves into more than 380,000 models using data gathered via the HF Hub API, aims to explore the community engagement, evolution, and maintenance around models hosted on HF, aspects that have yet to be comprehensively explored in the literature. We first examine the overall growth and popularity of HF, uncovering trends in ML domains, framework usage, authors grouping and the evolution of tags and datasets used. Through text analysis of model card descriptions, we also seek to identify prevalent themes and insights within the developer community. Our investigation further extends to the maintenance aspects of models, where we evaluate the maintenance status of ML models, classify commit messages into various categories (corrective, perfective, and adaptive), analyze the evolution across development stages of commits metrics and introduce a new classification system that estimates the maintenance status of models based on multiple attributes. This study aims to provide valuable insights about ML model maintenance and evolution that could inform future model development strategies on platforms like HF.
Accepted at the 2024 IEEE/ACM 21th International Conference on Mining Software Repositories (MSR)
FOS: Computer and information sciences, Computer Science - Machine Learning, software evolution, Maintenance, Computer Science - Artificial Intelligence, Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic, Software evolution, maintenance, Machine Learning (cs.LG), Software Engineering (cs.SE), Computer Science - Software Engineering, Artificial Intelligence (cs.AI), Repository mining, repository mining, Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació
FOS: Computer and information sciences, Computer Science - Machine Learning, software evolution, Maintenance, Computer Science - Artificial Intelligence, Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic, Software evolution, maintenance, Machine Learning (cs.LG), Software Engineering (cs.SE), Computer Science - Software Engineering, Artificial Intelligence (cs.AI), Repository mining, repository mining, Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 15 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 17 | |
| downloads | 13 |

Views provided by UsageCounts
Downloads provided by UsageCounts