Downloads provided by UsageCounts
Wikipedia, also known as "The Free Encyclopaedia”, is one of the largest online repositories of biomedical information in the world, and is nowadays increasingly been used by medical researchers and health professionals alike. In spite of its rising popularity, little attention has been devoted to the understanding of how such medical information is organised, and especially how it evolves through time. We here present an analysis aimed at characterising such evolution, with a focus on the effects that such dynamic may have on an automated knowledge extraction process. For that, we start from a data set comprising a large number of snapshots of Wikipedia’s disease articles, and the corresponding diagnostic elements as provided by the DISNET project (disnet.ctb.upm.es). We then track and analyse how different metrics evolve through time, such as the total article length or the number of medical terms and references. Results highlight some expected facts, as for instance that most articles increase their content through time; and that hot topics, as Alzheimer’s disease, attract the highest number of editions and views. On the other hand, relevant behaviours are observed for less well-known diseases, including abrupt changes in the text and the concentration of contributions in a handful of editors. These results stress the importance of using correctly filtered and up-to-date datasets, and more general of considering the temporal evolution of the information in Wikipedia.
The paper is a result of the project "DISNET (Creation and analysis of disease networks for drug repurposing from heterogeneous data sources applied to rare diseases)", that is being developed under grant "RTI2018-094576-A-I00" from the Spanish Ministerio de Ciencia, Innovación y Universidades. Gerardo Lagunes-Garcia work is supported by Mexican Consejo Nacional de Ciencia y Tecnología (CONACYT) (CVU: 340523) under the programme "291114 - BECAS CONACYT AL EXTRANJERO". Lucia Prieto Santamaría's work is supported by "Programa de fomento de la investigación y la innovación (Doctorados Industriales") from Comunidad de Madrid (grant IND2019/TIC-17159).
wikipedia disease, diagnostic knowledge, information retrieval, change knowledge, wikipedia evolution, medical content
wikipedia disease, diagnostic knowledge, information retrieval, change knowledge, wikipedia evolution, medical content
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 6 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 16 | |
| downloads | 4 |

Views provided by UsageCounts
Downloads provided by UsageCounts