Downloads provided by UsageCounts
The dataset is gathered on Sep. 17th 2020 from GitHub. It has clean and complete versions (from v0.7): The clean version has 5.1K type-checked Python repositories and 1.2M type annotations. The complete version has 5.2K Python repositories and 3.3M type annotations. The dataset's source files are type-checked using mypy (clean version). The dataset is also de-duplicated using the CD4Py tool. Check out the README.MD file for the description of the dataset. Notable changes to each version of the dataset are documented in CHANGELOG.md. The dataset's scripts and utilities are available on its GitHub repository.
{"references": ["A. Mir, E. Latoskinas and G. Gousios, \"ManyTypes4Py: A Benchmark Python Dataset for Machine Learning-Based Type Inference,\" in 2021 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR), 2021 pp. 585-589. doi: 10.1109/MSR52588.2021.00079"]}
Evolutionary Biology, type inference, ManyTypes4Py, python, dataset, type inference, machine learning, large, type annotations, open source, GitHub, type prediction, type hints, type-checked, clean, Science Policy, Information Systems not elsewhere classified, Marine Biology, ManyTypes4Py, type prediction, ManyTypes4Py, python, dataset, type inference, machine learning, large, type annotations, open source, GitHub, type prediction, python, GitHub, ManyTypes4Py, python, dataset, type inference, machine learning, large, type annotations, open source, GitHub, type prediction, type hints, Infectious Diseases, machine learning, open source, Sociology, Genetics, dataset, type annotations, Biological Sciences not elsewhere classified, large
Evolutionary Biology, type inference, ManyTypes4Py, python, dataset, type inference, machine learning, large, type annotations, open source, GitHub, type prediction, type hints, type-checked, clean, Science Policy, Information Systems not elsewhere classified, Marine Biology, ManyTypes4Py, type prediction, ManyTypes4Py, python, dataset, type inference, machine learning, large, type annotations, open source, GitHub, type prediction, python, GitHub, ManyTypes4Py, python, dataset, type inference, machine learning, large, type annotations, open source, GitHub, type prediction, type hints, Infectious Diseases, machine learning, open source, Sociology, Genetics, dataset, type annotations, Biological Sciences not elsewhere classified, large
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 95 | |
| downloads | 79 |

Views provided by UsageCounts
Downloads provided by UsageCounts