Downloads provided by UsageCounts
We present three defect rediscovery datasets mined from Bugzilla. The datasets capture data for three groups of open source software projects: Apache, Eclipse, and KDE. The datasets contain information about approximately 914 thousands of defect reports over a period of 18 years (1999-2017) to capture the inter-relationships among duplicate defects. File Descriptions apache.csv - Apache Defect Rediscovery dataset eclipse.csv - Eclipse Defect Rediscovery dataset kde.csv - KDE Defect Rediscovery dataset apache.relations.csv - Inter-relations of rediscovered defects of Apache eclipse.relations.csv - Inter-relations of rediscovered defects of Eclipse kde.relations.csv - Inter-relations of rediscovered defects of KDE create_and_populate_neo4j_objects.cypher - Populates Neo4j graphDB by importing all the data from the CSV files. Note that you have to set dbms.import.csv.legacy_quote_escaping configuration setting to false to load the CSV files as per https://neo4j.com/docs/operations-manual/current/reference/configuration-settings/#config_dbms.import.csv.legacy_quote_escaping create_and_populate_mysql_objects.sql - Populates MySQL RDBMS by importing all the data from the CSV files rediscovery_db_mysql.zip - For your convenience, we also provide full backup of the MySQL database neo4j_examples.txt - Sample Neo4j queries mysql_examples.txt - Sample MySQL queries rediscovery_eclipse_6325.png - Output of Neo4j example #1 distinct_attrs.csv - Distinct values of bug_status, resolution, priority, severity for each project
Rediscovery, Duplicate Reports, Software Engineering, Defects
Rediscovery, Duplicate Reports, Software Engineering, Defects
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 55 | |
| downloads | 110 |

Views provided by UsageCounts
Downloads provided by UsageCounts