Views provided by UsageCounts
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>Replication Package for the tool "Makar: A Framework for Multi-source Studies based on Unstructured Data" # RP-Makar-tool Replication Package for the tool "Makar: A Framework for Multi-source Studies based on Unstructured Data" ## Structure ``` Makar_tool/ Data/ stackoverfow_questions_with_answers_by_tags.csv stackoverfow_tags_metrics.csv apache_mailing_list.csv mailing_lists_ASF_@dev_@users_1.csv mailing_lists_ASF_@dev_@users_2.csv quora.csv sample_stackoverfow_questions_with_answers_by_tags.csv Schemas/ apache_mailing_lists.json quora.json stackoverfow_questions_answers_by_tag.json stackoverfow_tag_count.json stackoverfow_tag_metrics.json LDA-analysis LDA_input/ stackoverfow_raw_dataset.csv LDA_output/ Mallet/ output_csv/ docs-in-topics.csv topic-words.csv topics-in-docs.csv topics-metadata.csv output_html/ all_topics.html Docs/ Topics/ ``` ## Contents of the Replication Package Contains the data processed using the tool for the study. - **Data/** - `stackoverfow_questions_with_answers_by_tags.csv` - all Stack Overflow questions used in the study as stored in Makar - `stackoverfow_tags_metrics.csv` - all data containing the calculations done for Stack Overflow tag selection - `apache_mailing_list.csv` - statistically significant sample of `mailing_lists_ASF_@dev_@users_1.csv` and `mailing_lists_ASF_@dev_@users_2.csv` used in the study - `mailing_lists_ASF_@dev_@users_1.csv` - mailing list data used in the study as stored in Makar (part 1) - `mailing_lists_ASF_@dev_@users_2.csv` - mailing list data used in the study as stored in Makar (part 2) - `quora.csv` - all quora questions used in the study as stored in Makar - `sample_stackoverfow_questions_with_answers_by_tags` - statistically significant sample of `stackoverfow_questions_with_answers_by_tags.csv` used in the study - **Schemas/** - `apache_mailing_lists.json` - data schema used in Makar to store mailing list data - `quora.json` - data schema used in Makar to store quora data - `stackoverfow_questions_answers_by_tag.json` - data schema used in Makar to store stack Overflow questions data - `stackoverfow_tag_count.json` - data schema used in Makar to lookup number of questions per tag available in Stack Overflow - `stackoverfow_tag_metrics.json` - data schema used in Makar to Stack Overflow tag metrics data - **LDA_input/** - input data used for LDA analysis - `stackoverfow_raw_dataset.csv` - stackoverflow questions used to perform LDA analysis - **LDA_output/** - **Mallet/** - contains the LDA output generated by MALLET tool - **output_csv/** - `docs-in-topics.csv` - documents per topic - `topic-words.csv` - most relevant topic words - `topics-in-docs.csv` - topic probability per document - `topics-metadata.csv` - metadata per document and topic probability - **output_html/** - Browsable results of mallet output - `all_topics.html` - `Docs/` - `Topics/`
Analysis of Multi-source unstructured data
Analysis of Multi-source unstructured data
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 7 |

Views provided by UsageCounts