
callahantiff/PheKnowLator: v3.0.2

Research software | Software | 31 Jan 2021 | Publisher: Zenodo

Authors: Tiffany J. Callahan; Jordan M. Wyrwa, DO; Bill Baumgartner; Luca Cappelletti


Abstract

Release: v3.0.2

Website: https://github.com/callahantiff/PheKnowLator/wiki/v2.0.0
Data Access: Google Cloud Storage -- PheKnowLator Bucket
Docker Container: DockerHub Dedicated Project Container
PyPI: pkt-kg 3.0.2

Updated Jupyter Notebooks:
- notebooks/OWLNETS_Example_Application.ipynb
- notebooks/Data_Preparation.ipynb

Updated Scripts:
- builds/data_preprocessing.py
- pkt_kg/metadata.py
- pkt_kg/utils/kg_utils.py
- builds/data_to_download.txt
- pkt_kg/utils/data_utils.py
- tests/test_data_utils_downloading.py

Updates

Addresses issue #118 (PR: #119) by patching the functionality that obtains labels and definitions from ontologies. Whenever possible, the labels and definitions are now taken in English. See the details below for how to handle nodes containing foreign characters in builds made before this release.

Solution for Builds Prior to v3.0.2

The bad_node_patch.json file contains a dictionary whose outer keys are entity_uri values. Each outer value is another dictionary whose keys are label and description/definition and whose values are the updated strings without foreign characters. An example of this dictionary is shown below:

    key = '<http://purl.obolibrary.org/obo/UBERON_0000468>'
    print(bad_node_patch[key])
    >>> {'label': 'multicellular organism', 'description/definition': 'Anatomical structure that is an individual member of a species and consists of more than one cell.'}

The code to identify the nodes with erroneous foreign characters is shown below:

    import re
    import pandas as pd

    # link to downloaded `NodeLabels.txt` file
    input_file = 'NodeLabels.txt'

    # load data as Pandas DataFrame
    nodedf = pd.read_csv(input_file, sep='\t', header=0)

    # identify bad nodes and filter DataFrame so it only contains these rows
    nodedf['bad'] = nodedf['label'].apply(lambda x: re.search("[\u4e00-\u9FFF]", x) if not pd.isna(x) else None)
    nodedf_bad_nodes = nodedf[~pd.isna(nodedf['bad'])].drop_duplicates()
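The release notes describe the patch dictionary and how to find affected nodes, but not how the two fit together. The following is a minimal sketch of one way to apply the patch, using only the Python standard library. It assumes each node is held as a dict with an `entity_uri` field alongside `label` and `description/definition`; the `entity_uri` column name and the helper names `has_cjk` and `apply_patch` are hypothetical and are not part of pkt_kg.

```python
import re

# Same character range as the detection snippet above (CJK Unified Ideographs).
CJK_PATTERN = re.compile("[\u4e00-\u9fff]")


def has_cjk(text):
    """Return True if text is a string containing at least one CJK character."""
    return isinstance(text, str) and CJK_PATTERN.search(text) is not None


def apply_patch(rows, patch, uri_field="entity_uri"):
    """Return copies of rows with patched fields swapped in where available.

    rows: list of dicts, one per node (assumed layout, not confirmed by the source).
    patch: dict shaped like the bad_node_patch.json content described above.
    uri_field: assumed name of the field holding the entity URI.
    """
    patched = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are left untouched
        replacement = patch.get(row.get(uri_field))
        if replacement is not None:
            # overwrite only the fields the patch entry provides
            new_row.update(replacement)
        patched.append(new_row)
    return patched
```

A possible usage pattern is to load bad_node_patch.json with `json.load`, read NodeLabels.txt into a list of dicts with `csv.DictReader(f, delimiter='\t')`, call `apply_patch`, and then confirm with `has_cjk` that no patched label still contains characters in that range. Rows whose URI is missing from the patch are returned unchanged, so any that remain flagged would need manual review.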

Related Organizations

- King’s University, United States
- University of Milan, Italy
- University of Colorado Anschutz Medical Campus, United States

Impact by BIP!

- citations: 1. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Usage by UsageCounts

- views: 14
- downloads: 1
