Powered by OpenAIRE graph
ZENODO
Software . 2025
Data sources: ZENODO, Datacite

PicAxe

Authors: Guerrero, Anna C.; Kamath, Krishna; Zhou, Qilin; Felalaga, Bruno; Damerow, Julia; Dinner, Aaron R.
Abstract

PicAxe 1.0.1 Release Notes: Monday, October 6th, 2025

Welcome to PicAxe v1.0.1! We've made some small patches, specifically updates to the READMEs and Docker images for both pipelines, for more seamless installation and use. We've also added a visual flowchart to the main branch README.

This release of PicAxe was updated by Qilin Zhou and Bruno Felalaga, with supervision by Dr. Anna Clemencia Guerrero (Santa Fe Institute), advised by Dr. Aaron R. Dinner (UChicago) and Dr. Julia Damerow (Arizona State University).

Fixed Issues

Users reported three issues when setting up and running PicAxe-OCR: (a) running install_pcks.py was necessary to install layoutparser, but this was not mentioned in the README; (b) setup-tools was missing at first run; and (c) running with --bulk and with --sample both failed with no error reported. The README and Docker image have been updated to address these issues. We retested the pipeline to confirm that they were resolved, and pulling the Docker image and running PicAxe-OCR should now work without these problems.

The Docker image tag for PicAxe-YOLO was originally the placeholder "tagname"; it has been updated to "latest".

Before running PicAxe-YOLO with Docker, users need to create host folders. We have added instructions to the PicAxe-YOLO README about where users need to create folders to (a) place their own input PDFs, (b) output the extraction results, and (c) store our pretrained YOLO weights.

Known Issues

Extraction results will not be perfect from either pipeline. Users should always check the results of extraction before performing further data analysis. For more details about how we are working to improve extraction results, please see the main README file.

Package dependencies can cause issues (noted in the respective README files), so we have provided Docker files. If the Docker images are not pulled for some time, they will be deleted, so an image might not exist at some point.
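
The host-folder step mentioned under Fixed Issues could be scripted along the following lines. This is only a minimal sketch: the folder names "input_pdfs", "output", and "weights" and the helper create_host_folders are hypothetical and do not come from PicAxe. The PicAxe-YOLO README specifies the actual locations, and those should be used instead.

```python
from pathlib import Path

# Hypothetical folder names for (a) input PDFs, (b) extraction results,
# and (c) pretrained YOLO weights. Replace them with the locations given
# in the PicAxe-YOLO README.
HOST_FOLDERS = ("input_pdfs", "output", "weights")


def create_host_folders(base_dir, names=HOST_FOLDERS):
    """Create one folder per name under base_dir and return the paths."""
    base = Path(base_dir)
    created = []
    for name in names:
        folder = base / name
        # exist_ok=True makes the call safe to repeat on later runs.
        folder.mkdir(parents=True, exist_ok=True)
        created.append(folder)
    return created
```

Calling create_host_folders(".") from the directory that will be mounted into the container would create the three folders if they do not already exist and leave existing ones untouched.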

If you use, test, or refer to PicAxe, please cite it as below.

  • Impact by BIP!
    selected citations: 0
    These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    popularity: Average
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    influence: Average
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    impulse: Average
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.