
PicAxe 1.0.1 Release Notes: Monday, October 6th, 2025 Welcome to PicAxe v1.0.1! We've made some small patches, specifically updates to the READMEs and Docker images for both pipelines for more seamless installation and use. We've also added a visual flowchart to the main branch README. This release of PicAxe was updated by: Qilin Zhou and Bruno Felalaga, with supervision by Dr. Anna Clemencia Guerrero (Santa Fe Institute), advised by Dr. Aaron K. Dinner (UChicago) and Dr. Julia Damerow (Arizona State University. Fixed Issues Users reported three issues when setting up and running PicAxe-OCR: (a) running install_pcks.py was necessary to install layoutparser but this was not mentioned in the README, (b) setup-tools was missing at first run, and (c) running --bulk and --sample both failed with no error reported. The README and Docker image have been updated to mitigate these issues. We tested the pipeline again to make sure these issues were resolved, and there should be no further issues pulling the Docker image and running PicAxe-OCR. The Docker image tag for PicAxe-YOLO was originally called "tagname" as a placeholder, but the image tag has been updated to "latest". Before running PicAxe-YOLO with Docker, users need to create host folders. We have added instructions to the README for PicAxe-YOLO about where users need to create folders to (a) place their own input PDFs, (b) output the extraction results, and (c) store our pretrained YOLO weights. Known Issues Extraction results will not be perfect from either pipeline. Users should always check the results of extraction before performing further data analysis. For more details about how we are working to improve extraction results, please see the main README file. Package dependencies can cause issues (noted in respective README files), so we have provided Docker files. If the Docker images are not pulled for some time, they will be deleted. Note that the Docker image might not exist at some point.
If you use, test, or refer to PicAxe, please cite it as below.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
