Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Data for a publication - Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains

Authors: Pavlo, Nosov; Melnyk, Oleksiy; Malaksiano, Mykola; Mamenko, Pavlo; Onyshko, Dmytro; Oleksij, Fomin; Píštěk, Václav; +1 Authors

Data for a publication - Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains

Abstract

This article presents the development of a modular software suite for automated analysis of scientific publications in PDF format. The system integrates vectorization, clustering, topic modelling, dimensionality reduction, and fuzzy logic to combine both formal (vector based) and semantic (topic-based) approaches. Interactive 3D visualization supports intuitive exploration of thematic clusters, allowing users to highlight relevant documents and adjust analytical parameters. Validation on a maritime safety case study confirmed Received: 7 October 2025 Revised: 4 November 2025 Accepted: 21 November 2025 Published: 24 November 2025 Citation: Nosov, P.; Melnyk, O.; Malaksiano, M.; Mamenko, P.; Onyshko, D.; Fomin, O.; Píštˇek, V.; Kuˇcera, P. Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains. Mach. Learn. Knowl. Extr. 2025, 7, 150. https://doi.org/10.3390/ make7040150 Copyright: © 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/ licenses/by/4.0/). the system’s ability to process large publication collections, identify relevant sources, and reveal underlying knowledge structures. Compared to established frameworks such as PRISMAor Scopus/WoS Analytics, the proposed tool operates directly on full-text content, provides deeper thematic classification, and does not require subscription-based databases. The study also addresses the limitations arising from data bias and reproducibility issues in the semantic interpretability of safety-critical decision-making systems. The approach offers practical value for organizations in safety-critical domains—including transportation, energy, cybersecurity, and human–machine interaction—where rapid access to thematically related research is essential.

Keywords

safety-critical systems, transport automation, human–machine interaction, cybersecurity, intelligent data analysi, decision-support systems, semantic classification, artificial intelligence (AI), topic modelling, shipping safety, analytics, human factor, fuzzy logic, interactive visualization, maritime sector, AI-support, clustering

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Related to Research communities