
doi: 10.71074/ctc.1716528
The limited size of publicly available sparse matrix datasets creates a significant challenge for benchmarking, testing, and validating algorithms in scientific computing, artificial intelligence, and other data-intensive applications. Existing approaches, such as random matrix generators or general data augmentation methods, often fail to produce structurally realistic matrices. To address this gap, we present MatGen, a tool for generating realistic variations of a given sparse matrix using signal processing and image processing techniques. MatGen takes a real sparse matrix as input and produces structurally consistent matrices at different sizes, introducing controlled variation while preserving key sparsity patterns. We evaluate the effectiveness of MatGen by analyzing structural features and visual similarities between original and generated matrices. Experimental results show that MatGen can produce realistic, scalable sparse matrices suitable for a wide range of applications, such as benchmarking computational methods and sparse data techniques.
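The abstract does not specify which image processing operations MatGen applies. As a rough illustration of the general idea only, the sketch below treats a sparsity pattern as an image and rescales it with nearest-neighbor interpolation, which is one of the simplest image scaling methods. The function name `scale_pattern` and the dictionary-of-coordinates representation are assumptions made for this example and are not taken from MatGen.

```python
def scale_pattern(entries, shape, new_shape):
    """Nearest-neighbor rescale of a sparse pattern.

    entries: dict mapping (row, col) -> value for the nonzeros.
    shape, new_shape: (rows, cols) of the source and target matrices.
    Hypothetical helper for illustration; not MatGen's actual algorithm.
    """
    m, n = shape
    M, N = new_shape
    scaled = {}
    for (i, j), value in entries.items():
        # Target rows I whose source row floor(I * m / M) equals i form the
        # half-open range [ceil(i * M / m), ceil((i + 1) * M / m)).
        rows = range(-(-i * M // m), -(-(i + 1) * M // m))
        cols = range(-(-j * N // n), -(-(j + 1) * N // n))
        for I in rows:
            for J in cols:
                scaled[(I, J)] = value
    return scaled


# A 3x3 tridiagonal pattern, scaled up to 6x6.
tridiagonal = {(i, j): 1.0 for i in range(3) for j in range(3) if abs(i - j) <= 1}
larger = scale_pattern(tridiagonal, (3, 3), (6, 6))
```

Nearest-neighbor scaling keeps the band structure and the overall density of the pattern, but it introduces no variation; a generator of the kind described in the abstract would need additional steps on top of a plain resize like this one.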
Graph, Social and Multimedia Data; Data Engineering and Data Science; sparse matrices; matrix generator; matrix scaling; data augmentation; signal processing; image processing
