doi: 10.5281/zenodo.19609604
Framework-Free Transformer Inference from Decomposed Weight Files via Vulkan Compute