
Abstract This study examines how syntactic constructions in expense narratives affect misclassification rates in AI-powered corporate ERP systems. We trained transformer-based classifiers on labeled accounting data to predict expense categories and observed that these models frequently relied on grammatical form rather than financial semantics. We extracted syntactic features including nominalization frequency, defined as the ratio of deverbal nouns to verbs; coordination depth, measured by the maximum depth of coordinated clauses; and subordination complexity, expressed as the number of embedded subordinate clauses per sentence. Using SHAP (SHapley Additive exPlanations), we identified that these structural patterns significantly contribute to false allocations, thus increasing the likelihood of audit discrepancies. For interpretability, we applied the method introduced by Lundberg and Lee in their seminal work, “A Unified Approach to Interpreting Model Predictions,” published in Advances in Neural Information Processing Systems 30 (2017): 4765–4774. To mitigate these syntactic biases, we implemented a rule-based debiasing module that re-parses each narrative into a standardized fair-syntax transformation, structured around a minimal Subject-Verb-Object sequence. Evaluation on a corpus of 18,240 expense records drawn from the U.S. Federal Travel Expenditure dataset (GSA SmartPay, 2018–2020, https://smartpay.gsa.gov) shows that the fair-syntax transformation reduced misclassification rates by 15 percent. It also improved key pre-audit compliance indicators, including GL code accuracy—defined as the percentage of model-assigned codes matching human-validated general ledger categories, with a target threshold of ≥ 95 percent—and reconciliation match rate, the proportion of expense records successfully aligned with authorized payment entries, aiming for ≥ 98 percent. The findings reveal a direct operational link between linguistic form and algorithmic behavior in accounting automation, providing a replicable interpretability framework and a functional safeguard against structural bias in enterprise classification systems. DOI: https://doi.org/10.5281/zenodo.16322760 This work is also published with DOI reference in Figshare https://doi.org/10.6084/m9.figshare.29618654 and Pending SSRN ID to be assigned. ETA: Q3 2025. Resumen Este estudio analiza cómo las construcciones sintácticas presentes en las narrativas de gastos afectan las tasas de clasificación errónea en sistemas ERP corporativos impulsados por inteligencia artificial. Se entrenaron clasificadores basados en transformadores sobre datos contables etiquetados, y se observó que estos modelos se apoyan con frecuencia en patrones gramaticales superficiales en lugar de en la semántica financiera subyacente. Se extrajeron métricas sintácticas como la frecuencia de nominalización (definida como la proporción entre sustantivos deverbales y verbos), la profundidad de coordinación (medida por la máxima profundidad de cláusulas coordinadas) y la complejidad de subordinación (cantidad de cláusulas subordinadas incrustadas por oración). Mediante SHAP (SHapley Additive exPlanations), se identificó que estos patrones estructurales contribuyen de forma significativa a asignaciones erróneas, elevando el riesgo de discrepancias contables y observaciones de auditoría. Para la interpretación, se aplicó el método propuesto por Lundberg y Lee en su artículo “A Unified Approach to Interpreting Model Predictions”, publicado en Advances in Neural Information Processing Systems 30 (2017): 4765–4774. Como estrategia de mitigación, se implementó un módulo de corrección sintáctica basado en reglas que reescribe cada narrativa en una transformación de sintaxis justa, estructurada según una secuencia mínima Sujeto–Verbo–Objeto. La evaluación se realizó sobre un corpus de 18.240 registros de gastos extraídos del conjunto de datos del Programa Federal de Viajes de EE. UU. (GSA SmartPay, 2018–2020, https://smartpay.gsa.gov), y mostró que la transformación de sintaxis justa redujo las tasas de clasificación errónea en un 15 %. También mejoró indicadores clave de cumplimiento previo a la auditoría, como la precisión del código contable (porcentaje de códigos asignados por el modelo que coinciden con categorías del libro mayor validadas por humanos, con umbral objetivo ≥ 95 %) y la tasa de conciliación (proporción de registros emparejados correctamente con transacciones autorizadas, con objetivo ≥ 98 %). Los resultados revelan un vínculo operativo directo entre la forma lingüística y el comportamiento algorítmico en automatización contable, y proponen un marco replicable de interpretación y una salvaguarda funcional contra el sesgo estructural en los sistemas de clasificación empresarial.
Business activity, Artificial Intelligence/legislation & jurisprudence, Local finance, Linguistics/legislation & jurisprudence, Business models, Unsupervised Machine Learning/classification, Anthropological ethics, Demolition business, Supervised Machine Learning/economics, Insurance business, Indifferentism (Ethics) in literature, Ethics Committees/ethics, Linguistics/organization & administration, Business, Adoption--Finance, Kiowa Indians--Finance, Linguistics/standards, Artificial Intelligence/ethics, Repair business, Supervised Machine Learning/ethics, Linguistics/methods, Administration, Buccal, Linguistics/education, Supervised Machine Learning/classification, Machine Learning/trends, Library Administration, Allied health personnel--Professional ethics, Machine Learning/history, Administration, Intravesical, Linguistics/legislation & jurisprudence, Administration, Accounting--Professional ethics, Supervised Machine Learning, Abattoirs/ethics, Artificial Intelligence/economics, Supervised Machine Learning/trends, Unsupervised Machine Learning/history, Business economics, Business classification, Linguistics/ethics, Business intelligence, Administration, Rectal, Machine learning--Experiments, Architects--Professional ethics, Machine learning, Machine Learning/classification, Comparative linguistics, Administrative court (administration), Artificial Intelligence/trends, Ethics, Business, Machine Learning/legislation & jurisprudence, Handicraft business, Big business--Social aspects, Ethics Committees, Clinical/ethics, Nursing Administration Research/organization & administration, Business organisation, Machine learning--Technique, Public finance, ethics, Artificial Intelligence/supply & distribution, Artists--Professional ethics, Undertaking business, Finances, Accounting/ethics, Machine Learning/ethics, Linguistics/statistics & numerical data, Appropriateness (Ethics), Artificial intelligence, Administrative Personnel/organization & administration, Artificial Intelligence/statistics & numerical data, Applied linguistics--Data processing, Comparative linguistics--Statistical methods, Administration, Oral, Artificial Intelligence/standards, Machine Learning, Linguistics/history, Alcoholism counselors--Professional ethics, Heuristics/ethics, Astronautics and ethics, Small Business/instrumentation, Machine Learning/standards, Linguistics/trends, Artificial Intelligence/supply & distribution, Machine Learning/supply & distribution, Transportation business, Linguistics/classification, Linguistics/organization & administration, Artificial Intelligence/statistics & numerical data, Unsupervised Machine Learning/ethics, Ethics Consultation/ethics, Artificial Intelligence/classification, Community finance, Independent practice associations (Medical care)--Finance--Management, Business policy, Justification (Ethics), Small Business/classification, Administrative Personnel/organization & administration, Applied ethics, Ethics Committees, Research/ethics, Machine Learning/supply & distribution, Brown finance, Painting business, Administration, Sublingual, Machine Learning/legislation & jurisprudence, Applied linguistics--Research, Artificial Intelligence/history, Linguistics/statistics & numerical data, Ethnic Violence/ethics, Artificial Intelligence, Supervised Machine Learning/standards, Sustainable finance, Monetary and finances, Administration, Inhalation, Independent regulatory commissions--Finance, Small Business, Nursing Administration Research/organization & administration, Housing finance, Ethics, Applied ethics--Study and teaching, Ethnic Cleansing/ethics, Linguistics/economics, Linguistics, Artificial Intelligence/legislation & jurisprudence, Machine Learning/economics, Administration, Intravaginal, Ensemble learning (Machine learning), Applied linguistics--Statistical methods, Componential analysis (Linguistics), Accountants--Professional ethics, Machine learning--Evaluation, FOS: Languages and literature, Linguistics/instrumentation, Big business--Corrupt practices, Competence and performance (Linguistics), Unsupervised Machine Learning
Business activity, Artificial Intelligence/legislation & jurisprudence, Local finance, Linguistics/legislation & jurisprudence, Business models, Unsupervised Machine Learning/classification, Anthropological ethics, Demolition business, Supervised Machine Learning/economics, Insurance business, Indifferentism (Ethics) in literature, Ethics Committees/ethics, Linguistics/organization & administration, Business, Adoption--Finance, Kiowa Indians--Finance, Linguistics/standards, Artificial Intelligence/ethics, Repair business, Supervised Machine Learning/ethics, Linguistics/methods, Administration, Buccal, Linguistics/education, Supervised Machine Learning/classification, Machine Learning/trends, Library Administration, Allied health personnel--Professional ethics, Machine Learning/history, Administration, Intravesical, Linguistics/legislation & jurisprudence, Administration, Accounting--Professional ethics, Supervised Machine Learning, Abattoirs/ethics, Artificial Intelligence/economics, Supervised Machine Learning/trends, Unsupervised Machine Learning/history, Business economics, Business classification, Linguistics/ethics, Business intelligence, Administration, Rectal, Machine learning--Experiments, Architects--Professional ethics, Machine learning, Machine Learning/classification, Comparative linguistics, Administrative court (administration), Artificial Intelligence/trends, Ethics, Business, Machine Learning/legislation & jurisprudence, Handicraft business, Big business--Social aspects, Ethics Committees, Clinical/ethics, Nursing Administration Research/organization & administration, Business organisation, Machine learning--Technique, Public finance, ethics, Artificial Intelligence/supply & distribution, Artists--Professional ethics, Undertaking business, Finances, Accounting/ethics, Machine Learning/ethics, Linguistics/statistics & numerical data, Appropriateness (Ethics), Artificial intelligence, Administrative Personnel/organization & administration, Artificial Intelligence/statistics & numerical data, Applied linguistics--Data processing, Comparative linguistics--Statistical methods, Administration, Oral, Artificial Intelligence/standards, Machine Learning, Linguistics/history, Alcoholism counselors--Professional ethics, Heuristics/ethics, Astronautics and ethics, Small Business/instrumentation, Machine Learning/standards, Linguistics/trends, Artificial Intelligence/supply & distribution, Machine Learning/supply & distribution, Transportation business, Linguistics/classification, Linguistics/organization & administration, Artificial Intelligence/statistics & numerical data, Unsupervised Machine Learning/ethics, Ethics Consultation/ethics, Artificial Intelligence/classification, Community finance, Independent practice associations (Medical care)--Finance--Management, Business policy, Justification (Ethics), Small Business/classification, Administrative Personnel/organization & administration, Applied ethics, Ethics Committees, Research/ethics, Machine Learning/supply & distribution, Brown finance, Painting business, Administration, Sublingual, Machine Learning/legislation & jurisprudence, Applied linguistics--Research, Artificial Intelligence/history, Linguistics/statistics & numerical data, Ethnic Violence/ethics, Artificial Intelligence, Supervised Machine Learning/standards, Sustainable finance, Monetary and finances, Administration, Inhalation, Independent regulatory commissions--Finance, Small Business, Nursing Administration Research/organization & administration, Housing finance, Ethics, Applied ethics--Study and teaching, Ethnic Cleansing/ethics, Linguistics/economics, Linguistics, Artificial Intelligence/legislation & jurisprudence, Machine Learning/economics, Administration, Intravaginal, Ensemble learning (Machine learning), Applied linguistics--Statistical methods, Componential analysis (Linguistics), Accountants--Professional ethics, Machine learning--Evaluation, FOS: Languages and literature, Linguistics/instrumentation, Big business--Corrupt practices, Competence and performance (Linguistics), Unsupervised Machine Learning
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
