Constitutional Oversight of Hybrid ML-RAG Credit Scoring Systems: Scalable Governance for Safe and Fair Financial Inclusion in Emerging Economies

Over 350 million adults in the West African UEMOA region use Mobile Money as their primary financial instrument yet have no formal credit history. Traditional credit scoring models systematically exclude this population, not because its members are untrustworthy, but because the infrastructure to read their data does not exist. We present Fvundi, a hybrid ML-RAG credit intelligence system that addresses this exclusion at three levels. First, a two-layer prediction-explanation architecture: XGBoost scores creditworthiness from Mobile Money transaction features, while a RAG pipeline (LlamaIndex + Cohere + Claude) generates faithful natural-language explanations grounded in SHAP feature attributions. Second, we identify and formalize explanation faithfulness failures: instances where the RAG explanation layer misrepresents the underlying ML model's decisions, introducing demographic inconsistencies in the explanations given for identical credit scores. To measure these failures we propose four quantitative metrics: Feature Coverage Rate (FCR), Feature Rank Correlation (FRC), Demographic Consistency Score (DCS), and Hallucination Rate (HR). Third, we introduce Constitutional Oversight, a three-layer governance framework that combines a machine-readable Financial Constitution (human-authored ethical rules), Scalable Oversight (an AI Arbiter that enforces constitutional compliance at transaction speed), and Human-in-the-Loop control (triggered selectively when violations occur). This architecture satisfies BCEAO regulatory requirements while operating at the scale of millions of Mobile Money transactions per day. Our central argument: safe AI credit scoring in emerging economies requires institutional design that keeps humans in the loop at the governance level while delegating verification to machine-speed oversight. Fvundi is designed to extend the banking system, not replace it.
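
The abstract names the four faithfulness metrics without defining them. As a rough illustration only, the sketch below shows one plausible reading of two of them: FCR as the share of the top-k SHAP features that an explanation mentions, and FRC as the Spearman rank correlation between SHAP importance and order of mention. These definitions, the function names, the default k, and the feature names used in the test data are all assumptions, not the paper's formulas.

```python
from typing import Dict, List


def feature_coverage_rate(shap_values: Dict[str, float],
                          mentioned: List[str],
                          k: int = 3) -> float:
    """Assumed FCR: fraction of the top-k features by |SHAP| that the explanation mentions."""
    top_k = sorted(shap_values, key=lambda f: abs(shap_values[f]), reverse=True)[:k]
    if not top_k:
        return 0.0
    return sum(f in mentioned for f in top_k) / len(top_k)


def feature_rank_correlation(shap_values: Dict[str, float],
                             mentioned: List[str]) -> float:
    """Assumed FRC: Spearman correlation between SHAP importance rank and order of mention.

    Only features that appear in both inputs are ranked. The closed-form
    Spearman formula used here assumes no ties in |SHAP|.
    """
    # Keep first mentions only, in order, and drop names the model does not know.
    shared = list(dict.fromkeys(f for f in mentioned if f in shap_values))
    n = len(shared)
    if n < 2:
        return 0.0  # correlation is undefined for fewer than two features
    by_importance = sorted(shared, key=lambda f: abs(shap_values[f]), reverse=True)
    shap_rank = {f: i for i, f in enumerate(by_importance)}
    d_squared = sum((shap_rank[f] - i) ** 2 for i, f in enumerate(shared))
    return 1.0 - 6.0 * d_squared / (n * (n * n - 1))
```

Under this reading, an FCR near 1 means the explanation covers the features that actually drove the score, and an FRC near 1 means it presents them in roughly the model's order of importance. DCS and HR would need further assumptions (group labels and a claim-level check against retrieved context) and are left out here.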
