
README Nellai Dialect Automatic Speech Recognition System (Nellai-ASR) Automatic Speech Recognition (ASR) system tailored for the Nellai / Tirunelveli Tamil dialect — a low-resource dialect of Tamil. Converts spoken audio in the Nellai dialect into written text with support for model training, evaluation, and inference. Overview Automatic Speech Recognition (ASR) converts spoken language into readable text. While mainstream ASR systems exist for resource-rich languages, dialects like Nellai Tamil lack public datasets and tailored models.This repository provides tools, scripts, and training pipelines to build and evaluate an ASR system specifically for the Nellai dialect. Supports data preprocessing Model training & fine-tuning (Lora adapted Whisper model) Evaluation & inference pipelines Example scripts and baseline performance Purpose ASR system tailored for the Nellai / Tirunelveli Tamil dialect (low-resource dialect) Converts spoken audio into text Features Data preprocessing pipeline Training & fine-tuning scripts Evaluation with metrics like WER/CER Inference for real-time transcription Structure Organised folders for raw/processed data, models, scripts, and notebooks Usage Steps Clone repository Prepare dataset (audio + transcripts) Install dependencies Preprocess data Train model Evaluate performance Run inference Metrics Word Error Rate (WER) Character Error Rate (CER) Examples GoogleColab notebooks for interactive demos License MIT License
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
