skip to content
gigarouter gigarouter
models / speech-to-text · coming soon

vakyansh-wav2vec2-tamil-tam-250

Harveenchadha/vakyansh-wav2vec2-tamil-tam-250

A popular open speech-to-text model, with 2.2M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

status
coming soon
API providers
0
downloads / mo
2.2M
license
mit

about this model

Harveenchadha/vakyansh-wav2vec2-tamil-tam-250 is an automatic speech recognition (ASR) model for Tamil, fine-tuned from the multilingual CLSRIL-23 pretrained checkpoint. It is trained on 4200 hours of labelled speech data and requires 16 kHz input audio.

Key Strengths

  • Fine-tuned from a multilingual foundation model, enabling robust Tamil speech recognition.
  • Delivered without an external language model; Word Error Rate can be further reduced by integrating a language model if needed.
  • Suitable for direct inference or as a component in a larger pipeline.

Benchmark Performance

Dataset Metric Score Notes
Common Voice (Tamil test set) Word Error Rate (WER) 53.64% Without a language model

The reported WER is from the model’s standalone evaluation; performance may improve when combined with a language model.

Intended Use

This model is designed for developers deploying Tamil ASR in applications where a dedicated, fine-tuned wav2vec 2.0 model is preferred. It works best as a hosted API via gigarouter, eliminating the need for manual model loading and infrastructure management.

not yet live

We're benchmarking and onboarding vakyansh-wav2vec2-tamil-tam-250 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.