Fine-tune Transcriber

Configure and monitor transcription model fine-tuning jobs

Active Job

Job History

Configure training parameters and monitor fine-tuning progress. Select a base model, training dataset, and hyperparameters to start a new fine-tuning job.

Training Configuration

Base Model

Training Dataset

Language

Epochs

Learning Rate

Batch Size

Training Method

Multilingual Mode

Enable for mixed-language audio. Disable for single-language (e.g. Lithuanian only).

LoRA Parameters

LoRA Rank (r) 64

LoRA Alpha (α) 128

Dropout

Target Modules

Select which attention layers to apply LoRA adapters. Default: all projection layers.

Data Preparation

Number Normalization Applied

Converts written numbers to spoken form

Audio Downsampling 16kHz → 8kHz

Simulates telephony audio quality

WER Quality Filter Threshold: 30%

Excludes samples with baseline WER above threshold

Training Progress

Training

Epoch 7 / 10 — 70%

Training Loss

0.234

Validation WER

12.3%

Learning Rate

0.0001

ETA

8m 30s

[14:23:45] Starting epoch 7/10...

[14:23:46] Batch 1/16 — loss: 0.241

[14:23:48] Batch 8/16 — loss: 0.228

[14:23:50] Batch 16/16 — loss: 0.234

[14:23:51] Epoch 7 complete — val_wer: 12.3%, val_loss: 0.198

[14:23:51] Checkpoint saved: whisper-lg-v3-ft-epoch7.pt

Post-Training Pipeline

After training completes, the model goes through merge, conversion, and upload stages before deployment.

Train LoRA Adapter

Completed · 2h 15m · Feb 24 14:23

Done

⟳

Merge LoRA Weights

Running · Started Feb 24 16:38 · ETA 12m

Running

—

Convert to CTranslate2

Pending

—

Upload to HuggingFace

Pending

Job ID	Base Model	Dataset	Final WER	Duration	Status	Date
FT-004	Whisper Large v3	Lithuanian v1	12.3%	2h 15m	Training	Feb 24
FT-003	Whisper Large v3	Lithuanian v1	11.2%	2h 08m	Completed	Feb 22
FT-002	Whisper Medium	Sales Q1	—	45m	Failed	Feb 20
FT-001	Whisper Large v3	Lithuanian v1	14.8%	1h 52m	Completed	Feb 18
FT-000	Whisper Small	VoiceBot Audio	22.1%	38m	Completed	Feb 15