515829 – Integrate Voxtral (Mistral AI) for Speech-to-text

Bug 515829 - Integrate Voxtral (Mistral AI) for Speech-to-text

Summary: Integrate Voxtral (Mistral AI) for Speech-to-text

Status:	REPORTED

Alias:	None

Product:	kdenlive
Classification:	Applications
Component:	Title Clips & Subtitles (other bugs)
Version First Reported In:	unspecified
Platform:	Other Linux

Importance:	NOR wishlist
Target Milestone:	---
Assignee:	Jean-Baptiste Mardelle

URL:
Keywords:

Depends on:
Blocks:

Reported:	2026-02-10 17:24 UTC by reportthebug
Modified:	2026-02-10 17:48 UTC (History)
CC List:	0 users

See Also:
Latest Commit:
Version Fixed/Implemented In:
Sentry Crash Report:

Attachments
Add an attachment

Note You need to log in before you can comment on or make changes to this bug.

Description reportthebug 2026-02-10 17:24:16 UTC

Support Mistral AI’s new open-weight Voxtral models as an alternative STT engine for automatic subtitling.

Key Benefits:
* Higher Accuracy: Outperforms Whisper Large-v3 in speed and Word Error Rate (WER).
* Native Diarization: Built-in speaker identification to automatically label different voices in transcripts.
* Efficiency: Optimized for local hardware; the Mini-3B model provides high-quality results with low VRAM usage.
* Privacy/License: Apache 2.0 license, allowing for fully offline, private processing.

Proposed Integration:
Add "Voxtral" to the STT engine list in Settings > Speech to Text, with model selection (Mini/Small) and a toggle for speaker diarization.

Comment 1 reportthebug 2026-02-10 17:48:45 UTC

a demo you can find here: https://huggingface.co/spaces/pandora-s/Voxtral-Subtitles