Bug 482933 - Kdenlive: won't recognize more than a few words from videos
Summary: Kdenlive: won't recognize more than a few words from videos
Status: REPORTED
Alias: None
Product: kdenlive
Classification: Applications
Component: Title Clips & Subtitles
Version: 23.08.5
Platform: Mint (Ubuntu based) Linux
Importance: NOR normal
Target Milestone: ---
Assignee: Jean-Baptiste Mardelle
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-03-08 23:47 UTC by DJS
Modified: 2024-10-01 17:39 UTC

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:


Description DJS 2024-03-08 23:47:26 UTC
SUMMARY
***
NOTE: If you are reporting a crash, please try to attach a backtrace with debug symbols.
See https://community.kde.org/Guidelines_and_HOWTOs/Debugging/How_to_create_useful_crash_reports
***


STEPS TO REPRODUCE

Greetings,
I downloaded Kdenlive and installed all the extensions needed to generate subtitles from videos using speech recognition.
It only produces about 1 to 2 minutes of subtitles, not subtitles for the full video.
It is running with pip3 and VOSK, but others have told me to install Whisper. When I try to install Whisper, Kdenlive doesn't give me the option to switch to it.

Any idea why this is?


Kdenlive version 21.12.3
Platform: Linux Mint
Install method (official installer, package repository, AppImage, Flatpak, PPA, etc): system package
Screenshots or screen recordings: here is a video recording of the problem: https://www.youtube.com/watch?v=0zBMvyckysA
OBSERVED RESULT


EXPECTED RESULT:
The speech recognized from the video is rendered as text.


SOFTWARE/OS VERSIONS
Windows: 
macOS: 
Linux/KDE Plasma: Linux Mint

System:
  Kernel: 5.15.0-97-generic x86_64 bits: 64 compiler: gcc v: 11.4.0 Desktop: MATE 1.26.0
    info: mate-panel wm: marco 1.26.0 vt: 7 dm: LightDM 1.30.0 Distro: Linux Mint 21.1 Vera
    base: Ubuntu 22.04 jammy
Machine:
  Type: Laptop System: Micro-Star product: GT62VR 6RD v: REV:1.0 serial: <superuser required>
    Chassis: type: 10 serial: <superuser required>
  Mobo: Micro-Star model: MS-16L2 v: REV:1.0 serial: <superuser required>
    UEFI: American Megatrends v: E16L2IMS.117 date: 01/17/2018
Battery:
  ID-1: BAT1 charge: 71.1 Wh (100.0%) condition: 71.1/79.3 Wh (89.6%) volts: 16.6 min: 14.4
    model: MSI Corp. MS-16L2 type: Li-ion serial: N/A status: Full
CPU:
  Info: quad core model: Intel Core i7-6700HQ bits: 64 type: MT MCP smt: enabled arch: Skylake-S
    rev: 3 cache: L1: 256 KiB L2: 1024 KiB L3: 6 MiB
  Speed (MHz): avg: 3103 high: 3234 min/max: 800/3500 cores: 1: 3003 2: 2906 3: 3014 4: 3234
    5: 3142 6: 3159 7: 3164 8: 3205 bogomips: 41599
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Graphics:
  Device-1: NVIDIA GP106BM [GeForce GTX 1060 Mobile 6GB] vendor: Micro-Star MSI driver: nvidia
    v: 525.147.05 pcie: speed: 2.5 GT/s lanes: 16 ports: active: none off: eDP-1
    empty: DP-1,DP-2,HDMI-A-1 bus-ID: 01:00.0 chip-ID: 10de:1c60 class-ID: 0300
  Device-2: Acer BisonCam NB Pro type: USB driver: uvcvideo bus-ID: 1-11:5 chip-ID: 5986:055c
    class-ID: 0e02 serial: <filter>
  Display: x11 server: X.Org v: 1.21.1.4 compositors: 1: marco v: 1.26.0 2: Compton v: 1 driver:
    X: loaded: nvidia unloaded: fbdev,modesetting,nouveau,vesa gpu: nvidia display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x286mm (20.0x11.3") s-diag: 583mm (23")
  Monitor-1: DP-0 res: 1920x1080 hz: 60 dpi: 142 size: 344x194mm (13.5x7.6") diag: 395mm (15.5")
  OpenGL: renderer: NVIDIA GeForce GTX 1060/PCIe/SSE2 v: 4.6.0 NVIDIA 525.147.05
    direct render: Yes
Audio:
  Device-1: Intel 100 Series/C230 Series Family HD Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel bus-ID: 00:1f.3 chip-ID: 8086:a170 class-ID: 0403
  Device-2: NVIDIA GP106 High Definition Audio vendor: Micro-Star MSI driver: snd_hda_intel
    v: kernel pcie: speed: 8 GT/s lanes: 16 bus-ID: 01:00.1 chip-ID: 10de:10f1 class-ID: 0403
  Sound Server-1: ALSA v: k5.15.0-97-generic running: yes
  Sound Server-2: PulseAudio v: 15.99.1 running: yes
  Sound Server-3: PipeWire v: 0.3.48 running: yes
Network:
  Device-1: Qualcomm Atheros QCA6174 802.11ac Wireless Network Adapter vendor: Rivet Networks
    driver: ath10k_pci v: kernel pcie: speed: 2.5 GT/s lanes: 1 bus-ID: 02:00.0 chip-ID: 168c:003e
    class-ID: 0280
  IF: wlp2s0 state: up mac: <filter>
  Device-2: Qualcomm Atheros Killer E2400 Gigabit Ethernet vendor: Micro-Star MSI driver: alx
    v: kernel pcie: speed: 2.5 GT/s lanes: 1 port: d000 bus-ID: 04:00.0 chip-ID: 1969:e0a1
    class-ID: 0200
  IF: enp4s0 state: down mac: <filter>
Bluetooth:
  Device-1: Qualcomm Atheros QCA61x4 Bluetooth 4.0 type: USB driver: btusb v: 0.8 bus-ID: 1-10:4
    chip-ID: 0cf3:e300 class-ID: e001
  Report: hciconfig ID: hci0 rfk-id: 2 state: up address: <filter> bt-v: 2.1 lmp-v: 4.2
    sub-v: 25a hci-v: 4.2
Drives:
  Local Storage: total: 1.03 TiB used: 592.79 GiB (56.4%)
  ID-1: /dev/nvme0n1 vendor: Toshiba model: N/A size: 119.24 GiB speed: 31.6 Gb/s lanes: 4
    type: SSD serial: <filter> rev: 57XA4104 temp: 77.8 C scheme: GPT
  ID-2: /dev/sda vendor: Samsung model: SSD 860 QVO 1TB size: 931.51 GiB speed: 6.0 Gb/s
    type: SSD serial: <filter> rev: 2B6Q scheme: GPT
Partition:
  ID-1: / size: 94.34 GiB used: 48.79 GiB (51.7%) fs: ext4 dev: /dev/nvme0n1p3
  ID-2: /boot/efi size: 486 MiB used: 6.1 MiB (1.2%) fs: vfat dev: /dev/nvme0n1p1
  ID-3: /home size: 915.82 GiB used: 543.99 GiB (59.4%) fs: ext4 dev: /dev/sda1
Swap:
  ID-1: swap-1 type: partition size: 22.35 GiB used: 4.5 MiB (0.0%) priority: -2
    dev: /dev/nvme0n1p2
USB:
  Hub-1: 1-0:1 info: Hi-speed hub with single TT ports: 16 rev: 2.0 speed: 480 Mb/s
    chip-ID: 1d6b:0002 class-ID: 0900
  Device-1: 1-3:2 info: Logitech Unifying Receiver type: Keyboard,Mouse
    driver: logitech-djreceiver,usbhid interfaces: 2 rev: 2.0 speed: 12 Mb/s power: 98mA
    chip-ID: 046d:c534 class-ID: 0301
  Device-2: 1-7:6 info: MSI steel series rgb keyboard type: HID driver: gt683r_led,usbhid
    interfaces: 1 rev: 1.1 speed: 12 Mb/s power: 2mA chip-ID: 1770:ff00 class-ID: 0300
    serial: <filter>
  Device-3: 1-10:4 info: Qualcomm Atheros QCA61x4 Bluetooth 4.0 type: Bluetooth driver: btusb
    interfaces: 2 rev: 2.0 speed: 12 Mb/s power: 100mA chip-ID: 0cf3:e300 class-ID: e001
  Device-4: 1-11:5 info: Acer BisonCam NB Pro type: Video driver: uvcvideo interfaces: 2
    rev: 2.0 speed: 480 Mb/s power: 500mA chip-ID: 5986:055c class-ID: 0e02 serial: <filter>
  Hub-2: 2-0:1 info: Super-speed hub ports: 8 rev: 3.0 speed: 5 Gb/s chip-ID: 1d6b:0003
    class-ID: 0900
  Hub-3: 3-0:1 info: Hi-speed hub with single TT ports: 2 rev: 2.0 speed: 480 Mb/s
    chip-ID: 1d6b:0002 class-ID: 0900
  Hub-4: 4-0:1 info: Super-speed hub ports: 2 rev: 3.1 speed: 10 Gb/s chip-ID: 1d6b:0003
    class-ID: 0900
Sensors:
  System Temperatures: cpu: 72.0 C pch: 63.0 C mobo: 27.8 C gpu: nvidia temp: 54 C
  Fan Speeds (RPM): N/A
Repos:
  Packages: 2995 apt: 2936 flatpak: 59
  No active apt repos in: /etc/apt/sources.list
  No active apt repos in: /etc/apt/sources.list.d/ernstp-mesarc-jammy.list
  Active apt repos in: /etc/apt/sources.list.d/google-earth-pro.list
    1: deb [arch=amd64] http: //dl.google.com/linux/earth/deb/ stable main
  Active apt repos in: /etc/apt/sources.list.d/official-package-repositories.list
    1: deb https: //mirror.cedia.org.ec/linuxmint-packages vera main upstream import backport
    2: deb https: //edgeuno-bog2.mm.fcix.net/ubuntu jammy main restricted universe multiverse
    3: deb https: //edgeuno-bog2.mm.fcix.net/ubuntu jammy-updates main restricted universe multiverse
    4: deb https: //edgeuno-bog2.mm.fcix.net/ubuntu jammy-backports main restricted universe multiverse
    5: deb http: //security.ubuntu.com/ubuntu/ jammy-security main restricted universe multiverse
Info:
  Processes: 370 Uptime: 11h 49m wakeups: 3 Memory: 39.12 GiB used: 7.52 GiB (19.2%) Init: systemd
  v: 249 runlevel: 5 Compilers: gcc: 11.4.0 alt: 11/12 Client: Unknown python3.10 client
  inxi: 3.3.13


(available in About System)
KDE Plasma Version: 
KDE Frameworks Version: 
Qt Version: 

ADDITIONAL INFORMATION
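
One way to narrow this down outside Kdenlive is to check which speech-recognition Python modules the system interpreter can actually import. The sketch below is only a diagnostic aid under stated assumptions: it assumes the modules are named "vosk", "srt" and "whisper" (the last being what the openai-whisper pip package installs), and that Kdenlive uses the same python3 that runs the script. Neither assumption is confirmed in this report.

```python
import importlib.util
import sys


def check_modules(names):
    """Return a dict mapping each top-level module name to True if it is importable."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Assumed module names: "vosk" and "srt" for the VOSK engine,
    # "whisper" for the openai-whisper package.
    print("Interpreter:", sys.executable)
    for name, found in check_modules(["vosk", "srt", "whisper"]).items():
        print(f"{name}: {'found' if found else 'missing'}")
```

If a module is reported as missing here even though pip3 installed it, the packages may have gone to a different Python installation than the one being queried.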