Summary: | Speech to text work incorrrectly | ||
---|---|---|---|
Product: | [Applications] kdenlive | Reporter: | 杨雪枫 <amie1972> |
Component: | User Interface & Miscellaneous | Assignee: | Jean-Baptiste Mardelle <jb> |
Status: | RESOLVED FIXED | ||
Severity: | normal | CC: | amie1972, an.onymer, fritzibaby |
Priority: | NOR | Flags: | fritzibaby:
timeline_corruption+
|
Version First Reported In: | 21.04.3 | ||
Target Milestone: | --- | ||
Platform: | Compiled Sources | ||
OS: | Linux | ||
Latest Commit: | Version Fixed In: | 21.08 | |
Sentry Crash Report: | |||
Attachments: | empty output subtitle |
I tested it with AppImage 21.04.3 and the subtitle are created (Kdenlive language: English, test clip: English). Please check with the AppImage if it's working. bellow is my kdenlive log: subtitle constructor Filter! Tractor! root context get sub model new function === STARTING RENDER B === STARTING RENDER C, IN: 0 - 1269 [wav @ 0x7f946042b600] Using AVStream.codec to pass codec parameters to muxers is deprecated, use AVStream.codecpar instead. [wav @ 0x7f946042b600] Timestamps are unset in a packet for stream 0. This is deprecated and will stop working in the future. Fix your code to set the timestamps properly [wav @ 0x7f946042b600] Encoder did not produce proper pts, making some up. === STARTING RENDER D === RUNNING SPEECH ANALYSIS: "/tmp/.mount_kdenlitgVg4v/usr/share/kdenlive/scripts/speech.py" ==== ANALYSIS SPEECH: "/home/sysadmin/.local/share/kdenlive/speechmodels" - "vosk-model-small-cn-0.3" - "/tmp/tRxkSh.wav" - "/tmp/ziFhhU.srt" /// TERMINATING SPEECH JOB +++++++++++++++++++++++++++ srt File srt file import Saving subtitle filter: "/tmp/1626703653381.srt" But where I get empty file: [sysadmin@localhost ~]$ cat /tmp/1626703653381.srt [sysadmin@localhost ~]$ while kdenlive-21.04.2 work correctly. Goto settings -> configure Kdenlive -> speech to text -> click on check configuration -> which vosk and srt version do you have? Vosk 0.3.27,Srt 3.4.1 I have the same problem. I documented my steps in this post: https://www.reddit.com/r/kdenlive/comments/nwk7bv/speechtotext_stops_after_starting_speech/ OS is Windows 10 Pro, Version 2004, build 19041.1110 kdenlive 21.04.3, MLT Version 6.26.1 Tried vosk-model-en-us-daanzu-20200905 I can confirm. Even setting back vosk to a working version with pip3 install --no-cache-dir vosk==0.3.22. doesn’t make speech-to-text running. Please try following work around: Window-key+r enter "cmd". Enter in cmd.exe "pip3 install --no-cache-dir vosk==0.3.22." (wihtout ") Kdenlive: goto settings -> configure Kdenlive -> Speech to text -> click on check configuration -> you should see vosk 0.3.22. Restart Kdenlive -> gotto settings -> speech to text -> click on check configuration -> click check for update -> after the update to 0.3.30 is done, speech to text should work. (In reply to emohr from comment #8) > Please try following work around: > > Window-key+r enter "cmd". Enter in cmd.exe "pip3 install --no-cache-dir > vosk==0.3.22." (wihtout ") pip install works fine > Kdenlive: goto settings -> configure Kdenlive -> Speech to text -> click on > check configuration -> you should see vosk 0.3.22. "Check configuration" only tells me "Speech to text is properly configured" in a green box. Above that, I have a list of vosk models. > Restart Kdenlive -> gotto settings -> speech to text -> click on check > configuration -> click check for update -> after the update to 0.3.30 is > done, speech to text should work. "Check updates" shows "Updating modules" for a second and then just "Speech to text is properly configured" again. Unfortunately, this did not make it work for me. :S I assume you are on 21.04.3. It seems vosk 0.3.22 doesn't work properly with 21.04.3. Close Kdenlive. Install only vosk 0.3.22. Do the following: window+r enter cmd. Enter "pip3 install --no-cache-dir vosk==0.3.22" (wihtout "). Start Kdenlive. Does speech-to text works now? (In reply to emohr from comment #10) > I assume you are on 21.04.3. It seems vosk 0.3.22 doesn't work properly with > 21.04.3. > > Close Kdenlive. Install only vosk 0.3.22. Do the following: window+r enter > cmd. Enter "pip3 install --no-cache-dir vosk==0.3.22" (wihtout "). > > Start Kdenlive. Does speech-to text works now? No, it doesn't. cmd says "Requirement already satisfied:(...)", as I already performed the task before. Do I have to use an older version of kdenlive? I don't already understand how the work around is related to what you told me to do. My idea was to install an old vosk version so that we can force Kdenlive to update to a new version by themselves and that Kdenlive set all links again. I assume 21.04.0 should work: https://download.kde.org/stable/kdenlive/21.04/windows/ (In reply to emohr from comment #12) Sadly no. I installed 21.04.0 parallel to the current version. The problem persists, the behaviour did not change :( Maybe it helps to reset Kdenlive. To do so following these steps: https://userbase.kde.org/Kdenlive/Manual/Windows_Issues#Cannot_open_projects_made_with_previous_version It seems that 21.04.3 still has the problem. Please check with the nightly build if speech to text is working: https://binary-factory.kde.org/job/Kdenlive_Nightly_mingw64/lastSuccessfulBuild/artifact/ Nightly Build still has this bug Update vosk to 0.3.30 seems resolve this,so I shoud close this bug. Thank you for the feedback. Glad to hear it works with nightly build and vosk 0.3.30. |
Created attachment 139994 [details] empty output subtitle Speech to text starts and finish with no progress ,and results empty subtitle(no subtitle created.),no error reported.