20221125 Using PR2023 (Adobe Premiere Pro 2023) to automatically transcribe subtitles for an MKV screen recording made with OBS Studio
2022/11/25 19:07
01 obs.png
OBS Studio (64bit)
02 obs settings.png
03 obs output.png
04 obs defaults to MKV.png
05 change obs from MKV to MP4.png
WARNING: Recordings saved to MP4/MOV will be unrecoverable if the file cannot be finalized (e.g. because of a blue screen (BSOD), power loss, etc.). If you want to record multiple audio tracks, consider recording in MKV and remuxing the recording to MP4/MOV when you are done (File→Remux Recordings)
06 obs remux recordings.png
07 obs remux.png
08 obs remux 2.png
09 Remuxing.png
10 Remuxing completed.png
Remuxing finished
Recording remuxed
OK
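The remux above was done in the OBS GUI. As a rough command-line equivalent, the sketch below builds an ffmpeg stream-copy command that moves the streams of an MKV into an MP4 container without re-encoding. It assumes ffmpeg is on PATH; the function names and the sample file name are illustrative and not part of OBS. Stream copy only works when the codecs in the MKV are allowed in MP4.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Optional


def build_remux_command(src: str, dst: Optional[str] = None) -> List[str]:
    """Build an ffmpeg argument list that copies all streams of src into an MP4."""
    src_path = Path(src)
    # Default output: same name as the input, with an .mp4 extension.
    dst_path = Path(dst) if dst else src_path.with_suffix(".mp4")
    # -map 0 keeps every stream (e.g. multiple audio tracks); -c copy avoids re-encoding.
    return ["ffmpeg", "-i", str(src_path), "-map", "0", "-c", "copy", str(dst_path)]


def remux(src: str, dst: Optional[str] = None) -> None:
    """Run the remux; raises if ffmpeg is missing or exits with an error."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    subprocess.run(build_remux_command(src, dst), check=True)


# Only print the command here; call remux(...) to actually run it.
print(build_remux_command("2022-07-02 13-10-13.mkv"))
```

Recording in MKV and remuxing afterwards keeps the crash safety described in the OBS warning while still ending up with an MP4.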
11PR import file.png
Importing files
12 Import file result.png
13PR transcript sequence.png
14 PR transcription audio data.png
Creating automated transcriptions...
Rendering audio data
14PR transcription Simplified Chinese.png
Create Transcript
Language: Simplified Chinese (the default is always English; this is a problem, because the dialog does not remember the last language selected)
16PR recognition completed.png
Start time: 13:57 on 2022-11-24
End time: 16:47 on 2022-11-24
The video is 4 hours 23 minutes long (roughly 4.5 hours), and recognition took about 3 hours (2 hours 50 minutes by the start and end times above).
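The timing can be checked with a few lines of arithmetic. This sketch uses the start and end times noted above and the duration 04:23:37.92 reported by ffmpeg for the audio of the same recording; treating that as the video length is an assumption.

```python
from datetime import datetime, timedelta

# Start and end of the transcription, as noted above.
start = datetime(2022, 11, 24, 13, 57)
end = datetime(2022, 11, 24, 16, 47)
elapsed = end - start

# Duration reported by ffmpeg for the audio track: 04:23:37.92.
video = timedelta(hours=4, minutes=23, seconds=37.92)

# Dividing one timedelta by another gives a plain float ratio.
speed = video / elapsed

print(f"elapsed: {elapsed}")
print(f"speed: {speed:.2f}x real time")
```

On this machine the transcription ran at roughly one and a half times real time, so a recording of similar length should take a bit under three hours.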
17 Create captions.png
Create Captions
18 Create from sequence transcript.png
Create captions
19Creating subtitles.png
20 Generate subtitles completed.png
21 Export to SRT file.png
22SRT.png
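Once the SRT file is exported it is plain text and easy to post-process. The following is a minimal sketch of an SRT reader, assuming the usual layout (index line, `HH:MM:SS,mmm --> HH:MM:SS,mmm` line, then text lines, with blank lines between cues). The `Cue` type, the function names, and the sample captions are made up for illustration; they are not from the exported file.

```python
import re
from datetime import timedelta
from typing import List, NamedTuple


class Cue(NamedTuple):
    index: int
    start: timedelta
    end: timedelta
    text: str


_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def _parse_time(stamp: str) -> timedelta:
    """Convert 'HH:MM:SS,mmm' into a timedelta."""
    match = _TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad SRT timestamp: {stamp!r}")
    hours, minutes, seconds, millis = map(int, match.groups())
    return timedelta(hours=hours, minutes=minutes, seconds=seconds, milliseconds=millis)


def parse_srt(content: str) -> List[Cue]:
    """Split SRT text into cues; blocks with fewer than two lines are skipped."""
    # Drop a UTF-8 byte order mark and normalize Windows line endings.
    normalized = content.lstrip("\ufeff").replace("\r\n", "\n").strip()
    cues = []
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        if len(lines) < 2:
            continue
        start_raw, _, end_raw = lines[1].partition("-->")
        cues.append(Cue(int(lines[0]), _parse_time(start_raw),
                        _parse_time(end_raw), "\n".join(lines[2:])))
    return cues


# Invented sample text, only to show the shape of the data.
SAMPLE = """1
00:00:01,000 --> 00:00:03,500
first caption

2
00:00:04,000 --> 00:00:06,250
second caption
line two
"""

for cue in parse_srt(SAMPLE):
    print(cue.index, cue.start, cue.end, repr(cue.text))
```

For a real file, read it with `open(path, encoding="utf-8")` and pass the text to `parse_srt`.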
Method 2: MKV→MKA→MP4 (an MP4 file that contains only the audio track)
31 MKV.png
https://mkvtoolnix.download/downloads.html#windows
https://www.fosshub.com/MKVToolNix.html
32 Import MKV.png
Start multiplexing
33 MKV2mka.png
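The MKV→MKA step above was done in the MKVToolNix GUI. The same tool ships a command-line program, mkvmerge, so the step could probably be scripted as in the sketch below. It assumes mkvmerge is on PATH; the function name and sample file name are illustrative.

```python
from pathlib import Path
from typing import List


def build_mka_command(src: str) -> List[str]:
    """Build an mkvmerge argument list that writes the audio of src to a .mka file."""
    dst = Path(src).with_suffix(".mka")
    # -o names the output file; --no-video leaves out all video tracks.
    return ["mkvmerge", "-o", str(dst), "--no-video", src]


# Printed only; pass the list to subprocess.run(..., check=True) to execute it.
print(build_mka_command("2022-07-02 13-10-13.mkv"))
```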
34 ffmpeg1.png
ffmpeg -i "2022-07-02 13-10-13.mka" -acodec copy "2022-07-02 13-10-13.mp4"
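For repeated use, the command above can be generated from the input file name. The sketch below reproduces it as an argument list; the optional `drop_video` switch adds `-vn`, which would let ffmpeg take the audio straight from the MKV and skip the MKA step. That variant is an untested assumption, not what was done in these notes, and the function name is illustrative.

```python
from pathlib import Path
from typing import List


def build_audio_copy_command(src: str, drop_video: bool = False) -> List[str]:
    """Build an ffmpeg argument list that copies the audio of src into an MP4."""
    dst = Path(src).with_suffix(".mp4")
    cmd = ["ffmpeg", "-i", src]
    if drop_video:
        # -vn tells ffmpeg to ignore video streams in the input.
        cmd.append("-vn")
    # -acodec copy keeps the AAC audio as is, without re-encoding.
    cmd += ["-acodec", "copy", str(dst)]
    return cmd


print(build_audio_copy_command("2022-07-02 13-10-13.mka"))
print(build_audio_copy_command("2022-07-02 13-10-13.mkv", drop_video=True))
```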
Microsoft Windows [Version 10.0.19044.1889]
(c) Microsoft Corporation. All rights reserved.
C:\Users\ASUS>cd C:\AI-OCR
C:\AI-OCR>
C:\AI-OCR>ffmpeg -i "2022-07-02 13-10-13.mka" -acodec copy "2022-07-02 13-10-13.mp4"
ffmpeg version N-106352-g2a1eae2179-20220319 Copyright (c) 2000-2022 the FFmpeg developers
built with gcc 11.2.0 (crosstool-NG 1.24.0.533_681aaef)
configuration: --prefix=/ffbuild/prefix --pkg-config-flags=--static --pkg-config=pkg-config --cross-prefix=x86_64-w64-mingw32- --arch=x86_64 --target-os=mingw32 --enable-gpl --enable-version3 --disable-debug --disable-w32threads --enable-pthreads --enable-iconv --enable-libxml2 --enable-zlib --enable-libfreetype --enable-libfribidi --enable-gmp --enable-lzma --enable-fontconfig --enable-libvorbis --enable-opencl --disable-libpulse --enable-libvmaf --disable-libxcb --disable-xlib --enable-amf --enable-libaom --enable-avisynth --enable-libdav1d --enable-libdavs2 --disable-libfdk-aac --enable-ffnvcodec --enable-cuda-llvm --enable-frei0r --enable-libgme --enable-libass --enable-libbluray --enable-libmp3lame --enable-libopus --enable-librist --enable-libtheora --enable-libvpx --enable-libwebp --enable-lv2 --enable-libmfx --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopenmpt --enable-librav1e --enable-librubberband --enable-schannel --enable-sdl2 --enable-libsoxr --enable-libsrt --enable-libsvtav1 --enable-libtwolame --enable-libuavs3d --disable-libdrm --disable-vaapi --enable-libvidstab --enable-vulkan --enable-libshaderc --enable-libplacebo --enable-libx264 --enable-libx265 --enable-libxavs2 --enable-libxvid --enable-libzimg --enable-libzvbi --extra-cflags=-DLIBTWOLAME_STATIC --extra-cxxflags= --extra-ldflags=-pthread --extra-ldexeflags= --extra-libs=-lgomp --extra-version=20220319
libavutil 57. 24.101 / 57. 24.101
libavcodec 59. 25.100 / 59. 25.100
libavformat 59. 20.101 / 59. 20.101
libavdevice 59. 6.100 / 59. 6.100
libavfilter 8. 29.100 / 8. 29.100
libswscale 6. 6.100 / 6. 6.100
libswresample 4. 6.100 / 4. 6.100
libpostproc 56. 5.100 / 56. 5.100
Input #0, matroska,webm, from '2022-07-02 13-10-13.mka':
Metadata:
encoder : libebml v1.4.4 + libmatroska v1.7.1
creation_time : 2022-11-24T08:54:07.000000Z
Duration: 04:23:37.92, start: 0.000000, bitrate: 137 kb/s
Stream #0:0: Audio: aac (LC), 48000 Hz, stereo, fltp (default)
Metadata:
title : simple_aac_recording
BPS : 136245
DURATION : 04:23:37.920338304
NUMBER_OF_FRAMES: 741465
NUMBER_OF_BYTES : 269389856
_STATISTICS_WRITING_APP: mkvmerge v72.0.0 ('Minuano (Six-eight)') 64-bit
_STATISTICS_WRITING_DATE_UTC: 2022-11-24 08:54:07
_STATISTICS_TAGS: BPS DURATION NUMBER_OF_FRAMES NUMBER_OF_BYTES
Output #0, mp4, to '2022-07-02 13-10-13.mp4':
Metadata:
encoder : Lavf59.20.101
Stream #0:0: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, stereo, fltp (default)
Metadata:
title : simple_aac_recording
BPS : 136245
DURATION : 04:23:37.920338304
NUMBER_OF_FRAMES: 741465
NUMBER_OF_BYTES : 269389856
_STATISTICS_WRITING_APP: mkvmerge v72.0.0 ('Minuano (Six-eight)') 64-bit
_STATISTICS_WRITING_DATE_UTC: 2022-11-24 08:54:07
_STATISTICS_TAGS: BPS DURATION NUMBER_OF_FRAMES NUMBER_OF_BYTES
Stream mapping:
Stream #0:0 -> #0:0 (copy)
Press [q] to stop, [?] for help
size= 267995kB time=04:23:37.92 bitrate= 138.8kbits/s speed=3.76e+03x
video:0kB audio:263076kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 1.869737%
C:\AI-OCR>
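The figures in the last two log lines can be cross-checked by hand. This sketch assumes that ffmpeg's "kB" here means 1024 bytes and "kbits" means 1000 bits, which matches the numbers in the log.

```python
# Values copied from the log above.
size_kib = 267995          # size=  267995kB
audio_kib = 263076         # audio:263076kB
duration_s = 4 * 3600 + 23 * 60 + 37.92   # time=04:23:37.92
speed = 3760               # speed=3.76e+03x

# Average bitrate of the output file in kbit/s.
bitrate_kbits = size_kib * 1024 * 8 / duration_s / 1000

# Container overhead relative to the raw audio payload, in percent.
overhead_pct = (size_kib - audio_kib) / audio_kib * 100

# Approximate wall-clock time of the copy.
copy_seconds = duration_s / speed

print(f"bitrate:  {bitrate_kbits:.1f} kbit/s")
print(f"overhead: {overhead_pct:.2f} %")
print(f"copy took about {copy_seconds:.1f} s")
```

The results agree with the log (138.8 kbits/s, about 1.87 % muxing overhead), and the copy itself takes only a few seconds because nothing is re-encoded.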
35 ffmpeg2.png
36 pr import audio.png
37 Transcript sequence.png
38 Sequence 2.png
39PR recognition completed.png
40 Convert to SRT.png
41 Import MKV.png
42 unrecognized format.png
File import failed