20221125 Use PR2023 (Premiere Pro 2023) to automatically transcribe subtitles for MKV videos recorded with OBS Studio

2022/11/25 19:07

01 obs.png

OBS Studio (64bit)

 


02 obs settings.png

 

03 obs output.png

 

04 obs defaults to MKV.png

 

05 change obs from MKV to MP4.png

WARNING: Recordings saved to MP4/MOV will be unrecoverable if the file cannot be finalized (e.g. due to a BSOD, power loss, etc.). If you want to record multiple audio tracks, consider recording in MKV and remuxing the recording to MP4/MOV after it is finished (File → Remux Recordings).

 

06 obs remux recordings.png

07 obs remux.png

08 obs remux 2.png

09 Remuxing.png

10 Remuxing completed.png

Remuxing finished
Recording remuxed
OK
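
The remux step above is done in the OBS GUI. As a rough command-line equivalent, the sketch below builds an ffmpeg call that copies every stream from the MKV into an MP4 without re-encoding. This is an assumption, not what OBS runs internally. The file name "2022-07-02 13-10-13.mkv" is inferred from the .mka name used later in this post, and the helper names build_remux_command and remux are hypothetical.

```python
import subprocess
from pathlib import Path


def build_remux_command(src: str) -> list[str]:
    """Return an ffmpeg command that remuxes src into an .mp4 beside it."""
    dst = str(Path(src).with_suffix(".mp4"))
    # -map 0 selects every stream of the input;
    # -c copy copies the streams as-is, so nothing is re-encoded.
    return ["ffmpeg", "-i", src, "-map", "0", "-c", "copy", dst]


def remux(src: str) -> None:
    """Run the remux; raises CalledProcessError if ffmpeg reports a failure."""
    subprocess.run(build_remux_command(src), check=True)


if __name__ == "__main__":
    # Assumed recording name, shown only to illustrate the resulting command.
    print(build_remux_command("2022-07-02 13-10-13.mkv"))
```

Because the streams are only copied, this should finish far faster than real time, as the ffmpeg run in Method 2 does. MP4 cannot hold every codec that MKV can, so ffmpeg may reject some inputs.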

 

11PR import file.png

Importing files

 

12Import file renderings.png

 

13PR transcript sequence.png

 

14 PR transcription audio data.png

Creating automated transcriptions...
Rendering audio data

 

14PR transcription Simplified Chinese.png

Create Transcript
Language: Simplified Chinese (the default is English; note that PR does not remember the language selected last time)
 

 

16PR recognition completed.png

Start time: 13:57 on 2022-11-24
End time: 16:47 on 2022-11-24
The video is 4 hours 23 minutes long (about 4.5 hours), and recognition took 2 hours 50 minutes (about 3 hours).
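
The timing above can be checked with a few lines of arithmetic. This sketch only uses the start time, end time, and video length quoted in this post; the variable names are illustrative.

```python
from datetime import datetime, timedelta

# Times reported in the screenshot above.
start = datetime(2022, 11, 24, 13, 57)
end = datetime(2022, 11, 24, 16, 47)
elapsed = end - start

# Video length as stated in the post (4 hours 23 minutes).
video = timedelta(hours=4, minutes=23)

# Dividing two timedeltas gives a plain float:
# how many seconds of video were transcribed per second of wall-clock time.
speed = video / elapsed

print("recognition took:", elapsed)
print("speed relative to real time:", round(speed, 2))
```

On this machine the transcription ran at roughly 1.5 times real time, so a recording of similar length should be expected to take a few hours.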

 

17Create descriptive subtitles.png

Create Captions

 

18 create from sequence transcript.png

Create subtitles

 

19Creating subtitles.png

 

20 Generate subtitles completed.png

 

21 Export to SRT file.png

 

22SRT.png
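
Once the SRT file is exported, it can be sanity-checked outside PR. The sketch below parses the standard SRT layout (index line, "HH:MM:SS,mmm --> HH:MM:SS,mmm" timing line, then text). The SAMPLE text is made up for illustration and is not taken from the author's export; parse_srt is a hypothetical helper.

```python
import re
from datetime import timedelta

# Timing line of an SRT cue, e.g. "00:00:01,000 --> 00:00:03,500".
TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2}),(\d{3}) --> (\d+):(\d{2}):(\d{2}),(\d{3})"
)


def parse_srt(text: str) -> list[tuple[timedelta, timedelta, str]]:
    """Return (start, end, caption text) for every well-formed cue."""
    cues = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = block.splitlines()
        if len(lines) < 2:
            continue
        match = TIMING.search(lines[1])
        if not match:
            continue
        h1, m1, s1, ms1, h2, m2, s2, ms2 = map(int, match.groups())
        start = timedelta(hours=h1, minutes=m1, seconds=s1, milliseconds=ms1)
        end = timedelta(hours=h2, minutes=m2, seconds=s2, milliseconds=ms2)
        cues.append((start, end, "\n".join(lines[2:])))
    return cues


# Made-up sample, only to show the expected layout.
SAMPLE = """1
00:00:01,000 --> 00:00:03,500
hello

2
00:00:04,000 --> 00:00:06,000
second line
"""

if __name__ == "__main__":
    for start, end, caption in parse_srt(SAMPLE):
        print(start, end, caption)
```

For a real export, read the file first, for example with open(path, encoding="utf-8-sig").read(), then compare the end time of the last cue with the video length to confirm that the whole recording was transcribed.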

 

Method 2: MKV→MKA→MP4 (an MP4 file that contains only the audio track)

31 MKV.png

https://mkvtoolnix.download/downloads.html#windows
https://www.fosshub.com/MKVToolNix.html

 

32 Import MKV.png

Start multiplexing

 

33 MKV2mka.png

 

34 ffmpeg1.png

ffmpeg -i "2022-07-02 13-10-13.mka" -acodec copy  "2022-07-02 13-10-13.mp4"
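
The command above copies the AAC track from the .mka into an .mp4 container without re-encoding. A minimal sketch of the same call built from Python follows. Passing the original .mkv instead of the .mka, with -vn dropping the video, might skip the MKVToolNix step, but that variant is an assumption and was not tested in this post. The helper name build_audio_mp4_command is hypothetical.

```python
from pathlib import Path


def build_audio_mp4_command(src: str) -> list[str]:
    """Return an ffmpeg command that writes an audio-only .mp4 beside src."""
    source = Path(src)
    if source.suffix.lower() == ".mp4":
        # The output name would collide with the input file.
        raise ValueError("source is already an MP4")
    dst = source.with_suffix(".mp4")
    # -vn ignores any video stream (harmless for an .mka, which has none);
    # -acodec copy copies the audio as-is, matching the command above.
    return ["ffmpeg", "-i", str(source), "-vn", "-acodec", "copy", str(dst)]


if __name__ == "__main__":
    print(build_audio_mp4_command("2022-07-02 13-10-13.mka"))
```

The resulting list can be passed to subprocess.run(..., check=True) to execute it.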

Microsoft Windows [Version 10.0.19044.1889]
(c) Microsoft Corporation. all rights reserved.

C:\Users\ASUS>cd C:\AI-OCR

C:\AI-OCR>
C:\AI-OCR>ffmpeg -i "2022-07-02 13-10-13.mka" -acodec copy  "2022-07-02 13-10-13.mp4"
ffmpeg version N-106352-g2a1eae2179-20220319 Copyright (c) 2000-2022 the FFmpeg developers
  built with gcc 11.2.0 (crosstool-NG 1.24.0.533_681aaef)
  configuration: --prefix=/ffbuild/prefix --pkg-config-flags=--static --pkg-config=pkg-config --cross-prefix=x86_64-w64-mingw32- --arch=x86_64 --target-os=mingw32 --enable-gpl --enable-version3 --disable-debug --disable-w32threads --enable-pthreads --enable-iconv --enable-libxml2 --enable-zlib --enable-libfreetype --enable-libfribidi --enable-gmp --enable-lzma --enable-fontconfig --enable-libvorbis --enable-opencl --disable-libpulse --enable-libvmaf --disable-libxcb --disable-xlib --enable-amf --enable-libaom --enable-avisynth --enable-libdav1d --enable-libdavs2 --disable-libfdk-aac --enable-ffnvcodec --enable-cuda-llvm --enable-frei0r --enable-libgme --enable-libass --enable-libbluray --enable-libmp3lame --enable-libopus --enable-librist --enable-libtheora --enable-libvpx --enable-libwebp --enable-lv2 --enable-libmfx --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopenmpt --enable-librav1e --enable-librubberband --enable-schannel --enable-sdl2 --enable-libsoxr --enable-libsrt --enable-libsvtav1 --enable-libtwolame --enable-libuavs3d --disable-libdrm --disable-vaapi --enable-libvidstab --enable-vulkan --enable-libshaderc --enable-libplacebo --enable-libx264 --enable-libx265 --enable-libxavs2 --enable-libxvid --enable-libzimg --enable-libzvbi --extra-cflags=-DLIBTWOLAME_STATIC --extra-cxxflags= --extra-ldflags=-pthread --extra-ldexeflags= --extra-libs=-lgomp --extra-version=20220319
  libavutil      57. 24.101 / 57. 24.101
  libavcodec     59. 25.100 / 59. 25.100
  libavformat    59. 20.101 / 59. 20.101
  libavdevice    59.  6.100 / 59.  6.100
  libavfilter     8. 29.100 /  8. 29.100
  libswscale      6.  6.100 /  6.  6.100
  libswresample   4.  6.100 /  4.  6.100
  libpostproc    56.  5.100 / 56.  5.100
Input #0, matroska,webm, from '2022-07-02 13-10-13.mka':
  Metadata:
    encoder         : libebml v1.4.4 + libmatroska v1.7.1
    creation_time   : 2022-11-24T08:54:07.000000Z
  Duration: 04:23:37.92, start: 0.000000, bitrate: 137 kb/s
  Stream #0:0: Audio: aac (LC), 48000 Hz, stereo, fltp (default)
    Metadata:
      title           : simple_aac_recording
      BPS             : 136245
      DURATION        : 04:23:37.920338304
      NUMBER_OF_FRAMES: 741465
      NUMBER_OF_BYTES : 269389856
      _STATISTICS_WRITING_APP: mkvmerge v72.0.0 ('Minuano (Six-eight)') 64-bit
      _STATISTICS_WRITING_DATE_UTC: 2022-11-24 08:54:07
      _STATISTICS_TAGS: BPS DURATION NUMBER_OF_FRAMES NUMBER_OF_BYTES
Output #0, mp4, to '2022-07-02 13-10-13.mp4':
  Metadata:
    encoder         : Lavf59.20.101
  Stream #0:0: Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, stereo, fltp (default)
    Metadata:
      title           : simple_aac_recording
      BPS             : 136245
      DURATION        : 04:23:37.920338304
      NUMBER_OF_FRAMES: 741465
      NUMBER_OF_BYTES : 269389856
      _STATISTICS_WRITING_APP: mkvmerge v72.0.0 ('Minuano (Six-eight)') 64-bit
      _STATISTICS_WRITING_DATE_UTC: 2022-11-24 08:54:07
      _STATISTICS_TAGS: BPS DURATION NUMBER_OF_FRAMES NUMBER_OF_BYTES
Stream mapping:
  Stream #0:0 -> #0:0 (copy)
Press [q] to stop, [?] for help
size=  267995kB time=04:23:37.92 bitrate= 138.8kbits/s speed=3.76e+03x
video:0kB audio:263076kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 1.869737%

C:\AI-OCR>
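
The last two lines of the ffmpeg output can be cross-checked by hand. The sketch below recomputes the bitrate and the muxing overhead from the size, audio size, and duration printed in the log. It assumes the "kB" figures are units of 1024 bytes, which is consistent with the numbers in this log.

```python
# Figures copied from the ffmpeg output above.
size_kb = 267995                        # "size=  267995kB"
audio_kb = 263076                       # "audio:263076kB"
duration_s = 4 * 3600 + 23 * 60 + 37.92  # "time=04:23:37.92"

# Average bitrate of the written file in kbit/s.
bitrate_kbps = size_kb * 1024 * 8 / duration_s / 1000

# Share of the file that is container data rather than audio payload.
overhead_pct = (size_kb - audio_kb) / audio_kb * 100

print(round(bitrate_kbps, 1), "kbit/s")
print(round(overhead_pct, 2), "% muxing overhead")
```

Both values agree with the log (138.8 kbits/s and about 1.87%), and "video:0kB" confirms that the output MP4 contains audio only.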

 

35 ffmpeg2.png

 

36 pr import audio.png

 

37 Transcript sequence.png

 

38 Sequence 2.png

 

39PR recognition completed.png

 

40 Convert SRT.png

 

41 Import MKV.png

 

42 unknown format.png

File import failed

 

Origin blog.csdn.net/wb4916/article/details/128043198