SadTalker AI model can automatically generate video using a picture and a piece of audio - Code World

SadTalker AI model can automatically generate video using a picture and a piece of audio

Enterprise 2023-08-01 18:49:54 views: null

The SadTalker model is an open source model that uses pictures and audio files to automatically synthesize character animations. We give the model a picture and an audio file, and the model will perform corresponding actions on the face of the transferred picture according to the audio file, such as opening the mouth and blinking. , move the head and other actions.
SadTalker, which generates 3DMM's 3D motion coefficients (head pose, expression) from audio and implicitly modulates a novel 3D-aware facial rendering for generating talking head motion videos.

To learn realistic motion, SadTalker explicitly models the connections between audio and different types of motion coefficients, respectively. To be precise, SadTalker proposes the ExpNet model to learn accurate facial expressions from audio by extracting motion coefficients and 3D rendered facial movements. As for the head pose, SadTalker uses PoseVAE to synthesize different styles of head movements.
The model not only supports English, but also supports Chinese, we can experience it directly on the hugging face

https://huggingface.co/spaces/vinthony/SadTalker

Of course, the official open source code, we can run this model directly on our own computer

https://github.com/OpenTalker/SadTalker

Of course, if we want to run this program, we need to install python3.8 or above, and download the pre-trained modelÿ

Guess you like

Origin blog.csdn.net/weixin_44782294/article/details/131386693

SadTalker AI model can automatically generate video using a picture and a piece of audio

Google launches large language model VideoPoet: text and pictures can generate video and audio

Databricks launches AI model SDK, which can automatically generate SQL code

[Digital Human] 1. SadTalker | Using voice to drive a single picture to synthesize video (CVPR2023)

Can AI automatically generate paintings? Share several AI painting software

This article was automatically generated using an AI large model

Audio and video processing FFMPeg development actual combat (7) - Filter using (adding a picture in the video)

Draw a few times at will to let AI finish the picture, and can also generate AI portraits

AI tool to automatically generate icons

[Audio and video] FFmpeg open video | save the picture

Audio and video ffmpeg command picture and video conversion

Audio audio can not automatically play the solution

[AI Large Model] How to use LLM and intelligent question and answer BI natural language to automatically generate intelligent reports?

How to automatically generate Vue code from the prototype drawing of Mo Dao? Can AI help?

Amazon Cloud Technology Releases Amazon HealthScribe, Using Generative AI Technology to Automatically Generate Clinical Documents

AI technology practice｜Using Tencent Cloud recording file recognition to automatically generate subtitles for videos without subtitles

Automatically generate flash text video with Python

Audio and video basics color model

Audio & video & picture common editing tool integration

FFmpeg decoded video of audio and video programming, save the video as a picture

Clonezilla - generate a mirror that can be executed automatically

Automatically generate Chinese text using LSTM

The MetaGPT AI model is open source and can intelligently generate high-quality code

Android audio and video topics (1) Draw a picture on the Android platform, using at least 3 different APIs, ImageView, SurfaceView, and custom View

Use python to realize AI video synthesis audio, and the mouth shape can be right

Using AI to generate reports - AI123

【Phase 1 of InsCode Stable Diffusion Meitu Event with Selected High-Quality Characters】Self-test using the Inscode-AI drawing model to generate beautiful pictures with detailed tutorials (no configuration is required, Xiaobai can run it immediately)

The video tag automatically plays audio and video and draws waveforms

AI digital human video based on SadTalker (taking the deployment of AutoDL computing power cloud platform as an example)

AI: Introduction to AI tools and products in the field of artificial intelligence by category (text, picture, programming, office, video, audio, multimodal), and detailed guides on how to use them (continuously updated)

Recommended

Ranking

error: (-215:Assertion failed) !_img.empty() in function ‘cv::imwrite‘

Database migration between Navicat servers

Minimum number of rotation of the array: Array

balenaEtcher for mac (make a boot disk software) v1.5.67

Custom processing serialization and deserialization in jackson

Mu-en-mask system development software

Mastering Regular Expressions

Find mileage Java--

Web pages can not directly concern the public micro-channel number how to do? A key to arouse public concern number of micro-channel solutions

[CodeForces - 739B] Alyona and a tree Tree + [difference] + bipartite

Daily

More

2024-05-12(28)

2024-05-11(32)

2024-05-10(34)

2024-05-09(32)

2024-05-08(18)

2024-05-07(34)

2024-05-06(6)

2024-05-05(0)

2024-05-04(18)

2024-05-03(8)