speechbrain（一）MFCC特征提取

企业开发 2022-01-06 23:29:11 阅读次数: 0

流程图：

MFCC的提取过程：

声波----->DFT------->幅值图------->filter bank-------->log-------->DCT------>MFCC

speechbrain.processing.features下的类和函数

STFT：计算短时傅里叶变换

spectral_magnitude：返回复频谱图的幅值

Filterbank：计算filter bank特征

Deltas：计算delta系数（时间的导数）

ContextWindow：计算上下文，窗口大小由left_frames, right_frames参数指定

InputNormalization：数值归一化（减均值，除方差），avg_factor设置统计数据和累计统计数据之间的权重因子

from speechbrain.dataio.dataio import read_audio
from speechbrain.processing.features import STFT, spectral_magnitude, Filterbank, DCT, InputNormalization,ContextWindow, Deltas

signal =read_audio('samples/audio_samples/example1.wav')
signal = torch.tensor(signal).unsqueeze(0)
compute_stft = STFT(sample_rate=16000, win_length=25, n_fft=400,                     
                            window_fn=torch.hamming_window)
features = compute_stft(signal)
features = spectral_magnitude(features)
compute_fbank = Filterbank(n_mels=40, log_mel=True)
features = compute_fbank(features)
compute_mfcc = DCT(input_size=40, n_out=20)
features = compute_mfcc(features)
compute_deltas = Deltas(input_size=20)
deltas1 = compute_deltas(features)
deltas2 = compute_deltas(features)
features = torch.cat([features, deltas1, deltas2], dim=2)
compute_cw = ContextWindow(left_frames=3, right_frames=3)
features = compute_cw(features)
norm = InputNormalization()
features = norm(features, avg_factor = torch.tensor([1]).float())

speechbrain官网：SpeechBrain — SpeechBrain 0.5.0 documentationhttps://speechbrain.readthedocs.io/en/latest/index.html

猜你喜欢

转载自blog.csdn.net/qq_55796594/article/details/122229476

speechbrain（一）MFCC特征提取

mfcc特征提取

特征提取-MFCC

HTK特征提取(MFCC)代码分析(一)

语音识别-MFCC特征提取

语音特征提取方法-MFCC

librosa包进行mfcc特征提取

声音特征提取 MFCC向量

MFCC特征提取的MATLAB代码

深度解析MFCC特征提取

（一）特征提取

mfcc特征提取-39维度（基于kaldi）

AI大语音（四）——MFCC特征提取

【常用音频处理】hpcp/mfcc/fbank特征提取总结

语音识别 — 特征提取 MFCC 和 PLP

python提取mfcc特征

文本特征：特征提取（一）

语音识别——声音特征提取MFCC向量的具体步骤

基于MFCC特征提取和神经网络的语音信号识别算法matlab仿真

基于mfcc和DTW语音信息特征提取算法matlab仿真

SURF特征提取分析（一）

图像搜索特征提取（一）

SIFT特征提取

surf特征提取

sklearn 特征提取

特征提取总结

图像特征提取

opencv 特征提取

信号特征提取

文本特征提取

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)