▍1. spetro.m
语音信号的语谱图生成函数,function [ ] = spetro(Winsiz, Shift, Base, Mode, Gray),5个参数分别为帧长、帧移、取值门限(一般设为0)、彩色模式、灰度模式(This is a matlab function for creating speech spectrogram.)
语音信号的语谱图生成函数,function [ ] = spetro(Winsiz, Shift, Base, Mode, Gray),5个参数分别为帧长、帧移、取值门限(一般设为0)、彩色模式、灰度模式(This is a matlab function for creating speech spectrogram.)
语音识别,有GUI界面,实现0~9数字语音识别(speaker identification)
本系统是基于矢量量化技术的说话人识别系统。可根据语音库以及实时录音识别且具有图形用户界面。压缩包内含有源码、语音信息库音频及演示视频等。(The system is based on vector quantization speaker recognition system. Identifiable and has a graphical user interface based on speech and real-time recording. Within the compressed package contains the source code, voice information and presentation of audio and video.)
微软研究院的说话人识别工具包,包括GMM-UBM、I-Vector。其中demo_gmm_ubm_artificial.m和demo_ivector_plda_artificial.m为生成模拟特征参数进行训练与识别的教学示例,十分适合初学者学习说话人识别基础算法。具体使用方法请看内部文档。(Microsoft Research s speaker recognition toolkit, including GMM-UBM, I-Vector. Demo_gmm_ubm_artificial.m and demo_ivector_plda_artificial.m which generates an analog characteristic parameters for example teaching training and recognition, very suitable for beginners to learn the basic algorithm for speaker recognition. See the specific use of internal documents.)
基于GMM的MFCC算法的说话人识别,Maltab的高斯混合模型,12维。(MFCC speaker recognition algorithm based on GMM)
语音识别中信号特征的提取,基因周期估计,共振峰估计等。(Signal feature extraction in speech recognition, cycle genes estimation, formant estimation,etc)
4元麦克风阵列去除语音混响,适合正在学习阵列去混响的同学们参考一下(Microphone Array Dereverberation)
3、语音特征提取与分类 首先, 待识别语音转化为电信号后输入识别系统, 经过预处理后用数学方法提取语音特征信号, 提取出的语音特征信号可以看成该段语音的模式。(3, the voice feature extraction and classification First, to be recognized voice into electrical signals input recognition system, after pretreatment with a mathematical method to extract speech characteristic signal extracted speech characteristic signal can be seen as the voice segment model.)
语音信号时频分析,包括语谱图,过零率,自编的自相关,以及最后用了三种方法(短时能量和过零率、谱熵法、Teager算子)进行端点检测,代码完整且测试通过(time and frequency domain analysis of speech signals,including spectrogram,rate of zerocrossing,autocorrelation.and three method of endpoint detection,especially the Teager )
语音信号时频分析,包括语谱图,过零率,自编的自相关,以及最后用了三种方法(短时能量和过零率、谱熵法、Teager算子)进行端点检测,代码完整且测试通过(time and frequency domain analysis of speech signals,including spectrogram,rate of zerocrossing,autocorrelation.and three method of endpoint detection,especially the Teager )
语音信号的时域、频域分析,包括短时能量分析、短时平均过零率、自相关函数、短时平均幅度差函数等。(Time-domain speech signal, frequency domain analysis, including short-term energy analysis, the average short-term zero-crossing rate, autocorrelation function, such as short-time average magnitude difference function.)
语音信号的时域、频域分析,包括短时能量分析、短时平均过零率、自相关函数、短时平均幅度差函数等。(Time-domain speech signal, frequency domain analysis, including short-term energy analysis, the average short-term zero-crossing rate, autocorrelation function, such as short-time average magnitude difference function.)
基于倒谱短时部分反映了语音的声道特性,先用汉明窗取一帧语音,然后经变换得到语音倒谱,将倒谱短时部分取出,进行正交反变换后将得到声道的对数谱,即得到语音频谱的包络。将频谱包络和频谱画在一张图上,有很好的对比效果。获取的包络效果十分好。(Based Cepstral partly reflects the short channel characteristics of the speech, first take a Hamming window with a frame of speech, and speech cepstrum obtained by converting the cepstrum short segment out inverse orthogonal transform to obtain channel will of the spectrum, i.e. to obtain the envelope of the speech spectrum. The spectral envelope and spectral painted on a chart, there is a good contrast. Get the envelope effect is very good.)
基于倒谱短时部分反映了语音的声道特性,先用汉明窗取一帧语音,然后经变换得到语音倒谱,将倒谱短时部分取出,进行正交反变换后将得到声道的对数谱,即得到语音频谱的包络。将频谱包络和频谱画在一张图上,有很好的对比效果。获取的包络效果十分好。(Based Cepstral partly reflects the short channel characteristics of the speech, first take a Hamming window with a frame of speech, and speech cepstrum obtained by converting the cepstrum short segment out inverse orthogonal transform to obtain channel will of the spectrum, i.e. to obtain the envelope of the speech spectrum. The spectral envelope and spectral painted on a chart, there is a good contrast. Get the envelope effect is very good.)
一种用于语音或者心音去噪和提取包络的预处理方法,对于后面信号处理更加方便(A method for denoising and extract voice or heart sounds envelope pretreatment method, the signal processing is more convenient for the latter)
一种用于语音或者心音去噪和提取包络的预处理方法,对于后面信号处理更加方便(A method for denoising and extract voice or heart sounds envelope pretreatment method, the signal processing is more convenient for the latter)
一种关于倒谱的实现研究处理方法的总结以及展望(One kind of research on cepstrum processing method to achieve a summary and outlook)