用于音频内容识别的分类系统技术方案

技术编号：4170718 阅读：348 留言：0更新日期：2012-04-11 18:40

一种音频内容分类系统，包括训练端与测试端，训练端通过音频特征提取模块提取音频测试样本的特征，并通过分类器训练模块训练出分类器的参数；而测试端包括与训练端共用的音频特征提取模块用以提取输入信号的音频特征，而分类器决策模块是根据音频特征提取模块的输出音频特征为输入，对第一帧运用训练部分训练得到的分类器参数进行分类，同时瞬态特征提取模块对该输入信号的瞬态特征进行提取并输出至瞬态特征平滑模块，该瞬态特征平滑模块来对分类器决策模块的输出结果进行修正并输出，同时增量学习模块利用已分类的音频帧的类别信息和特征信息来当作一组增量学习样本更新分类器的参数。

Classification system for audio content identification

An audio classification system, including training and testing training end end end, extraction module feature extraction of audio samples tested by audio features, and through the classifier training module trained classifier parameters; and the test and training end end includes audio feature extraction module is used to share the audio input signal feature extraction, and classifier decision according to the characteristics of the audio output module audio feature extraction module as input, the classification of the first frame using the classifier parameters obtained from the training and training, the transient feature extraction module to extract and output to the transient characteristics of the transient characteristics of smoothing module of the input signal, the output of the classifier decision module to modify the results and outputs the transient characteristics at the same time smoothing module, incremental learning module using the audio frame has the information and special category classification The information is used as a set of incremental learning samples to update the classifier parameters.

全部详细技术资料下载

【技术实现步骤摘要】
用于音频内容识别的分类系统技术领城本专利技术涉及一种模式识别及信号处理技术，尤其涉及一种用于音频内容识别的分类系统，
技术介绍
音频是多媒体中的一种重要媒体，音频信息检索技术是多媒体信息检索技术中的一个重要部分，相应的现有技术可参考中国专利1391211、 1223739 及1270361号及美国专利5,613,037、 6,292,776及5,440,662号等。在音频检索应用中，需要对音频数据进行分类，它的目的是区分输入的音频信号属于那一类，常见的音频类别有人声、背景噪声、流行音乐、古典音乐等，并且音频内容分类的应用也非常广泛，特别是在音频检索领域，音频内容分类起着决定性的作用，而在一些多媒体摘要的抽取过程中，音频内容分类作为视频内容检索的一种辅助手段也起到了重要作用。广义上来说，在很多语音和音频标准，例如3GPP的AMR-WB和AMR-WB+里，它们都用到了语音/噪声分类器和语音/音乐分类器，提供给编码器输入信号是哪一种音频信号，从而对每一种信号采取不同的编码器，因此设计一种良好的音频内容分类方法是相当关键和重要的。在通常的分类方法中，通常用...

【技术保护点】
一种音频内容分类系统，包括训练端与测试端，其特征在于训练端包括：　音频特征提取模块，用以提取音频测试样本的特征；　分类器训练模块，其根据音频特征提取模块采集的音频特征以及该音频信号的类别信息，训练出分类器的参数；　而测试端包括：　和训练端共用的音频特征提取模块；　分类器决策模块，根据音频特征提取模块的输出音频特征为输入，对第一帧运用训练部分训练得到的分类器参数进行分类；　瞬态特征提取模块，对该输入信号的瞬态特征进行提取并输出至瞬态特征平滑模块；　该瞬态特征平滑模块来对分类器决策模块的输出结果进行修正并输出；　增量学习模块利...

【技术特征摘要】

【专利技术属性】
技术研发人员：黄鹤云，林福辉，
申请(专利权)人：展讯通信上海有限公司，
类型：发明
国别省市：31[中国|上海]

全部详细技术资料下载我是这个专利的主人