Where can I download the TIMIT speech corpus? Experts, please help.

Source: web crawl (WebSpider)  Time: 2017-08-13 12:23  Tags: 朗读女 (Langdunv) Xiaoyan voice library download

TIMIT speech corpus (repost)
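TIMIT is the speech corpus used in most papers and research projects, and it is suited to speech recognition, speaker recognition, and other speech signal processing work. Some samples are available on the MIT website: 16 kHz sampling, 16-bit samples, PCM encoding. The sample set has only 160 sentences, which is not enough. The full set can be found online; fetching all of it with a crawler gives a bit over 600 MB. The problem is that although the files end in .wav, MATLAB's wavread cannot read them. Opening one in a binary viewer shows a header that begins with NIST_1A (the string the conversion script below checks for), and a quick Google search shows that the files are in NIST SPHERE format. The corpus has 6300 files, so how can they all be converted to ordinary wav files?

Step 1: walk the whole folder tree and collect every wav file

find_wav.m

    function [ wav_files ] = find_wav( path )
    %FIND_WAV  Recursively find all .wav files under PATH.
    %   Returns a char matrix with one space-padded file path per row.
    wav_files = [];
    if (isdir(path) == 0)
        return;                          % not a directory: nothing to search
    end
    path_files = dir(path);
    fileNum = length(path_files);
    for k = 1:fileNum
        name = path_files(k).name;
        if strcmp(name, '.') || strcmp(name, '..')
            continue;                    % skip the current/parent entries
        end
        file = fullfile(path, name);     % portable path separator
        if (path_files(k).isdir == 1)
            ret = find_wav(file);        % recurse into the subfolder
            if (isempty(ret) ~= 1)
                if (isempty(wav_files))
                    wav_files = char(ret);
                else
                    wav_files = char(wav_files, ret);
                end
            end
        elseif ~isempty(strfind(lower(name), '.wav'))
            if (isempty(wav_files))
                wav_files = char(file);
            else
                wav_files = char(wav_files, file);
            end
        end
    end

Step 2: convert the files

conver_wav.m

    % Convert SPHERE files to wav files.
    % Note: newer MATLAB releases replace wavread/wavwrite with
    % audioread/audiowrite.
    fs = 16000;
    files = find_wav('.');
    for fileIdx = 1:size(files, 1)           % one file path per row
        file = strtrim(files(fileIdx,:));    % drop the row padding
        fileID = fopen(file);
        % Check the file header to avoid converting the wrong files
        head = fread(fileID, 1024, 'char*1');
        headStr = sprintf('%s', head(1:7));
        if (~strcmp(headStr, 'NIST_1A'))
            fclose(fileID);
            continue;                        % not a SPHERE file: leave it alone
        end
        frewind(fileID);
        allData = fread(fileID, inf, 'short');
        fclose(fileID);
        delete(file);
        % The SPHERE header is 1024 bytes, i.e. the first 512 16-bit values
        wavwrite(allData(513:end)./32768, fs, file);
    end

Step 3: check

check_wav.m

    files = find_wav('.');
    for fileIdx = 1:size(files, 1)
        file = strtrim(files(fileIdx,:));
        [y, fs, nbits] = wavread(file);      % errors out if it is not a wav file
        if (fs ~= 16000)
            fprintf('%s: fs~=16000\n', file);
        end
        if (nbits ~= 16)
            fprintf('%s: nbits ~= 16\n', file);
        end
    end

All done.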
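For readers without MATLAB, the same conversion can be sketched in Python with only the standard library. This is a minimal sketch, not the author's method, and it rests on assumptions taken from the MATLAB script above rather than from the SPHERE specification: the header is exactly 1024 bytes, and the payload is uncompressed 16-bit little-endian mono PCM at 16 kHz. The function name `sphere_to_wav` is hypothetical.

```python
import wave

HEADER_BYTES = 1024   # assumption: fixed header size, as in the MATLAB script
SAMPLE_RATE = 16000   # sampling rate stated in the post


def sphere_to_wav(src_path, dst_path):
    """Copy the PCM payload of a NIST SPHERE file into a RIFF/WAV file.

    Returns False and writes nothing if the file does not start with the
    NIST_1A magic string; returns True after a successful conversion.
    """
    with open(src_path, "rb") as f:
        data = f.read()
    if not data.startswith(b"NIST_1A"):
        return False
    pcm = data[HEADER_BYTES:]          # raw samples follow the header
    with wave.open(dst_path, "wb") as out:
        out.setnchannels(1)            # assumption: mono
        out.setsampwidth(2)            # 16-bit samples
        out.setframerate(SAMPLE_RATE)
        out.writeframes(pcm)
    return True
```

Unlike the MATLAB script, this sketch writes to a separate output path instead of deleting and overwriting the source file, so a failed run leaves the original data intact.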
TIMIT speech corpus - WELEN
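The TIMIT corpus has accurate phoneme-level annotation, so it can be used to evaluate speech segmentation performance. It also contains speech from several hundred speakers, which makes it an authoritative corpus commonly used to evaluate speaker recognition. Commercial use of the corpus, however, requires purchasing it. The resource below comes from an MIT teaching lab and is a little over 430 MB.
Download address:
There is no need to download the files one by one; the download tool below can fetch them in bulk.
Download tool: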
          The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus
                                   (TIMIT)
                            Training and Test Data
                           NIST Speech Disc CD1-1.1
The TIMIT corpus of read speech has been designed to provide speech data for
the acquisition of acoustic-phonetic knowledge and for the development and
evaluation of automatic speech recognition systems.  TIMIT has resulted from
the joint efforts of several sites under sponsorship from the Defense Advanced
Research Projects Agency - Information Science and Technology Office
(DARPA-ISTO).  Text corpus design was a joint effort among the Massachusetts
Institute of Technology (MIT), Stanford Research Institute (SRI), and Texas
Instruments (TI).  The speech was recorded at TI, transcribed at MIT, and has
been maintained, verified, and prepared for CD-ROM production by the National
Institute of Standards and Technology (NIST).  This file contains a brief
description of the TIMIT Speech Corpus.  Additional information including the
referenced material and some relevant reprints of articles may be found in the
printed documentation which is also available from NTIS (NTIS# PB91-100354).
1. Corpus Speaker Distribution
-- ---------------------------
TIMIT contains a total of 6300 sentences, 10 sentences spoken by each of 630
speakers from 8 major dialect regions of the United States.  Table 1 shows the
number of speakers for the 8 dialect regions, broken down by sex.  The
percentages are given in parentheses.  A speaker's dialect region is the
geographical area of the U.S. where they lived during their childhood years.
The geographical areas correspond with recognized dialect regions in U.S.
(Language Files, Ohio State University Linguistics Dept., 1982), with the
exception of the Western region (dr7) in which dialect boundaries are not
known with any confidence and dialect region 8 where the speakers moved around
a lot during their childhood.
   Table 1:  Dialect distribution of speakers
      Dialect
      Region(dr)    #Male    #Female    Total
      ----------  --------- ---------  ----------
         1         31 (63%)  18 (37%)   49 (8%)
         2         71 (70%)  31 (30%)  102 (16%)
         3         79 (77%)  23 (23%)  102 (16%)
         4         69 (69%)  31 (31%)  100 (16%)
         5         62 (63%)  36 (37%)   98 (16%)
         6         30 (65%)  16 (35%)   46 (7%)
         7         74 (74%)  26 (26%)  100 (16%)
         8         22 (67%)  11 (33%)   33 (5%)
       ------     --------- ---------  ----------
       Total      438 (70%) 192 (30%)  630 (100%)
The dialect regions are:
     dr1:  New England
     dr2:  Northern
     dr3:  North Midland
     dr4:  South Midland
     dr5:  Southern
     dr6:  New York City
     dr7:  Western
     dr8:  Army Brat (moved around)
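The percentages in Table 1 follow directly from the speaker counts, so they can be recomputed as a cross-check. The sketch below copies the male and female counts from the table; the function name `summarize` is hypothetical and rounding to whole percent is an assumption about how the table was produced.

```python
# (male, female) speaker counts per dialect region, copied from Table 1
COUNTS = {1: (31, 18), 2: (71, 31), 3: (79, 23), 4: (69, 31),
          5: (62, 36), 6: (30, 16), 7: (74, 26), 8: (22, 11)}


def summarize(counts):
    """Return (total speakers, {region: (male %, female %, corpus share %)})."""
    total = sum(m + f for m, f in counts.values())
    rows = {}
    for dr, (m, f) in counts.items():
        n = m + f
        rows[dr] = (round(100 * m / n), round(100 * f / n), round(100 * n / total))
    return total, rows


total, rows = summarize(COUNTS)
print(total)      # 630
print(rows[1])    # (63, 37, 8)
print(rows[3])    # (77, 23, 16)
```

With these counts, the female share of dr1 comes out at 37% and the male share of dr3 at 77%.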
2. Corpus Text Material
-- --------------------
The text material in the TIMIT prompts (found in the file "prompts.doc")
consists of 2 dialect "shibboleth" sentences designed at SRI, 450
phonetically-compact sentences designed at MIT, and 1890 phonetically-diverse
sentences selected at TI.  The dialect sentences (the SA sentences) were meant
to expose the dialectal variants of the speakers and were read by all 630
speakers.  The phonetically-compact sentences were designed to provide a good
coverage of pairs of phones, with extra occurrences of phonetic contexts
thought to be either difficult or of particular interest.  Each speaker read 5
of these sentences (the SX sentences) and each text was spoken by 7 different
speakers.  The phonetically-diverse sentences (the SI sentences) were selected
from existing text sources - the Brown Corpus (Kuchera and Francis, 1967) and
the Playwrights Dialog (Hultzen, et al., 1964) - so as to add diversity in
sentence types and phonetic contexts.  The selection criteria maximized the
variety of allophonic contexts found in the texts.  Each speaker read 3 of
these sentences, with each sentence being read only by a single speaker.
Table 2 summarizes the speech material in TIMIT.
    Table 2:  TIMIT speech material
  Sentence Type   #Sentences   #Speakers   Total   #Sentences/Speaker
  -------------   ----------   ---------   -----   ------------------
  Dialect (SA)             2         630    1260            2
  Compact (SX)           450           7    3150            5
  Diverse (SI)          1890           1    1890            3
  -------------   ----------   ---------   -----   ------------------
  Total                 2342                6300           10
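The totals in Table 2 are each the number of distinct sentence texts multiplied by the number of speakers who read each text. A short sketch of that arithmetic, with the figures copied from the table (the names `MATERIAL` and `totals` are hypothetical):

```python
# sentence type -> (distinct sentence texts, speakers reading each text)
MATERIAL = {"SA": (2, 630), "SX": (450, 7), "SI": (1890, 1)}
N_SPEAKERS = 630


def totals(material, n_speakers):
    """Return (utterances per type, sentences of that type per speaker)."""
    utterances = {k: texts * readers for k, (texts, readers) in material.items()}
    per_speaker = {k: u // n_speakers for k, u in utterances.items()}
    return utterances, per_speaker


utterances, per_speaker = totals(MATERIAL, N_SPEAKERS)
print(utterances)                  # {'SA': 1260, 'SX': 3150, 'SI': 1890}
print(sum(utterances.values()))    # 6300
print(per_speaker)                 # {'SA': 2, 'SX': 5, 'SI': 3}
```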
3. Suggested Training/Test Subdivision
-- -----------------------------------
The speech material has been subdivided into portions for training and
testing.  The criteria for the subdivision are described in the file
"testset.doc".  THIS SUBDIVISION HAS NO RELATION TO THE DATA DISTRIBUTED ON
THE PROTOTYPE VERSION OF THE CDROM.
Core Test Set:
The test data has a core portion containing 24 speakers, 2 male and 1 female
from each dialect region.  The core test speakers are shown in Table 3.  Each
speaker read a different set of SX sentences.  Thus the core test material
contains 192 sentences, 5 SX and 3 SI for each speaker, each having a distinct
text prompt.
    Table 3:  The core test set of 24 speakers
     Dialect        Male      Female
     -------       ------     ------
        1        DAB0, WBT0    ELC0
        2        TAS1, WEW0    PAS0
        3        JMP0, LNT0    PKT0
        4        LLL0, TLS0    JLM0
        5        BPM0, KLT0    NLP0
        6        CMJ0, JDH0    MGD0
        7        GRT0, NJM0    DHC0
        8        JLN0, PAM0    MLD0
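Table 3 can be turned into a small filter that picks core-test utterances out of a file listing. The sketch below is one possible way to do it, assuming lower-case paths of the form `dr<N>/<sex><speaker-id>/<sentence>.<type>` as in the directory layout this README describes; the function names are hypothetical. SA sentences are excluded because the core set is defined as 5 SX and 3 SI sentences per speaker.

```python
# Speaker IDs copied from Table 3, keyed by dialect region
CORE_MALE = {1: ["dab0", "wbt0"], 2: ["tas1", "wew0"], 3: ["jmp0", "lnt0"],
             4: ["lll0", "tls0"], 5: ["bpm0", "klt0"], 6: ["cmj0", "jdh0"],
             7: ["grt0", "njm0"], 8: ["jln0", "pam0"]}
CORE_FEMALE = {1: "elc0", 2: "pas0", 3: "pkt0", 4: "jlm0",
               5: "nlp0", 6: "mgd0", 7: "dhc0", 8: "mld0"}


def core_speaker_dirs():
    """Return the 'dr<N>/<sex><id>' directory suffixes of the 24 core speakers."""
    dirs = set()
    for dr, ids in CORE_MALE.items():
        dirs.update(f"dr{dr}/m{spk}" for spk in ids)
    for dr, spk in CORE_FEMALE.items():
        dirs.add(f"dr{dr}/f{spk}")
    return dirs


def is_core_utterance(path):
    """True for SX/SI files of core speakers; SA sentences are excluded."""
    parts = path.lower().strip("/").split("/")
    if len(parts) < 3:
        return False
    speaker_dir = "/".join(parts[-3:-1])
    return speaker_dir in core_speaker_dirs() and parts[-1][:2] in ("sx", "si")
```

For example, `is_core_utterance("/timit/test/dr5/mbpm0/sx407.phn")` is true, while the same speaker's `sa1.wav` is rejected.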
Complete Test Set:
A more extensive test set was obtained by including the sentences from all
speakers that read any of the SX texts included in the core test set.  In
doing so, no sentence text appears in both the training and test sets.  This
complete test set contains a total of 168 speakers and 1344 utterances,
accounting for about 27% of the total speech material.  The resulting dialect
distribution of the 168 speaker test set is given in Table 4.  The complete
test material contains 624 distinct texts.
     Table 4:  Dialect distribution for complete test set
      Dialect    #Male   #Female   Total
      -------    -----   -------   -----
        1           7        4       11
        2          18        8       26
        3          23        3       26
        4          16       16       32
        5          17       11       28
        6           8        3       11
        7          15        8       23
        8           8        3       11
      -----      -----   -------   ------
      Total       112       56      168
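The rule for the complete test set (take every speaker who read any SX text that a core speaker also read) is a simple set expansion. The sketch below illustrates it on toy data; the speaker names and sentence numbers are invented for illustration, and the mapping from speaker to SX sentence numbers is assumed to have been built beforehand (for instance from "spkrsent.txt", whose exact layout is not described here).

```python
def complete_test_speakers(sx_by_speaker, core_speakers):
    """Expand the core speakers to everyone who shares an SX text with them.

    sx_by_speaker maps a speaker ID to the set of SX sentence numbers
    that speaker read.
    """
    core_texts = set()
    for spk in core_speakers:
        core_texts |= sx_by_speaker[spk]
    return {spk for spk, texts in sx_by_speaker.items() if texts & core_texts}


# Toy data (hypothetical): "b" shares SX text 2 with core speaker "a"
toy = {"a": {1, 2}, "b": {2, 3}, "c": {4, 5}}
print(sorted(complete_test_speakers(toy, {"a"})))   # ['a', 'b']
```

Because every speaker sharing a text with the test set is moved into it, no SX text ends up on both sides of the split, which is the property the README states.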
4. CDROM TIMIT Directory and File Structure
-- ----------------------------------------
The speech and associated data is organized on the CD-ROM according to the
following hierarchy:
/<CORPUS>/<USAGE>/<DIALECT>/<SEX><SPEAKER_ID>/<SENTENCE_ID>.<FILE_TYPE>
     where,
     CORPUS :== timit
     USAGE :== train | test
     DIALECT :== dr1 | dr2 | dr3 | dr4 | dr5 | dr6 | dr7 | dr8
                 (see Table 1 for dialect code description)
     SEX :== m | f
     SPEAKER_ID :== <INITIALS><DIGIT>
          where,
          INITIALS :== speaker initials, 3 letters
          DIGIT :== number 0-9 to differentiate speakers with identical
                    initials
     SENTENCE_ID :== <TEXT_TYPE><SENTENCE_NUMBER>
          where,
          TEXT_TYPE :== sa | si | sx
                        (see Section 2 for sentence text type description)
          SENTENCE_NUMBER :== 1 ... 2342
     FILE_TYPE :== wav | txt | wrd | phn
                   (see Table 5 for file type description)
     Examples:
     /timit/train/dr1/fcjf0/sa1.wav
     (TIMIT corpus, training set, dialect region 1, female speaker,
      speaker-ID "cjf0", sentence text "sa1", speech waveform file)
     /timit/test/dr5/mbpm0/sx407.phn
     (TIMIT corpus, test set, dialect region 5, male speaker, speaker-ID
      "bpm0", sentence text "sx407", phonetic transcription file)
Online documentation and tables are located in the directory "timit/doc".
A brief description of each file in this directory can be found in Section 6.
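The path grammar in this section maps naturally onto a regular expression. The following is a minimal sketch, assuming lower-case paths with forward slashes as in the examples; the `TimitPath` type and `parse_timit_path` function are hypothetical names, not part of any TIMIT tooling.

```python
import re
from typing import NamedTuple


class TimitPath(NamedTuple):
    usage: str            # "train" or "test"
    dialect: int          # 1..8
    sex: str              # "m" or "f"
    speaker_id: str       # 3 initials + 1 digit, e.g. "cjf0"
    text_type: str        # "sa", "si" or "sx"
    sentence_number: int
    file_type: str        # "wav", "txt", "wrd" or "phn"


_PATTERN = re.compile(
    r"/?timit/(train|test)/dr([1-8])/([mf])([a-z]{3}[0-9])/"
    r"(sa|si|sx)([0-9]+)\.(wav|txt|wrd|phn)$"
)


def parse_timit_path(path):
    """Split a TIMIT file path into its components, or raise ValueError."""
    m = _PATTERN.search(path.lower())
    if m is None:
        raise ValueError(f"not a TIMIT path: {path!r}")
    usage, dr, sex, spk, text_type, num, file_type = m.groups()
    return TimitPath(usage, int(dr), sex, spk, text_type, int(num), file_type)


print(parse_timit_path("/timit/train/dr1/fcjf0/sa1.wav"))
```

Lower-casing the input is a design choice for this sketch, so that copies of the corpus stored with upper-case names parse the same way.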
5. File Types
-- ----------
The TIMIT corpus includes several files associated with each utterance.  In
addition to a speech waveform file (.wav), three associated transcription
files (.txt, .wrd, .phn) exist.  These associated files have the form:
        <BEGIN_SAMPLE> <END_SAMPLE> <TEXT><new-line>
        <BEGIN_SAMPLE> <END_SAMPLE> <TEXT><new-line>
        where,
                BEGIN_SAMPLE :== The beginning integer sample number for the
                                 segment (Note: The first BEGIN_SAMPLE of each
                                 file is always 0)
                END_SAMPLE :== The ending integer sample number for the segment
                               (Note: Because of the transcription method used,
                               the last END_SAMPLE in each transcription file
                               may be less than the actual last sample in the
                               corresponding .wav file)
                TEXT :== <ORTHOGRAPHY> | <WORD_LABEL> | <PHONETIC_LABEL>
                where,
                     ORTHOGRAPHY :== Complete orthographic text transcription
                     WORD_LABEL :== Single word from the orthography
                     PHONETIC_LABEL :== Single phonetic transcription code
                                        (See "phoncode.doc" for description
                                        of codes)
    Table 5:  Utterance-associated file types
 File Type                     Description
 ---------  ------------------------------------------------------
     .wav - SPHERE-headered speech waveform file.  (See the "/sphere"
            directory for speech file manipulation utilities.)
     .txt - Associated orthographic transcription of the words the
            person said.  (Usually this is the same as the prompt, but
            in a few cases the orthography and prompt disagree.)
     .wrd - Time-aligned word transcription. The word boundaries
            were aligned with the phonetic segments using a dynamic
            string alignment program (see the printed documentation
            section "Notes on the Word Alignments" and the lexical
            pronunciations given in "timitdic.txt".)
     .phn - Time-aligned phonetic transcription.  (See the reprint
            of the article by Seneff and Zue (1988), in the printed
            documentation, and the section "Notes on Checking the
            Phonetic Transcriptions" for more details on the phonetic
            transcription protocols.)
Example transcriptions from the utterance in "/timit/test/dr5/fnlp0/sa1.wav"
Orthography (.txt):
        0 61748 She had your dark suit in greasy wash water all year.
Word label (.wrd):
Phonetic label (.phn):
(Note: beginning and ending silence regions are marked with h#)
        0 7470 h#
        axr
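Since all three transcription file types share the same line layout (begin sample, end sample, text), one small parser covers them. This is a sketch under stated assumptions: fields are separated by whitespace, and the 16 kHz rate used to convert samples to seconds is the figure quoted for the corpus, passed in as a default parameter. The function names are hypothetical.

```python
def parse_transcription(text):
    """Parse '<BEGIN_SAMPLE> <END_SAMPLE> <TEXT>' lines into tuples.

    Returns a list of (begin, end, label); blank lines are skipped.
    The label keeps any internal spaces, as needed for .txt files.
    """
    segments = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        begin, end, label = line.split(None, 2)
        segments.append((int(begin), int(end), label))
    return segments


def to_seconds(sample, rate=16000):
    """Convert a sample index to seconds (assumes 16 kHz audio by default)."""
    return sample / rate


print(parse_transcription("0 7470 h#\n"))   # [(0, 7470, 'h#')]
```

Applied to the orthography line shown in this section, the parser returns a single segment running from sample 0 to sample 61748 with the full sentence as its label.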
6. Online Documentation
-- --------------------
Compact documentation is located in the "/timit/doc" directory.  Files in this
directory with a ".doc" extension contain freeform descriptive text and files
with a ".txt" extension contain tables of formatted text which can be searched
programmatically.  Lines in the ".txt" files beginning with a semicolon are
comments and should be ignored on searches.  The following is a brief
description of their contents:
    phoncode.doc - Table of phone symbols used in phonemic dictionary and
                   phonetic transcriptions
     prompts.txt - Table of sentence prompts and sentence-ID numbers
    spkrinfo.txt - Table of speaker attributes
    spkrsent.txt - Table of sentence-ID numbers for each speaker
     testset.doc - Description of suggested train/test subdivision
    timitdic.doc - Description of phonemic lexicon
    timitdic.txt - Phonemic dictionary of all orthographic words in prompts
A more extensive description of corpus design, collection, and transcription
can be found in the printed documentation.
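The one parsing rule stated for the ".txt" tables, that lines beginning with a semicolon are comments, can be sketched as follows. The column layout of each table is not described here, so this sketch only splits rows on whitespace; the function name and the sample rows are hypothetical.

```python
def read_table_rows(text):
    """Return whitespace-split fields of every non-comment, non-blank line.

    Lines beginning with a semicolon are treated as comments and skipped.
    """
    rows = []
    for line in text.splitlines():
        if line.startswith(";") or not line.strip():
            continue
        rows.append(line.split())
    return rows


# Hypothetical sample; real tables have their own column layouts
sample = "; this line is a comment\nalpha 1\n\nbeta 2\n"
print(read_table_rows(sample))   # [['alpha', '1'], ['beta', '2']]
```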