HARVARD speech corpus - audio recording 2019

Published on 2019-05-13T12:16:41Z (GMT) by Philippa Demonte
<div>High-quality (sampling rate: 48 kHz; 32-bit rate) digital audio .wav files of a new recording of the HARVARD speech corpus in its entirety (720 phonetically balanced sentences), featuring a female native British English speaker. For use in speech-in-noise tests, evaluations of audio quality, machine learning, and so on.<br></div><div><br></div><div>Recorded: December 2018 at the University of Salford.</div><div><br></div><div>The audio files are licensed under the Creative Commons Attribution-NonCommercial 4.0 International License</div><div>(https://creativecommons.org/licenses/by-nc/4.0/legalcode)</div><div><br></div><div>Citation for the HARVARD sentences:</div><div>* From the appendix of: 'IEEE Subcommittee on Subjective Measurements IEEE Recommended Practices for Speech Quality Measurements'. IEEE Transactions on Audio and Electroacoustics, vol. 17, pp. 227-246. 1969.</div><div>* http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/fgdata/OldFiles/Recorder.app/utterances/Type1/harvsents.txt<br></div><div><br></div><div>For an overview of the lists of HARVARD speech corpus sentences:</div><div>* harvard.txt</div><div><br></div><div>For an overview of how this version of the speech corpus was recorded and audio engineered:</div><div>* harvard_201218_British_English_recording.txt</div><div><br></div><div>For examples of the audio files at three different stages of processing</div><div>* Speech corpus - example of raw audio: HARVARD list 01.wav</div><div>* Speech corpus - example of edited audio: Harvard_L01_S01_0.wav</div><div>* Speech corpus - example of edited end-pointed zero-padded audio: Harvard_L01_S01_5.wav</div><div><br></div><div>For the .wav audio files of the HARVARD speech corpus in its entirety at different stages of processing, see the zip folders:</div><div>* Speech corpus - Harvard - raw audio</div><div>* Speech corpus - Harvard - edited and end-pointed audio</div><div>* Speech corpus - Harvard - edited, end-pointed, zero-padded audio</div><div><br></div><div>Each raw audio file is a recording of a single Harvard speech corpus list of 10 sentences.</div><div>The two zip folders of the edited versions contain 10 individual sentence .wav files per sub-folder.<br></div><div><br></div><div>For a .wav audio file of the room ambience in which this version of the Harvard speech corpus was recorded in:</div><div>* Speech corpus - ambient noise of recording room</div><div><br></div><div>For the Matlab 2018b scripts used for the end-pointing and zero-padding applied to the audio files:</div><div>* EndPoint.m</div><div>* zeropad.m<br></div><div><br></div><div><br></div><div><br></div>

Cite this collection

Demonte, Philippa (2019): HARVARD speech corpus - audio recording 2019. figshare. Collection.