The Vocal Distance Dataset consists of individuals who were recorded speaking in Korean, which amounted to a total of 573 hours. Due to the placement of recording devices in three different distances from the individuals, there is 191 hours of audio for each distance: 0.4m, 2m, and 4m. Since sound varies widely with distance, this dataset will allow engineers to build more accurate models that can identify sounds. Instead of relying on a dataset recorded from one distance, it will be more reliable to use a dataset that contains multiple distances depending on how far away the speaker is from the microphone. To record the data, there were three microphones placed at a distance of (0.4m, 2m, and 4m) away from the speaker. To maintain the consistency of the data, the speaker was always standing and the average height of each microphone was 1.5m tall.
Featured Vocal Distance Data Sound Sample
Recorded at 0.4m
Recorded at 2m
Recorded at 4m
If you would like to hear more samples, please contact us. (firstname.lastname@example.org)
We can provide about 180 samples for the Multiple Location Dataset.
We offer the vocal distance dataset at a much lower price than other datasets since it doesn't include the metadata such as the speaker's gender, age, etc.
However, if you would like more information regarding the speaker's demographics and the content of the voice recordings, you may purchase other types of audio datasets that we have on the website. Please inquire for further information.
Feel free to contact us for more detailed information including the pricing and licensing.
Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)