List of Datasets

Some Datasets that might be useful for this project

This page contains a table of links to datasets that might be useful for this project. The table does not display correctly in the description, so view the full post to see it.

Title Length Scenes
Recordings of Nature Wordpress website Many 2-hour all-night recordings made with 'tree-ear' microphones. High quality, stereo. This is a goldmine for longer datasets. Uncategorized nature, mostly made overnight.
DCase 2018 Making Sense of Sounds info download 1500 5-seconds audio segments.
  • urban
  • music
  • effects
  • human
  • nature
DCase 2018 Task 1 (TUT Urban Acoustic Scenes 2018, Development dataset) info download 10-seconds audio segments from 10 acoustic scenes. Each acoustic scene has 864 segments (144 minutes of audio). The dataset contains in total 24 hours of audio
  • airport
  • shopping_mall
  • metro_station
  • street_pedestrian
  • public_square
  • street_traffic
  • tram
  • bus
  • metro
  • park
DCase 2018 Task 5 (derivative of SINS) info download audio segments of 10s. Segments containing more then one active class were left out. Continuous recording of one person living in a vacation home over a period of one week. Recorded with quadrophonic microphone arrays in the Kitchen and Living Room
  • Absence
  • Cooking
  • Dishwashing
  • Eating
  • Other (present but not doing any relevant activity)
  • Social activity (visit, phone call)
  • Vacuum cleaning
  • Watching TV
  • Working (typing, mouse click, ...)
DCASE 2016 and 2017 Task 1 (TUT Acoustic Scenes 2017) info download For each recording location, 3-5 minute long audio recording was captured. The original recordings were then split into segments with a length of 10 seconds. Each acoustic scene has 312 segments totaling 52 minutes of audio. The dataset was collected in Finland by Tampere University of Technology between 06/2015 - 01/2017
  • Bus
  • Cafe / Restaurant
  • Car
  • City center
  • Forest path
  • Grocery store
  • Home
  • Lakeside beach
  • Library
  • Metro station
  • Office
  • Residential area
  • Train
  • Tram
  • Urban park
DCASE 2016 Task 4 (CHiME-Home) info   The audio data are provided as 4-second chunks All from one domestic scene. Prominent sound sources in the acoustic environment are two adults and two children, television and electronic gadgets, kitchen appliances, footsteps and knocks produced by human activity, in addition to sound originating from outside the house.
DCASE 2013 Task 1 info download 1 min recordings The developement and testing datasets, denoted as office live (OL), will consist of 1 min recordings of every-day audio events in a number of office environments
Emo-Soundscapes info   1213 6-second Creative Commons licensed audio clips. Soundscape emotion recognition (SER) aims at the automatic recognition of emotions perceived in soundscape recordings. 1182 annotators from 74 different countries rank the audio clips according to the perceived valence and arousal. Labeled by valence and arousal, not by scene class.
Defreville-Aucouturier environmental audio dataset info download "For this study, we gathered a database of 106 3 min recordings of urban soundscapes, recorded in Paris using an omnidirectional microphone." (The database actually contains 16 recordings ranging in length from about a minute to over 35 minutes). Recorded in Paris using an omnidirectional microphone.
  • Avenue
  • Neighborhood
  • Street Market
  • Park
Recordings are further labeled into 11 “detailed classes,” which correspond to the place and date of recording of a given environment. For instance, “Parc Montsouris 􏰁Paris 14è􏰀” is a subclass of the general “Park” class.
DEMAND info download 15 recordings, Recorded with 16-channel array
  • DOMESTIC
    • kitchen
    • living
    • washing
  • NATURE
    • field
    • park
    • river
  • OFFICE
    • hallway
    • meeting
    • office
  • PUBLIC
    • caffeteria
    • presto
    • station
  • STREET
    • square
    • traffic
  • TRANSPORTATION
    • bus
    • car
    • metro
Acoustic Environment Classification (Noise DB Series 1 and 2) info download 1 file of 4 or 5 minutes for each scene. Series 1 was recorded using a Sony MiniDisk recorder and external microphone in 2002. Sampling rate: WAV 22.050kHZ 16bit Mono Series 2 was taken using a Samsung YP55H MP3 recorder in 2004. Sampling rate: WAV 8.00kHZ 8bit Mono. We used an MP3 recorder attached to the strap of a shoulder bag as the recording device to capture the environmental noise from a typical daily routine
  • Series 1
    • Bar
    • Beach
    • Bus
    • Car
    • Football Match
    • Laundrette
    • Lecture
    • Office
    • Rail station
    • Street
  • Series 2
    • Building site
    • Bus
    • Car (city)
    • Car (highway)
    • Launderette
    • Office
    • Presentation
    • Shopping mall
    • Street (people)
    • Street (traffic)
    • Supermarket
    • Train
DARES info download 120 fragments of 60 second recordings, resulting in about 1.3GB of audio. Recorded with binaural ‘headphone’ mics.
  • Streets
    • Busy street (9)
    • Quiet street (5)
    • Pedestrian area (2),
    • Bicycle path (3)
    • Residential area (2)
    • Bus stop (3)
  • Nature
    • Forest (5)
    • Park (2)
    • Beach (2)
    • Field (1)
  • Home
    • Living room (28)
    • Study (13)
    • Flat (13)
    • Kitchen (9)
    • Hallway (5)
    • Bed room (3)
    • Shed (1)
  • Public buildings
    • Supermarket (6)
    • Shop (2)
    • Train station (1)
  • Vehicles
    • Bus (4)
    • Train (2)
    • Ferry (1)
  • Other
    • Lawn (1)
    • Elevator (2)
Ecolisten info download Longer recordings up to an hour in length recorded with an ambisonic sound field mic. Available in A format, B format, and stereo.
  • Beaver Creek Biosphere reserve
  • Death Valley National Park
  • Jornada Experimental Range
  • Joshua Tree National Park
  • Mojave Desert Preserve
  • Organ Pipe Cactus National Park
  • Sequoia and Kings Canyon National Park
  • Sian Kaant Biosphere reserve
  • Vogelfreistaette Flachwasser
  • park
URBANSOUND info download 1302 labeled sound recordings from Freesound. Each recording is labeled with the start and end times of sound events from 10 classes.
  • air_conditioner
  • car_horn
  • children_playing
  • dog_bark
  • drilling
  • enginge_idling
  • gun_shot
  • jackhammer
  • siren
  • street_music

Comments

Popular posts from this blog

Ambisonic Rendering in the Story Bubble

How I calibrated my contact microphone

WaveRNN