Simulating realistically-spatialised simultaneous speech using video-driven speaker detection and the CHiME-5 dataset

Page will contain download links to the data extracted and was used in the analysis in the paper. Links to the tools developed to extract labels will also be available.