Skip to main content

A Combined Features Approach for Speaker Segmentation using BIC and Artificial Neural Networks

13 October 2013

New Image

We present a combined features approach for speaker segmentation task. This approach utilizes different acoustic features extracted from audio stream. The Bayesian Information Criterion (BIC) is used for each acoustic feature as a distance measure to verify the merging of two audio segments. An Artificial Neural Network (ANN) combines the time index from each ÄBIC with the highest value, and estimates the change point. In the experiments, a data set containing examples with several speakers is used to compare our approach with the Chen and Gopalakrishnan's window-growing-based approach, using different acoustic features sets. The results show an improvement in both the Miss Detection Rate (MDR) and the False Alarm Rate (FAR) compared to the window-growing-based approach.