The ability to automatically scan voicemail messages for content and caller identity cues would be a useful service. This paper describes a system which automatcally files voicemail messages into caller folders using text independent speaker recognition techniques. Callers are represented by Gaussian mixture models (GMM's). The speech for an incoming message is processed and scored against caller models created for a subscriber. A message whose matching score exceeds a threshold is filed in the matching matching score exceeds a threshold is filed in the matching caller folder; otherwise it is tagged as "unknown". The subscriber has the ability to listen to an "unknown" message and file it in the proper folder, if it exists, or create a new folder, if it does not. Such subscriber labelled messages are used to train and adapt caller models. The system has been evalauted on a database of voicemail messages collected at AT&T Labs. A set of 20 callers from this database is designated as "ingroup". Each of these callers has recorded at least 20 messages totalling 10 or more minutes in duration. A distinct set of 220 messages, each from a different caller, are designated as "outgroup". representative performance figures with threshold parameters set to ensure that out-group acceptance is low compared with ingroup rejection are the following. The average ingroup message rejection rate is 11.0
展开▼