Technical Publications


Contact Information

A System for Unrestricted Topic Retrieval from Radio News Broadcasts

James D.


The "topic classification" systems described in the speech literature typically partition a collection of spoken messages into a small number of pre-defined topics. As such, they are only useful if the set of message topics does not vary over time. However, the techniques of textual information retrieval (IR) have long allowed for retrieval by arbitrary subject from a document collection. This paper describes experiments in unrestricted retrieval from a collection of radio news broadcasts. A hybrid message indexing strategy, with conventional word recognition and a fast lattice-based wordspotter, allows for the retrieval of news reports concerning any subject. The results show that retrieval can be carried out extremely quickly and that high accuracy is possible, even with errorful recognition output.

[Jam96] James D.. A System for Unrestricted Topic Retrieval from Radio News Broadcasts. In Proc Int Conf Acoust, Speech and Sig Proc (ICASSP), pages 279-282, Atlanta, GA, USA, May 1996.

Get publication ( 58K, Adobe Acrobat PDF ).
Get publication ( 37K, PostScript ).

Questions, comments, suggestions?
This site is generously hosted by Macrofocus GmbH, developer of TreeMap, High-D, and other fine visualization tools
Page rendered on Thursday, February 03, 2000