TCTS Lab Research Groups

The THISL project

Thematic Indexing of Spoken Language (THISL): ESPRIT Long Term Research RTD Project, Ref. 23495.

See the official web site for complete information about this project.

Project partners:

  • Sheffield University (UK)
  • British Broadcasting Corporation (UK)
  • Faculté Polytechnique de Mons (Belgium)
  • SoftSound (UK)
  • Thomson-CSF (France)
  • IDIAP (Switzerland)
  • International Computer Science Institute (USA, Subcontractor)

  • Abstract

    THISL is an ESPRIT Long Term Research Project focused on the retrieval of multimedia information (primarily written or spoken text) using a spoken language interface. The project aims to build a demonstration system that accurately recognises broadcast speech from TV and radio news programmes and produces multimedia indexing data from the recognised output. It concentrates on British and American English applications, with work in progress on a French speech recognition application. At the midway point of the project, we have constructed a prototype system based on an archive of 100 hours of North American broadcast news, which has been successfully evaluated within the framework of the TREC-6 and TREC-7 spoken document retrieval tracks. By early 1999, we will have a second prototype system based on an archive of several hundred hours of BBC news output.