next up previous contents
Next: WP6: Hardware and Software Up: No Title Previous: WP5: Conclusion

Bibliography

1
Jont B. Allen.
How do humans process and recognize speech?
IEEE Transactions on Speech and Audio Processing, 2(4):567-577, Oct 1994.

2
Jeff Bilmes.
Joint distributional modeling with cross-correlation based features.
In Proc. IEEE Automatic Speech Recognition and Understanding Workshop. IEEE, December 1997.

3
Jeff Bilmes.
Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling.
In Proceedings of the IEEE International Conference on Acoustics, Speech, & Signal Processing, Seattle, WA, May 1998.
Submitted.

4
Hervé Bourlard and Stéphane Dupont.
Subband-based speech recognition.
In Proceedings of the IEEE International Conference on Acoustics, Speech, & Signal Processing, volume 2, pages 125-128, May 1997.

5
Hervé Bourlard and Nelson Morgan.
Connectionist Speech Recognition - A Hybrid Approach.
Kluwer Academic Press, 1994.

6
Harvey Fletcher.
Speech and Hearing in Communication.
Krieger, New York, 1953.

7
Richard P. Lippmann.
Speech recognition by machines and humans.
Speech Communication, 22(1):1-15, 1997.

8
George A. Miller and Patricia E. Nicely.
An analysis of perceptual confusions among some English consonants.
Journal of the Acoustical Society of America, 27(2):338-352, Mar 1955.

9
Nikki Mirghafori.
An alternative approach to automatic speech recognition using sub-band linguistic categories.
Thesis Proposal (http://www.icsi.berkeley.edu/~ nikki/papers/thesis_prop.ps), Dec 1996.

10
Nikki Mirghafori and Nelson Morgan.
Transmissions and transitions: A study of two common assumptions in multi-band ASR.
In Proceedings of the IEEE International Conference on Acoustics, Speech, & Signal Processing, 1998.
Submitted.

11
Sudhakar Rao and Wiliam A. Pearlman.
Analysis of linear prediction, coding, and spectral estimation from subbands.
IEEE Transactions on Information Theory, 42(4):1160-1178, Jul 1996.

12
Sangita Tibrewala and Hynek Hermansky.
Sub-band based recognition of noisy speech.
In Proceedings of the IEEE International Conference on Acoustics, Speech, & Signal Processing, volume 2, pages 1255-1258, May 1997.

13
M. J. Tomlinson, M. J. Russell, R. K. Moore, A. P. Buckland, and M. A. Fawley.
Modelling asynchrony in speech using elementary single-signal decomposition.
In Proceedings of the IEEE International Conference on Acoustics, Speech, & Signal Processing, volume 2, pages 1247-1250, Apr 1997.

14
Hermansky, H. and Morgan, N., ``RASTA processing of speech'', IEEE Transactions on Speech and Audio Processing, special issue on Robust Speech Recognition, vol.2 no. 4, pp. 578-589, Oct. 1994.

15
H. Bourlard and S. Dupont, ``A new ASR approach based on independent processing and recombination of partial frequency bands'', Proc. of Intl. Conf. on Spoken Language Processing, Philadelphia, Oct 1996, pp 422-425.

16
H. Bourlard and S. Dupont and C. Ris, ``Multi-Stream Speech Recognition'', IDIAP-RR 96-07, Martigny.

17
A.P. Varga and R.K. Moore, ``Hidden Markov Model decomposition of speech and noise''
In Proceedings of the IEEE International Conference on Acoustics, Speech, & Signal Processing, pages 845-848, 1990.

18
, R.A. Cole and M. Fanty and T. Lander, ``Telephone Speech Corpus at CSLU'',
In Proc. of Intl. Spoken Language Processing, Yokohama, Japan, September, 1994.

19
S. Furui ``Speaker independant isolated word recognizer using dynamic features of speech spectrum''
In IEEE Trans. on Acoustics, Speech and Signal Processing, Vol 34, pp 52-59, 1986

20
M.J. Hunt, M. Lennig, and P. Mermelstein
Experiments in Syllable-based Recognition of Continuous Speech
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 880-883, 1980.

21
S-L. Wu and M.L. Shire and S. Greenberg and N.Morgan
Integrating Syllable Boundary Information into Speech Recognition
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 987-990, 1997.

22
S. Renals and M. Hochberg
Decoder Technology for Connectionist Large Vocabulary Speech Recognition
Dept. of Computer Science, University of Sheffield Technical Report CS-95-17, 1995.

23
S. Renals and M. Hochberg
Efficient Evaluation of the LVCSR Search Space Using the NOWAY Decoder
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 149-152, 1996.

24
L. Gillick and S.J. Cox
Some Statistical Issues in the Comparison of Speech Recognition Algorithms.
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 532-535, 1989

25
John F. Pitrelli and Cynthia Fong and Suk H. Wong and Judith R. Spitz and Hong C. Leung, ``PhoneBook : A Phonetically-Rich Isolated-Word Telephone-Speech Database'',
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 101-104, 1995

26
R. Haeb-Umbach and H. Ney, ``Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition'',
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 13-16, 1992

27
X. Aubert and R. Haeb-Umbach and H. Ney, ``Continuous Mixture Densities and Linear Discriminant Analysis for Context -Dependent Acoustic Models'',
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 648-651, 1993

28
R. Haeb-Umbach and D. Geller and H. Ney, ``Improvements in Connected Digit Recognition using Linear Discriminant Analysis and Mixture Densities'',
In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 239-242, 1993

29
V. Fontaine and C. Ris and H. Leich, ``Nonlinear Discriminant Analysis with Neural Networks for Speech Recognition'',
In Proceedings of EUSIPCO, 1583-1586, 1996.

30
D.J. Kershaw
Phonetic Context-Dependency in a Hybrid ANN/HMM Speech Recognition System
Cambridge University Engineering Department PhD. thesis, 1996.

31
D.J. Kershaw, M.M. Hochberg, and A.J. Robinson
Context-Dependent Classes in a Hybrid Recurrent Network-HMM Speech Recognition System
In Advances in Neural Information Processing Systems, vol 8, 1996.

32
Hennebert J, Ris C, Bourlard H., Renals S. Morgan N.,
``Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems''
In Proceedings Eurospeech'97, vol.4, pp. 1951-1954

33
D. J. Bartholemew.
Latent variable models and factor analysis, Charles Griffin and Co., London, 1987.

34
C. M. Bishop, M. Svenson and C. K. I. Williams.
GTM: The generative topographic mapping.
Neural Computation, to appear 1997.

35
W. J. Hardcastle, F. E. Gibbon and W. Jones.
Visual display of tongue-palate contact: Electropalatography in the assessment and remediation of speech disorders.
British Journal of Disorders in Communication, 26:41-74, 1991.

36
W. J. Hardcastle, W. Jones, C. Knight, A. Trudgeon and G. Calder.
New developments in electropalatography: A state of the art report.
Journal of Clinical Linguistics and Phonetics, 3:1-38, 1989.



Christophe Ris
1998-11-10