Details

Title

Phoneme Segmentation Based on Wavelet Spectra Analysis

Journal title

Archives of Acoustics

Yearbook

2011

Volume

vol. 36

Issue

No 1

Authors

Keywords

speech recognition ; speech segmentation ; discrete wavelet transform

Divisions of PAS

Technical Sciences (Nauki Techniczne)

Coverage

29-47

Publisher

Polish Academy of Sciences, Institute of Fundamental Technological Research, Committee on Acoustics

Date

2011

Type

Articles

Identifier

DOI: 10.2478/v10168-011-0003-2

Source

Archives of Acoustics; 2011; vol. 36; No 1; 29-47
