Ditemukan 9196 dokumen yang sesuai dengan query
Benesty, Jacob
"This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different frames and frequency bands are assumed to be independent. The second category also concerns the single-channel problem. The difference is that now the interframe correlation is taken into account and a filter is applied in each subband instead of just a gain.
The third and fourth classes discuss the problem of multichannel noise reduction in the STFT domain with and without interframe correlation, respectively. In the last category, consider the interband correlation in the design of the noise reduction filters. Illustrate the basic principle for the single-channel case as an example, while this concept can be generalized to other scenarios. In all categories, propose different optimization cost functions from which derive the optimal filters and we also define the performance measures that help analyzing them."
Heidelberg : [, Springer], 2012
e20418134
eBooks Universitas Indonesia Library
Rao, K. Sreenivasa Rao
"This book presents a statistical parametric speech synthesis (SPSS) framework for developing a speech synthesis system where the desired speech is generated from the parameters of vocal tract and excitation source. Throughout the book, the authors discuss novel source modeling techniques to enhance the naturalness and overall intelligibility of the SPSS system. This book provides several important methods and models for generating the excitation source parameters for enhancing the overall quality of synthesized speech. The contents of the book are useful for both researchers and system developers. For researchers, the book is useful for knowing the current state-of-the-art excitation source models for SPSS and further refining the source models to incorporate the realistic semantics present in the text. For system developers, the book is useful to integrate the sophisticated excitation source models mentioned to the latest models of mobile/smart phones.
- Presents the efficient excitation source modeling techniques for generating high quality speech;
- Includes a combination of both waveform and parametric methods to enhance the quality of synthesis;
- Features and methods that need less memory and computational requirements than others, allowing them to be integrated to smart phones and smaller devices."
Switzerland: Springer Nature, 2019
e20509754
eBooks Universitas Indonesia Library
Sclater, Neil
Indianapolis, Indiana: Howard W. Sams, 1983
621.381 9 SCL i
Buku Teks Universitas Indonesia Library
Harrington, Jonathan, 1950-
Chichester, U.K.: Wiley-Blackwell, 2010
414.8 HAR p
Buku Teks SO Universitas Indonesia Library
Rabiner, Lawrence R.
Englewood Cliffs, NJ: Prentice-Hall, 1978
621.380 412 RAB d
Buku Teks Universitas Indonesia Library
Coleman, John R.
New York: Cambridge University Press, 2005
410.285 COL i
Buku Teks SO Universitas Indonesia Library
Mary, Leena
"This book presents techniques for audio search, aimed to retrieve information from massive speech databases by using audio query words. The authors examine different features, techniques and evaluation measures attempted by researchers around the world. The topics covered also include available databases, software / tools, patents / copyrights, and different platforms for benchmarking. The content is relevant for developers, academics, and students."
Switzerland: Springer Cham, 2019
e20502755
eBooks Universitas Indonesia Library
Jimenez, Ricardo
San Diego: Academic Press , 1991
621.39 JIM d
Buku Teks Universitas Indonesia Library
"This volume constitutes the refereed proceedings of the Spanish Conference, IberSPEECH 2012: Joint VII “Jornadas en Tecnología del Habla” and III Iberian SLTech Workshop, held in Madrid, Spain, in November 21-23, 2012. The 29 revised papers were carefully reviewed and selected from 80 submissions. The papers are organized in topical sections on speaker characterization and recognition, audio and speech segmentation, pathology detection and speech characterization, dialogue and multimodal systems, robustness in automatic speech recognition, applications of speech and language technologies."
Berlin: Springer-Verlag, 2012
e20408240
eBooks Universitas Indonesia Library
Petr Sojka, editor
"This book constitutes the refereed proceedings of the 15th International Conference on Text, Speech and Dialogue, TSD 2012, held in Brno, Czech Republic, in September 2012. The 82 papers presented together with 2 invited talks were carefully reviewed and selected from 173 submissions. The papers are organized in topical sections on corpora and language resources, speech recognition, tagging, classification and parsing of text and speech, speech and spoken language generation, semantic processing of text and speech, integrating applications of text and speech processing, machine translation, automatic dialogue systems, multimodal techniques and modeling."
Berlin: Springer-Verlag, 2012
e20409426
eBooks Universitas Indonesia Library