Data-Driven Techniques in Speech Synthesis edited by R.I. Damper.

Data-Driven Techniques in Speech Synthesis gives a first review of this new field. All areas of speech synthesis from text are covered, including text analysis, letter-to-sound conversion, prosodic marking and extraction of parameters to drive synthesis hardware. Fuelled by cheap computer processing...

Full description

Saved in:
Bibliographic Details
Corporate Author: SpringerLink (Online service)
Other Authors: Damper, R.I (Editor)
Format: eBook
Language:English
Published: New York, NY : Springer US : Imprint: Springer, 2001.
Edition:1st ed. 2001.
Series:Telecommunications Technology & Applications Series
Springer eBook Collection.
Subjects:
Online Access:Click to view e-book
Holy Cross Note:Loaded electronically.
Electronic access restricted to members of the Holy Cross Community.

MARC

LEADER 00000nam a22000005i 4500
001 b3206056
003 MWH
005 20191022092012.0
007 cr nn 008mamaa
008 121205s2001 xxu| s |||| 0|eng d
020 |a 9781475734133 
024 7 |a 10.1007/978-1-4757-3413-3  |2 doi 
035 |a (DE-He213)978-1-4757-3413-3 
050 4 |a E-Book 
072 7 |a PHDS  |2 bicssc 
072 7 |a SCI001000  |2 bisacsh 
072 7 |a PHDS  |2 thema 
245 1 0 |a Data-Driven Techniques in Speech Synthesis  |h [electronic resource] /  |c edited by R.I. Damper. 
250 |a 1st ed. 2001. 
264 1 |a New York, NY :  |b Springer US :  |b Imprint: Springer,  |c 2001. 
300 |a XVIII, 316 p.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a Telecommunications Technology & Applications Series 
490 1 |a Springer eBook Collection 
505 0 |a 1 Learning About Speech from Data: Beyond NETtalk -- 1.1 Introduction -- 1.2 Architecture of a TTS System -- 1.3 Automatic Pronunciation Generation -- 1.4 Prosody -- 1.5 The Synthesis Module -- 1.6 Conclusion -- 2 Constructing High-Accuracy Letter-to-Phoneme Rules with Machine Learning -- 2.1 Introduction -- 2.2 The Nettalk Approach -- 2.3 High-Performance ML Approach -- 2.4 Evaluation of Pronunciations -- 2.5 Conclusions -- 3 Analogy, the Corpus and Pronunciation -- 3.1 Introduction -- 3.2 Why Adopt a Psychological Approach? -- 3.3 The Corpus as a Resource -- 3.4 The Sullivan and Damper Model -- 3.5 Parallels with Optimality Theory -- 3.6 Implementation -- 3.7 Corpora -- 3.8 Performance Evaluation -- 3.9 Future Challenges -- 4 A Hierarchical Lexical Representation for Pronunciation Generation -- 4.1 Introduction -- 4.2 Previous Work -- 4.3 Hierarchical Lexical Representation -- 4.4 Generation Algorithm -- 4.5 Evaluation Criteria -- 4.6 Results on Letter-to-Sound Generation -- 4.7 Error Analyses -- 4.8 Evaluating the Hierarchical Representation -- 4.9 Discussions and Future Work -- 5 English Letter-Phoneme Conversion by Stochastic Transducers -- 5.1 Introduction -- 5.2 Modelling Transduction -- 5.3 Stochastic Finite-State Transducers -- 5.4 Inference of Letter-Phoneme Correspondences -- 5.5 Translation -- 5.6 Results -- 5.7 Conclusions -- 6 Selection of Multiphone Synthesis Units and Grapheme-to-Phoneme Transcription using Variable-Length Modeling of Strings -- 6.1 Introduction -- 6.2 Multigram Model -- 6.3 Multiphone Units for Speech Synthesis -- 6.4 Learning Letter-to-Sound Correspondences -- 6.5 General Discussion and Perspectives -- 7 TreeTalk: Memory-Based Word Phonemisation -- 7.1 Introduction -- 7.2 Memory-Based Phonemisation -- 7.3 tribl and TreeTalk -- 7.4 Modularity and Linguistic Representations -- 7.5 Conclusion -- 8 Learnable Phonetic Representations in a Connectionist TTS System — I: Text to Phonetics -- 8.1 Introduction -- 8.2 Problem Background -- 8.3 Data Inputs and Outputs to Module M1 -- 8.4 Detailed Architecture of the Text-to-Phonetics Module -- 8.5 Model Selection -- 8.6 Results -- 8.7 Conclusions and Further Work -- 9 Using the Tilt Intonation Model: A Data-Driven Approach -- 9.1 Background -- 9.2 Tilt Intonation Model -- 9.3 Training Tilt Models -- 9.4 Experiments and Results -- 9.5 Conclusion -- 10 Estimation of Parameters for the Klatt Synthesizer from a Speech Database -- 10.1 Introduction -- 10.2 Global Parameter Settings -- 10.3 Synthesis of Vowels, Diphthongs and Glides -- 10.4 Stop Consonants (and Voiceless Vowels) -- 10.5 Estimation of Fricative Parameters -- 10.6 Other Sounds -- 10.7 Application: A Database of English Monosyllables -- 10.8 Conclusion -- 11 Training Accent and Phrasing Assignment on Large Corpora -- 11.1 Introduction -- 11.2 Intonational Model -- 11.3 Classification and Regression Trees -- 11.4 Predicting Pitch Accent Placement -- 11.5 Predicting Phrase Boundary Location -- 11.6 Conclusion -- 12 Learnable Phonetic Representations in a Connectionist TTS System — II: Phonetics to Speech -- 12.1 Introduction -- 12.2 Architecture of Phonetics-to-Speech Module -- 12.3 Training and Alignment -- 12.4 Phonetics-to-Speech Results -- 12.5 Conclusions and Further Work. 
520 |a Data-Driven Techniques in Speech Synthesis gives a first review of this new field. All areas of speech synthesis from text are covered, including text analysis, letter-to-sound conversion, prosodic marking and extraction of parameters to drive synthesis hardware. Fuelled by cheap computer processing and memory, the fields of machine learning in particular and artificial intelligence in general are increasingly exploiting approaches in which large databases act as implicit knowledge sources, rather than explicit rules manually written by experts. Speech synthesis is one application area where the new approach is proving powerfully effective, the reliance upon fragile specialist knowledge having hindered its development in the past. This book provides the first review of the new topic, with contributions from leading international experts. Data-Driven Techniques in Speech Synthesis is at the leading edge of current research, written by well respected experts in the field. The text is concise and accessible, and guides the reader through the new technology. The book will primarily appeal to research engineers and scientists working in the area of speech synthesis. However, it will also be of interest to speech scientists and phoneticians as well as managers and project leaders in the telecommunications industry who need an appreciation of the capabilities and potential of modern speech synthesis technology. 
590 |a Loaded electronically. 
590 |a Electronic access restricted to members of the Holy Cross Community. 
650 0 |a Acoustics. 
650 0 |a Data structures (Computer science). 
650 0 |a Signal processing. 
650 0 |a Image processing. 
650 0 |a Speech processing systems. 
650 0 |a Computational linguistics. 
650 0 |a Phonology. 
650 0 |a Artificial intelligence. 
690 |a Electronic resources (E-books) 
700 1 |a Damper, R.I.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
830 0 |a Telecommunications Technology & Applications Series 
830 0 |a Springer eBook Collection. 
856 4 0 |u https://holycross.idm.oclc.org/login?auth=cas&url=https://doi.org/10.1007/978-1-4757-3413-3  |3 Click to view e-book  |t 0 
907 |a .b32060567  |b 04-18-22  |c 02-26-20 
998 |a he  |b 02-26-20  |c m  |d @   |e -  |f eng  |g xxu  |h 0  |i 1 
912 |a ZDB-2-ENG 
912 |a ZDB-2-BAE 
950 |a Engineering (Springer-11647) 
902 |a springer purchased ebooks 
903 |a SEB-COLL 
945 |f  - -   |g 1  |h 0  |j  - -   |k  - -   |l he   |o -  |p $0.00  |q -  |r -  |s b   |t 38  |u 0  |v 0  |w 0  |x 0  |y .i21192212  |z 02-26-20 
999 f f |i ea62a6fc-3c02-5572-bbe9-9415f33af775  |s 80ea3549-f66c-536e-aeb3-4d245727ff68  |t 0 
952 f f |p Online  |a College of the Holy Cross  |b Main Campus  |c E-Resources  |d Online  |t 0  |e E-Book  |h Library of Congress classification  |i Elec File