SST 1996 Proceedings
Page numbers refer to nominal page numbers assigned to each paper for purposes of citation.
Session 1 Multimodal: Speech
| Pages | Authors | Title | |
|---|---|---|---|
| 1--6 | R.E.E. ROBINSON | Synthesising Facial Movement: Real Time Visual Speech | |
| 7--12 | Takefumi KITAYAMA, Hiroyuki KAMATA and Yoshihisa ISHIDA | Development Of Speech Training System For The Hard Of Hearing Person Based On Voice Synthesis Technique Using Vocal Tract Area Function | |
| 13--17 | Jordi Robert-Ribes and Bruce Millar | A Simple System For Measuring Audiovisual Speech |
Session 2 Features Analysis 1
| Pages | Authors | Title | |
|---|---|---|---|
| 19--24 | Peter Barger, Stefan Slomka, Pierre Castellano and Sridha Sridharan | Gender Gates For Automatic Speaker Recognition | |
| 25--30 | Raphael Ahn and W. Harvey Holmes | Voiced/Unvoiced/Silence Classification Of Speech Using 2-Stage Neural Networks With Delayed Decision Input | |
| 31--36 | K.L. Jenkin and M.S. Scordilis | Automatic Syllable Stress Classification Methods |
Session 3 Linguistics/Phonetics 1
| Pages | Authors | Title | |
|---|---|---|---|
| 37--42 | Yuko KINOSHITA | Linguistic Phonetic Differences In The Acoustics Of Plosives In Chinese Dialects | |
| 43--48 | Robert Bannert & Peter E. Czigler | Observations On The Duration Of /s/ In Standard Swedish | |
| 49--54 | Dawn M. Behne, Peter E. Czigler and Kirk P. Sullivan | Acoustic Characteristics Of Perceived Quantity And Quality In Swedish Vowels | |
| 55--60 | G. Dogil and J. Roux | Notes On Unencoded Speech: Clicks And Their Accompaniments In Xhosa | |
| 61--66 | David Deterding | Diphthong Measurements In Singapore English |
Session 4 Speech Recognition 1: Adverse Conditions
| Pages | Authors | Title | |
|---|---|---|---|
| 67--72 | Olli Viikki, Kari Laurila, Petri Haavisto | A Confidence Measure For Detecting Recognition Errors In Isolated Word Recognition | |
| 73--78 | S.E. Dixon and D.M.W. Powers | The Characterisation, Separation And Transcription Of Complex Acoustic Signals | |
| 79--84 | Jean-Baptiste PUEL | Cellular Phone Speech Recognition: Neural Nets Preprocessing Vs. Robust HMM Architectures | |
| 85--90 | B. T. Logan and A. J. Robinson | Noise Estimation For Enhancement And Recognition Within An Autoregressive Hidden-Markov-Model Framework | |
| 91--96 | Jinhai Cai and Zhi-Qiang Liu | An Adaptive Approach To Robust Speech Recognition |
Session 5 Forensic Linguistics
| Pages | Authors | Title | |
|---|---|---|---|
| 97--102 | Andrew Butcher | Getting The Voice Line-Up Right: Analysis Of A Multiple Auditory Confrontation | |
| 103--108 | F. Schlichting and K.P.H. Sullivan | Discrimination Of Imitated Voices | |
| 109--114 | Phil Rose | Speaker Verification Under Realistic Forensic Conditions | |
| 115--120 | J. Pittam and E.S. Rintel | The Acoustics Of Voice And Ethnic Identity | |
| 121--126 | Phil Rose and Alison Simmons | F-Pattern Variability In Disguise And Over The Telephone: Comparisons For Forensic Speaker Identification |
Session 6 Speech Recognition II
| Pages | Authors | Title | |
|---|---|---|---|
| 127--132 | Parham Mokhtari and Frantz Clermont | A Methodology For Investigating Vowel-Speaker Interactions In The Acoustic-Phonetic Domain | |
| 133--138 | W. J. Tey, N. P. Jong, and R. Togneri | Investigation Of Speech And Speaker Recognition Based On Trajectory Modeling Of Utterances | |
| 139--144 | Michael Wagner | Combined Speech-Recognition/Speaker-Verification System With Modest Training Requirements |
Session 7 Linguistics/Phonetics 2
| Pages | Authors | Title | |
|---|---|---|---|
| 145--150 | Frantz Clermont | Multi-Speaker Formant Data On The Australian English Vowels: A Tribute To J.R.L. Bernard's (1967) Pioneering Research | |
| 151--156 | Marija Tabain | Nasal Consonants In Yanyuwa And Yindjibarndi: An Acoustic Study | |
| 157--162 | Gerry Docherty & Paul Foulkes | A Corpus-Based Account Of Variation In The Realisation Of 'Released' /t/ In English | |
| 163--168 | Helen Fraser | An Introduction To Phenomenological Phonology (PP) |
Session 8 Coding & Synthesis
| Pages | Authors | Title | |
|---|---|---|---|
| 169--174 | S. C. Chu and J. S. Pan | Tabu Search Algorithms To VQ Codevector Index Assignment For Noisy Channels | |
| 175--180 | Mike Wu & W. H. Holmes | A Low Rate Sinusoidal Speech Coder | |
| 181--186 | H.R. Sadegh Mohammadi and W.H. Holmes | Differential Interpolative Prediction Scalar Quantization Of The Line Spectral Frequencies For Low Bit-Rate Spectral Coding Of Speech | |
| 187--192 | J. S. Pan and S. C. Chu | Improved Algorithms For VQ Codeword Search And The Derivation Of Bound For Quadratic Metric Using Principal Component Transform | |
| 193--198 | H.R. Sadegh Mohammadi and W.H. Holmes | Considerations In The Selection Of An Objective Measure To Assess The Quality Of Spectral Coding Methods | |
| 199--204 | Kerrie Lee, Phillip Dermody, Daniel Woo | Evaluation Of A Method For Subjective Assessment Of Speech Quality In Telecommunication Applications | |
| 205--210 | Peter Veprek | Czech Text-To-Speech System For A Reading Machine |
Session 9 Speech Disorders I
| Pages | Authors | Title | |
|---|---|---|---|
| 211--216 | Lyn Goldberg | The Effects Of The Attenuation Of Second And Third Formant Frequencies On The Recognition Of Stop Consonant Vowel Syllables In Aphasic And Nonaphasic Subjects | |
| 217--222 | P.F. McCormack & B. Dodd | A Feature Analysis Of Speech Errors In Subgroups Of Speech Disordered Children | |
| 223--228 | Lynda Penny, Simon Mitchell, Natasha Saunders, Jenny Hunwick, Helen Mitchard & Mary Vrlic | Some Aspects Of Speech And Voice In Healthy Ageing People | |
| 227--232 | Sameer Singh, Romola Bucks, Jody M. Cuerden | Speech In Alzheimer's Disease | |
| 233--238 | Sameer Singh and Tom Gedeon | Hypertext Tools In Speech And Language Therapy |
Session 10 Speaker Recognition
| Pages | Authors | Title | |
|---|---|---|---|
| 239--244 | A. Satriawan and J.B. Millar | Broad Phonetic Class Based Speaker Modelling | |
| 245--249 | S. Hussain, F. R. McInnes and M. A. Jack | Comparison Of Neural Network Techniques For Speaker Verification | |
| 251--256 | A. Samouelian | Automatic Language Identification Using Inductive Inference | |
| 257--262 | Karsten Kumpf | LDA Based Modelling Of Foreign Accents In Continuous Speech | |
| 263--268 | D. R. Dersch | The Acoustic Fingerprint: A Method For Speaker Identification, Speaker Verification And Accent Identification |
Session 11 Speech Disorders II: Cochlear Implant And Hearing Improvement
| Pages | Authors | Title | |
|---|---|---|---|
| 269--274 | J.Z. Sarant, P.J. Blamey and G.M. Clark | The Effect Of Language Knowledge On Speech Perception In Children With Impaired Hearing | |
| 275--280 | Cécile Pereira | Angry, Happy, Sad Or Plain Neutral? The Identification Of Vocal Affect By Hearing-Aid Users | |
| 281--286 | P.J. Blamey, E.S. Parisi & G.J. Dooley | Perception Of Two-Formant Vowels By Normal Listeners And People Using A Hearing Aid And A Cochlear Implant In Opposite Ears | |
| 287--292 | Bernice McGuire | Speech, Phonological Awareness And Reading Skills In Children With Impaired Hearing |
Session 12 Speech Recognition III
| Pages | Authors | Title | |
|---|---|---|---|
| 295--301 | A. Samouelian | Connected Digit Recognition Using Inductive Inference | |
| 301--306 | A. Jusek, G. A. Fink, F. Kummert, and G. Sagerer | Automatically Generated Models For Unknown Words | |
| 307--312 | D. R. Dersch | Neural Network Approaches To Speech Recognition: A General Radial Basis Function Network For Speaker-Independent Phone Classification | |
| 313--318 | David B. Grayden and Michael S. Scordilis | Using The Vowel Triangle In Automatic Speech Recognition | |
| 319--324 | Michael Barlow, Stephanie Dal, Tatsuo Matsuoka and Sadaoki Furui | An Automatically Acquired CFG For Speech Understanding And Hypotheses Reordering |
Session 13 Speech Development
| Pages | Authors | Title | |
|---|---|---|---|
| 325--330 | Christine Kitamura & Denis Burnham | Pitch & Communicative Intent In Infant-Directed Speech: Longitudinal Data | |
| 331--336 | S. McLeod, J. van Doorn, and V. Reed | Homonyms And Cluster Reduction In The Normal Development Of Children's Speech | |
| 337--342 | P.F. McCormack & T. Knighton | Gender Differences In The Speech Patterns Of Two And A Half Year Old Children | |
| 343--348 | Christine Kitamura & Denis Burnham | Infant Preferences For Infant-Directed Speech: Is Vocal Affect More Salient Than Pitch? |
Session 14 Databases
| Pages | Authors | Title | |
|---|---|---|---|
| 351--356 | Peter Roach, Simon Arnfield and Elizabeth Hallum | Babel: A Multi-Language Database | |
| 355--360 | Christoph Draxler | The German SpeechDat Telephone Speech Corpus Overview And Experiences | |
| 361--366 | Steve Cassidy and Jonathan Harrington | Emu: An Enhanced Hierarchical Speech Data Management System |
Session 15 Posters
| Pages | Authors | Title | |
|---|---|---|---|
| 367--372 | C. Blight, A. Butcher, & P. McCormack | Nasal Airflow Measures Pre- And Post- Tonsillectomy | |
| 407--412 | Young-Mok Ahn, Hoi-Rin Kim | Development Of A Very Fast Preprocessor |
Session 16 Second Language Linguistics
| Pages | Authors | Title | |
|---|---|---|---|
| 491--496 | John Ingram | Perception Of Tensity And Aspiration In Synthesised Korean Stop Consonants | |
| 497--502 | Duncan Markham | Similarity And Newness - Workable Concepts In Describing Phonetic Categorisation? | |
| 503--508 | Denis Burnham, Sheila Keane | Where Does Auditory-Visual Speech Integration Occur? Japanese Speakers' Perception Of The McGurk Effect As A Function Of Vowel Environment | |
| 509--514 | K.P.H. Sullivan and Y.N. Karst | Perception Of English Accent By Native British English Speakers And Swedish Learners Of English | |
| 515--520 | C Tsurutani and J. Ingram | Prosodic Template In Word Blending: A Comparison Between Native Japanese And English Learners Of Japanese |
Session 17 Signal Processing
| Pages | Authors | Title | |
|---|---|---|---|
| 521--526 | Peter Veprek and Michael S. Scordilis | Enhanced Speech Classification And Pitch Detection | |
| 527--532 | David R.L. Davies and J. Bruce Millar | Evaluation Of A Computationally Efficient Method For Generating A Voiced-Source Synchronised Timing Signal | |
| 533--538 | L. Candille, M. George, A. Soquet and H. Meloni | Control Of A Vocal Tract Model Based On Articulatory Measurements And Acoustic Optimization | |
| 539--544 | D. Cole, M. Moody and S. Sridharan | Alternative Methods For Reverberant Speech Enhancement | |
| 545--550 | Peter Veprek and Michael S. Scordilis (The University of Melbourne, Australia) | A Constrained DTW-Based Procedure For Speech Segmentation | |
| 551--556 | Richard Katsch, Phillip Dermody, John Seymour, Loredana Cerrato | Objective Identification Of Speech Presented In Noise |
Session 18 Speech Physiology
| Pages | Authors | Title | |
|---|---|---|---|
| 555--560 | W Hardcastle, B Vaxelaire, F Gibbon, P Hoole and N Nguyen | Tongue Kinematics In /kl/ Clusters And Singleton /k/: A Combined EMA/EPG Study | |
| 561--566 | Anders Lofqvist | Control Of Oral Closure And Release In Bilabial Stop Consonants | |
| 567--572 | Peter J. Alfonso | Long-Term Spatiotemporal Stability Of Lip-Jaw Synergies For Bilabial Closure | |
| 577--582 | Janet Fletcher, Mary E. Beckman, and Jonathan Harrington | Accentual-Prominence-Enhancing Strategies In Australian English |
Session 19 Prosody I
| Pages | Authors | Title | |
|---|---|---|---|
| 581--586 | Xiaonong Sean Zhu | Two Stress Patterns Of Shanghai Compounds | |
| 587--592 | Denis Burnham, Elizabeth Francis, Di Webster, Sudaporn Luksaneeyanawin, Francisco Lacerda, and Chayada Attapaiboon | Facilitation Or Attenuation In The Development Of Speech Mode Processing? Tone Perception Over Linguistic Contexts | |
| 593--598 | Phil Rose | Aerodynamic Involvement In Intrinsic F0 Perturbations - Evidence From Thai-Phake |
Session 20 Prosody II
| Pages | Authors | Title | |
|---|---|---|---|
| 599--604 | Anne Cutler and Takashi Otake | The Processing Of Word Prosody In Japanese | |
| 605--610 | Phil Rose | The Realisation Of Stopped-Syllable Tones In Hua Sai And Pakphanang | |
| 611--616 | Janet Fletcher and Jonathan Harrington | Timing Of Intonational Events In Australian English |
Session 21 Features Analysis II
| Pages | Authors | Title | |
|---|---|---|---|
| 617--622 | Stefan Slomka, Peter Barger, Pierre Castellano and Sridha Sridharan | Gender Gates In Degraded Environments | |
| 623--628 | Marija Tabain and Catherine Watson | Classification Of Fricatives | |
| 629--634 | K. Chong and R. Togneri | Extraction Of A Speech Signal In The Presence Of A Musical Note Signal | |
| 635--640 | Daniel Woo, Phillip Dermody | Simulation Of Human Incremental Speech Gating Performance Using Time Frequency Analysis And A Simple Classifier | |
| 640--645 | Ira Gerson, Orhan Karaali, Gerald Corrigan, and Noel Massey | Neural Network Speech Synthesis | |
| 645--650 | W.N. Farrell and W.G. Cowley | Maximum A Posteriori Decoding For Speech Codec Parameters |
