SST 2016 Proceedings

Proceedings of the Sixteenth Australasian International Conference on Speech Science and Technology (SST2016)

Dec 6-9 2016, Parramatta, Australia

Editors Christopher Carignan and Michael D. Tyler

ISSN 2207-1296 Copyright © 2016 ASSTA All rights reserved

The pdf of the full proceedings can be downloaded here. Individual papers can be downloaded from the conference schedule below.

For SST2016, we invited submission of 4-page papers (for a 20-minute oral presentation or a poster presentation and publication in the proceedings) and 1-page abstracts (for poster presentation only and publication in the conference programme). The submissions were all blind peer reviewed by two anonymous reviewers, and papers were selected on the basis of reviewer comments and scores. Authors resubmitted final deanonymised versions of their papers and abstracts that took into account the reviewers’ comments.

Click here to return to the SST conference series page.

SST2016 Conference
Novotel Sydney Parramatta
Wednesday 7 December 2016
  Oral Sessions 11:00-12:15
   
L1 ACQUISITION
Chair: Titia Benders
SPEECH PROSODY
Chair: Sasha Calhoun
11:00-11:25 Fundamental frequency characteristics of infant vocalisations: A study in voice quality Perception of statement and question intonation: Cantonese versus Mandarin
Adele Gregory and Marija Tabain Una Chow and Stephen Winters
11:25-11:50 3-year-olds produce pitch contours consistent with Mandarin tone 3 sandhi Prosodic characteristics of Japanese polite speech spoken by native and non-native speakers
Nan Xu Rattanasone, Ping Tang, Ivan Yuen, Liqun Gao and Katherine Demuth Chiharu Tsurutani, Shuju Shi and Nobuaki Minematsu
11:50-12:15 The role of positive affect in the acquisition of word-object associations Accentual lengthening in 5-year-old AusE-speaking children: Preliminary results
Nicole Traynor, Karen Mulak, Rachel Robbins, Gabrielle Weidemann and Paola Escudero Ivan Yuen, Nan Xu Rattanasone, Elaine Schmidt, Gretel Macdonald, Rebecca Holt and Katherine Demuth
  Oral Sessions 1:05-2:20
   
AUSTRALASIAN LANGUAGES
Chair: Rikke Bundgaard-Nielsen
L2 ACQUISITION
Chair: Karen Mulak
1:05-1:30 A comparative ultrasound study of manner contrasts in Arrernte and Kannada: Manner contrasts Tailoring phonetic learning to the needs of individuals on the basis of language aptitude
Marija Tabain, Alexei Kochetov, Richard Beare and Narayan Sreedevi Mark Antoniou and Melissa Blair
1:30-1:55 Verbal reduplication and minimal words in Kaytetye L2 phonological category formation and discrimination in learners varying in L2 experience
Forrest Panther, Mark Harvey, Harold Koch, Myfany Turpin and Michael Proctor Mona Faris, Catherine Best and Michael Tyler
1:55-2:20 Short vowels in L1 Aboriginal English spoken in South-Western Victoria The effects of training order and intensity on the perception and production of English /e/-/ae/ by Cantonese ESL learners
Deborah Loakes, Janet Fletcher, John Hajek, Joshua Clothier and Ben Volchok Janice Wing Sze Wong
  Oral Sessions 2:40-3:55
   
APPLICATIONS OF SPEECH SCIENCE AND TECHNOLOGY
Chair: Adam Vogel
SOCIOPHONETICS
Chair: Debbie Loakes
2:40-3:05 The Australian SpIN� speech in noise test Similarity in global accent promotes generalized learning of accent markers
Peter Blamey, Maryam Zargarbashi, Jeremy Blamey and Elaine Saunders Ann-Kathrin Grohe, Margarita Downing and Andrea Weber
3:05-3:30 Using optical flow and electromagnetic articulography in multimodal speech research Patterns of gender variation in the speech of primary school-aged children in Australian English: The case of /p t k/
Samantha Gordon Danner, Louis Goldstein, Eric Vatikiotis-Bateson, Robert Fuhrman and Adriano Barbosa Casey Tait and Marija Tabain
3:30-3:55 Acoustic monitoring of speech impairment in motor neuron disease associated with frontotemporal dementia: A case series Change in Māori focus/topic ko: The impact of language contact on prosody
Matthew Poole, Amy Brodtmann, David Darby and Adam Vogel Sasha Calhoun, Naoko Yui and Karena Kelly

Thursday 8 December 2016
  Oral Sessions 10:30-12:10
     
SPEECH PERCEPTION I
Chair: Heather Kember
SPECIAL SESSION: EFFECTS OF EARLY LANGUAGE ENVIRONMENT AND LINGUISTIC INPUT ON THE DEVELOPMENT OF SPEECH PERCEPTION SKILLS IN THE FIRST YEARS OF LIFE
Chair: Marina Kalashnikova
ARTICULATORY PHONETICS
Chair: Donald Derrick
10:30-10:55 Disambiguation of Australian English vowels Does a vowel by any other accent sound the same � to toddler ears? Exploring articulatory characteristics of linking /ɹ/ in British English
Tunde Szalay, Titia Benders, Felicity Cox and Michael Proctor Catherine Best, Christine Kitamura, Sophie Gates, and Angela Carpenter Mitsuhiro Nakamura
10:55-11:20 Regional priming in Australian English kit, dress and trap vowels Measuring sensitivity to phonological detail in monolingual and bilingual infants using pupillometry Tongue positions corresponding to formant values in Australian English vowels
Michael Walker, Anita Szakay and Felicity Cox Katalin Tam�si, Thilanga Wewalaarachchi, Barbara H�hle and Leher Singh Arwen Blackwood Ximenes, Jason Shaw and Christopher Carignan
11:20-11:45 The relative contributions of duration and amplitude to the perception of Japanese-accented English as a function of L2 experience Sensitivity to vowel, consonant and tone variation in early childhood Nasal aerodynamics and coarticulation in Bininj Kunwok: Smoothing spline analysis of variance
Saya Kawase, Jeesun Kim and Chris Davis Thilanga D. Wewalaarachchi and Leher Singh Hywel Stoakes, Janet Fletcher and Andrew Butcher
11:45-12:10 The role of closure duration in the perception of word-initial geminates in Kelantan Malay The effects of allomorphic variation on children�s acquisition of plural morphology Preliminary investigations into the Australian English articulatory vowel space
Mohd Hilmi Hamzah, Janet Fletcher and John Hajek Benjamin Davies, Nan Xu Rattanasone and Katherine Demuth Louise Ratko, Michael Proctor, Felicity Cox and Sean Veld
  Oral Sessions 1:10-2:25
     
BILINGUALISM
Chair: Mark Antoniou
FORENSIC SPEECH SCIENCE I
Chair: Eugenia San Segundo Fernandez
SPEECH REGISTERS
Chair: Denis Burnham
1:10-1:35 The early bilingual influence on speech and music processing Preliminary performance comparison between PCAKLR and GMM-UBM for computing the strength of speech evidence in forensic voice comparison Free labeling of audio-visual attitudinal expressions in German
Liquan Liu Hanie Mehdinezhad and Bernard Guillemin Hansj�rg Mixdorff, Angelika H�nemann and Albert Rilliard
1:35-2:00 Monolingual and bilingual adults can successfully learn foreign language words implicitly Impact of various GSM network factors on forensic voice comparison An analysis of Lombard Effect on Thai lexical tones: The role of communicative aspect
Hana Zjakic, Alba Tuninetti and Paola Escudero Balamurali Nair, Esam Alzqhoul and Bernard Guillemin Chariya Boontham, Chutamanee Onsuwan, Tanawan Saimai and Charturong Tantibundhit
2:00-2:25 The bilingual advantage in the language processing domain: Evidence from the Verbal Fluency Task Adapted Gaussian Mixture Model in likelihood ratio based forensic voice comparison using long term fundamental frequency Whispered and Lombard speech: Different ways to exaggerate articulation
Gloria Pino Escobar, Marina Kalashnikova and Paola Escudero Carolin Elisabeth Buncle Diesner and Shunichi Ishihara Chris Davis and Jeesun Kim
  Oral Sessions 2:50-4:05
     
SPEECH PERCEPTION II
Chair: Laurence Bruggeman
SPEECH ENHANCEMENT
Chair: Julien Epps
PITCH ACCENT
Chair: Chiharu Tsurutani
2:50-3:15 Revisiting the interlanguage speech intelligibility benefit Comparison of speech enhancement algorithms for forensic applications Searching for importance: Focus facilitates memory for words in English
Chang Shu, Ian Wilson and
Jeremy Perkins
Ahmed Hasan, David Dean, Bouchra Senadji and Vinod Chandran Heather Kember, Jiyoun Choi and Jenny Yu
3:15-3:40 Speech normalization across speaker, sex and accent variation is handled similarly by listeners of different language backgrounds A Kalman filtering algorithm with joint metrics-based tuning for single-channel speech enhancement Pitch accent type affects lexical activation in German: Evidence from eye tracking
Gloria Pino Escobar, Josephine Terry, Buddhamas Pralle Kriengwatana and Paola Escudero Aidan George, Stephen So, Ratna Ghosh and Kuldip Paliwal Katharina Zahner, Muna Sch�nhuber, Janet Grijzenhout and Bettina Braun
3:40-4:05 Cross-accent word recognition is affected by perceptual assimilation A comparison of estimation methods in the discrete cosine transform modulation domain for speech enhancement Evidence and intonational contours: An experimental approach to meaning in intonation
Sarah Wright, Mark Lathouwers, Catherine Best and Michael Tyler Aidan George, Christine Pickersgill, Belinda Schwerin and Stephen So Byron Ahn, Stefanie Shattuck-Hufnagel and Nanette Veilleux
  Oral Sessions 4:30-5:45
     
MORPHOPHONOLOGY
Chair: Mark Harvey
SPEECH SYNTHESIS
Chair: Frantz Clermont
LEXICAL TONE
Chair: Ivan Yuen
4:30-4:55 �She has many... cat?�: On-line processing of L2 morphophonology by Mandarin learners of English A noise-robust linear prediction analysis for efficient speech coding Normalization of Zhangzhou citation tones
Valeria Peretokina, Catherine Best, Michael Tyler and Bruno Di Biase Aadel Alatwi, Stephen So and Kuldip Paliwal Yishan Huang, Mark Donohue, Phil Rose and Paul Sidwell
4:55-5:20 Pause acceptability is predicted by morphological transparency in Wubuy Synthesizing attitudes in German Comparing normalisation strategies for citation tone F0 in four Chinese dialects
Brett Baker and Rikke Bundgaard-Nielsen Angelika H�nemann and Petra Wagner Philip Rose
5:20-5:45 Morphological status and acoustic realization: Findings from New Zealand English Exploring text to speech synthesis in non-standard languages Perception of tonal contrasts: High-variability perceptual training with iconic visual information
Julia Zimmermann Jesin James, Catherine Watson and Deepa Gopinath Yan Chen

Friday 9 December 2016
     
INFANT-DIRECTED SPEECH
Chair: Nan Xu Rattanasone
AUTOMATIC SPEECH RECOGNITION
Chair: Catherine Watson
11:15-11:40 Exploring the association of infant temperament on maternal fundamental frequency contours  
Alix Woolard, Titia Benders, Linda Campbell, Frini Karayanidis, Joerg Mattes, Vanessa Murphy, Olivia Whalen and Alison Lane  
11:40-12:05 Emotion-related explanations of the vowel variability in infant-directed speech Noise-robust linear prediction cepstral features for network speech recognition
Titia Benders Aadel Alatwi, Stephen So and Kuldip Paliwal
12:05-12:30 The role of affect processing on infant word learning Formant dynamics and durations of um improve the performance of automatic speaker recognition systems
Jessica Bazouni, Liquan Liu, Gabrielle Weidemann and Paola Escudero Vincent Hughes, Paul Foulkes and Sophie Wood
12:30-12:55 Exploring quantitative differences in mothers� and fathers� infant-directed speech to Australian 6-month-olds Eigenfeatures: An alternative to Shifted Delta Coefficients for language identification
Christa Lam-Cassettari and Paige Noble Sarith Fernando, Vidhyasaharan Sethu and Eliathamby Ambikairajah
  Oral Sessions 1:45-3:25
   
ACOUSTIC PHONETICS
Chair: Hywel Stoakes
SPEECH EMOTION RECOGNITION
Chair: Vincent Hughes
1:45-2:10 Secondary tongue retraction in Arabic emphatics: An acoustic study Lacking vision: Insights into the automatic classification of emotion in AMC�s the walking dead
Hamed Altairi, Catherine Watson and Jason Brown Joanne Quinn
2:10-2:35 Sound change or experimental artifact? A study on the impact of data preparation on measuring sound change Depression prediction via acoustic analysis of formulaic word fillers
Catherine Watson and Zoe Evans Brian Stasak, Julien Epps and Nicholas Cummins
2:35-3:00 Temporal correlates of Lopit singleton and geminate glides Time to embrace emotion change: Selecting emotionally salient segments for speech-based emotion prediction
Rosey Billington Zhaocheng Huang and Julien Epps
3:00:3:25 Child Kriol has stop distinctions based on VOT and constriction duration  Continuous spoken emotion recognition based on time-frequency features of the glottal pulse signal within stressed vowels
Rikke Bundgaard-Nielsen, Brett Baker and Elise Bell Li Tian and Catherine Watson
  Oral Sessions 3:45-5:00
   
CROSS-LANGUAGE VOWEL PERCEPTION
Chair: Kimiko Tsukada
FORENSIC SPEECH SCIENCE II
Chair: Bernard Guillemin
3:45-4:10 Can Australian English listeners learn non-native vowels via distributional learning? The effect of sampling procedures on the performance of likelihood ratio based forensic voice comparison: F0 distributional parameters
Jia Hoong Ong, Josephine Terry and Paola Escudero Shunichi Ishihara
4:10-4:35 The relationship between Australian English speakers� non-native perception and production of Brazilian Portuguese vowels Exploring forensic accent recognition using the Y-ACCDIST system
Jaydene Elvin, Paola Escudero, Daniel Williams and Catherine Best Georgina Brown
4:35-5:00 Lebanese Arabic listeners find Australian English vowels easy to discriminate Holistic perception of voice quality matters more than L1 when judging speaker similarity in short stimuli
Ronda Aboultaif, Jaydene Elvin, Daniel Williams and Paola Escudero Eugenia San Segundo, Paul Foulkes and Vincent Hughes

 

Poster presentations 
Background specificity in forensic voice comparison and its relation to the Bayesian prior probability
Michael Wagner and Yuko Kinoshita

Durational and spectral differences in Thai diphthongs and final glides
Phongphat Man-khongdi, Chutamanee Onsuwan, and Charturong Tantibundhit

Effect of clinical depression on automatic speaker verification
Sheeraz Memon, Mukhtiar Ali Unar, and Bhawani Shakar Chowdhry

Japanese vowel deletion occurs in words in citation form
Alexander Kilpatrick, Rikke Bundgaard-Nielsen, and Brett Baker

Lexical manipulation as a discovery tool for psycholinguistic research
Laurence Bruggeman and Anne Cutler

Preservation of tone in right-dominant tone sandhi: A fragment of disyllabic tone sandhi in Máodiàn Wú Chinese
Ruiqing Shen and Phil Rose

Sub-band cepstral variability within and between speakers under microphone and mobile conditions: A preliminary investigation
Frantz Clermont, Yuko Kinoshita, and Takashi Osanai

The role of nuclear stress in intelligibility: The case of Cantonese speakers of English
Simon Ka Ngai Leung and Janice Wing Sze Wong

Towards a better understanding of regional variation in Standard Australian English:  Analysis and comparison of Tasmanian English monophthongs
Rael Stanley

Using version control to facilitate a reproducible and collaborative workflow in acoustic phonetics
Nay San

Visualisation tools to analyse phonetic confusions for speech perception tests
Chung Ting Justine Hui, Catherine Watson, and Takayuki Arai