Statistical Speech Technology Group
Papers in Peer-Reviewed Journals

Books Journals Conferences Talks

2006

Tong Zhang, Mark Hasegawa-Johnson and Stephen E. Levinson, "Cognitive State Classification in a spoken tutorial dialogue system," Speech Communication 48(6):616-632, 2006.

Jennifer Cole, Heejin Kim, Hansook Choi, and Mark Hasegawa-Johnson, "Prosodic effects on acoustic cues to stop voicing and place of articulation: Evidence from Radio News speech." J Phonetics. (to appear).

Tong Zhang, Mark Hasegawa-Johnson and Stephen E. Levinson, "Extraction of Pragmatic and Semantic Salience from Spontaneous Spoken English," Speech Communication (to appear).

Soo Eun Chang, Nicoline Ambrose, Christy Ludlow, and Mark Hasegawa-Johnson, "Brain Anatomy Differences in Childhood Stuttering," Brain (to appear).

Ken Chen, Mark Hasegawa-Johnson, Aaron Cohen, Sarah Borys, Sung-Suk Kim, Jennifer Cole and Jeung-Yoon Choi, "Prosody Dependent Speech Recognition on Radio News Corpus of American English," IEEE Transactions on Audio, Speech, and Language, 14(1):232-245, 2006.

2005

Mark Hasegawa-Johnson, Ken Chen, Jennifer Cole, Sarah Borys, Sung-Suk Kim, Aaron Cohen, Tong Zhang, Jeung-Yoon Choi, Heejin Kim, Taejin Yoon and Sandra Chavarria, "Simultaneous Recognition of Words and Prosody in the Boston University Radio Speech Corpus," Speech Communication, 46(3-4):418-439, 2005.

Jeung-Yoon Choi, Mark Hasegawa-Johnson and Jennifer Cole, "Finding Intonational Boundaries Using Acoustic Cues Related to the Voice Source," Journal of the Acoustical Society of America, 118(4):2579-88, 2005.

2004

Sung-Suk Kim, Mark Hasegawa-Johnson, and Ken Chen, Automatic Recognition of Pitch Movements Using Time-Delay Recursive Neural Network., accepted for publication, IEEE Signal Processing Letters

Mohamed Kamal Omar and Mark Hasegawa-Johnson, Model Enforcement: a Unified Feature Transformation Framework for Classification and Recognition. Accepted for publication, IEEE Transactions on Signal Processing

M. Kamal Omar and Mark Hasegawa-Johnson, Approximately Independent Factors of Speech Using Nonlinear Symplectic Transformation. IEEE Transactions on Speech and Audio Processing, accepted for publication.

2003

Sarah Borys, Mark Hasegawa-Johnson, and Jennifer Cole, Prosody as a Conditioning Variable in Speech Recognition." Illinois Journal of Undergraduate Research, 2003.

M. Hasegawa-Johnson, S. Pizza, A. Alwan, J. Cha, and K. Haker, Vowel Category Dependence of the Relationship Between Palate Height, Tongue Height, and Oral Area, Journal of Speech, Language, and Hearing Research, June, 2003.

Y. Zheng, M. Hasegawa-Johnson and S. Pizza, "PARAFAC Analysis of the Three dimensional tongue Shape," Journal of the Acoustical Society of America, Vol. 113, No. 1, January 2003.(Figures here)

2002

M. Hasegawa-Johnson and A. Alwan, Speech Coding: Fundamentals and Applications. in Wiley Encyclopedia of Telecommunications and Signal Processing, ed. J. Proakis, Wiley and Sons, NY, December 2002.

M. Hasegawa-Johnson, Finding the Best Acoustic Measurements for Landmark-Based Speech Recognition. Accumu: the Journal of Arts and Technology of the Kyoto Computer Gakuin, Kyoto Computer Gakuin, December 2002.

2000

Hasegawa-Johnson, M.A. (2000), "Line spectral frequencies are the poles and zeros of a discrete matched-impedance vocal tract model", JASA 108(1):457-460.

1992-1999

Hasegawa-Johnson, M.A. (1998) Electromagnetic exposure safety of the Carstens Articulograph AG100, JASA 104:2529-2532.

Hasegawa-Johnson, M.A. (1998) Course Notes in Speech Production, Speech Coding, and Speech Recognition. (download page)

Johnson, M.A. (1992) ``Analysis of durational rhythms in two poems by Robert Frost.'' MIT Speech Comm. Group Working Papers, Cambridge, MA, 29-42.

Johnson, M.A. and Taniguchi, T. (1992) On-line and off-line computational reduction techniques using backward filtering in CELP speech coders. IEEE Trans. ASSP 40:2090-2093.