Welcome to Henry Hao Tang's Personal Web Site
PDF version (Updated on Feb. 24, 2010)

Objective

      Seeking an R&D summer intern position in related fields.

Research Interests

  • Statistical Pattern Recognition
  • Machine Learning
  • Computer Vision
  • Computer Graphics
  • Speech Processing
  • Biometrics
  • Multimedia Signal Processing

Education

  • 2005 ~ now, Ph.D. candidate, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois (advisor: Dr. Thomas S. Huang)
  • 2005 M.S., Department of Electrical and Computer Engineering, Rutgers University, Piscataway, New Jersey (advisor: Dr. James L. Flanagan)
  • 2003 M.E., Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, China
  • 1998 B.E., Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, China

Employment

  • Jan. 2009 ~ May 2009, Teaching Assistant, ECE Department, UIUC
  • Summer 2008, Intern, IBM T. J. Watson Research Center
  • Jan. 2008 ~ May 2008, Teaching Assistant, ECE Department, UIUC
  • Summer 2007, Intern, Microsoft Research
  • Aug. 2005 ~ now, Research Assistant, IFP Group, Beckman Institute, UIUC
  • Aug. 2004 ~ May 2005, Teaching Assistant, English Department, Rutgers University
  • Jan. 2004 ~ Aug. 2004, Research Assistant, CAIP Center, Rutgers University
  • Jul. 1998 ~ Jul. 2003, Teacher, EEIS Department, University of Science and Technology of China
  • Jul. 1998 ~ Jul. 2003, Senior Software Engineer, Research Director, Research Manager, Anhui USTC iFlyTEK. Co., Ltd. (Ke Da Xun Fei)

Major Projects

  • 2008~, Audiovisiual Emotion Recognition and Synthesis
  • 2008~, Speaker Biometrics
  • Summer 2008, Semi-Supervised Speaker Clustering, IBM Research
  • Summer 2007, An Expressive Avatar Research Platform, Microsoft Research
  • 2007~, Video-Based Motion Capture Camera and Microphone Array System (Hardware & Software) and 3D Audiovisual Dynamic Face Databases, UIUC
  • 2007, Emotional Speech Databases for Emotive Prosody and Emotional Text-to-Speech Synthesis Research, UIUC
  • 2007, Clear Evaluation - Acoustic Event Detection and Classification, UIUC
  • 2006, Clear Evaluation - Multimodal Person Identification, UIUC
  • 2005 ~ 2006, A Text Messaging System with Emotive Audio-Visual Avatar (joint project between Motorola and UIUC), UIUC
  • 2004, Very Low Bit-Rate Speech Coding (subcontract from Sarnoff Corp.), Rutgers
  • 2002 ~ 2003, Corpus-Based Chinese-English Mixed-Lingual Text-to-Speech Synthesis, iFlyTEK (award winning)
  • 2002 ~ 2003, Corpus-Based Cantonese Text-to-Speech Synthesis, iFlyTEK
  • 2002, Talking Book (plug-in software incorporating TTS functionality into e-books), iFlyTEK
  • 2001 ~ 2002, Distributed Speech Synthesis, iFlyTEK (patented)
  • 2000, Talking Web (a Chinese-English bilingual talking web browser), iFlyTEK
  • 1999, An Intelligent Chinese Speech Platform, iFlyTEK
  • 1999, Speaking Earth Clock (universal time telling software supporting customized Chinese-English bilingual speech recognition and synthesis), iFlyTEK
  • 1998 ~ 2000, The KD-Series Chinese Text-to-Speech Systems, iFlyTEK (award winning)
  • 1998 ~ 1999, The Uni-Brain Intelligent Chinese Platform, iFlyTEK
  • 1998 ~ 1999, The Tian-Yin-Hua-Wang System (large-scale desktop software supporting Chinese text-to-speech, speech-to-text and voice navigation), iFlyTEK
  • 1996 ~ 1998, Geographic Information Systems (GIS), USTC (on Unix)
  • 1995 ~ 1996, A Photo-Realistic Scene Generation System Based on the Ray-Tracing Algorithm, USTC (on Unix)

Publications

Journal

  1. Hao Tang, Yun Fu, Jilin Tu, Mark Hasegawa-Johnson, and Thomas S. Huang, "Humanoid Audio-Visual Avatar with Emotive Text-To-Speech Synthesis," IEEE Transactions on Multimedia (T-MM), Volume: 10,  Issue: 6, pp. 969-981, October, 2008

Conference

  1. Hao Tang, Stephen M. Chu, Thomas S. Huang, "****** under anonymous review ******," submitted to the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009.
  2. Stephen M. Chu, Hao Tang, Thomas S. Huang, "Locality Preserving Speaker Clustering," submitted to 2009 International Conference on Multimedia & Expo (ICME'09)
  3. Hao Tang, Stephen M. Chu, Mark Hasegawa-Johnson, Thomas S. Huang, "Emotion Recognition from Speech via Boosted Gaussian Mixture Models," submitted to 2009 International Conference on Multimedia & Expo (ICME'09)
  4. Stephen M. Chu, Hao Tang, Thomas S. Huang, "Fishervoice and Semi-supervised Speaker Clustering," 2009 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09) (accepted)
  5. Hao Tang, Stephen M. Chu, Thomas S. Huang, "Generative Model-based Speaker Clustering via Mixture of von Mises-Fisher Distributions," 2009 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09) (accepted)
  6. Hao Tang and Thomas S. Huang, "Boosting Gaussian Mixture Models via Discriminant Analysis," 2008 IEEE International Conference on Pattern Recognition (ICPR'08), Tempa, FL, December, 2008
  7. Xi Zhou, Xiaodan Zhuang, Hao Tang, Mark Hasegawa-Johnson, Thomas Huang, "A Novel Gaussianized Vector Representation for Natural Scene Categorization," 2008 IEEE International Conference on Pattern Recognition (ICPR'08), Tempa, FL, December, 2008 (IBM Best Student Paper Award)
  8. Hao Tang and Thomas S. Huang, "MPEG4 Performance-Driven Avatar via Robust Facial Motion Tracking," 2008 IEEE International Conference on Image Processing (ICIP'08), San Diego, CA, October, 2008
  9. Jianchao Yang, Hao Tang, Yi Ma, Thomas Huang, "Face Hallucination via Sparse Coding," 2008 IEEE International Conference on Image Processing (ICIP'08), San Diego, CA, October, 2008
  10. Hao Tang, Xi Zhou, Matthias Odisio, Mark Hasegawa-Johnson, and Thomas S. Huang, "Two-Stage Prosody Prediction for Emotional Text-to-Speech Synthesis,"  INTERSPEECH 2008, Brisbane, Australia, September, 2008
  11. Hao Tang and Thomas S. Huang, "3D Facial Expression Recognition Based on Properties of Line Segments Connecting Facial Feature Points," 2008 IEEE International Conference on Automatic Face and Gesture Recognition (FG'08), Amsterdam, The Neitherlands, September, 2008
  12. Hao Tang and Thomas S. Huang, "3D Facial Expression Recognition Based on Automatically Selected Features," CVPR 2008 Workshop on 3D Face Processing (CVPR-3DFP'08), Anchorage, Alaska, June, 2008
  13. Hao Tang, Yuxiao Hu, Yun Fu, Mark Hasegawa-Johnson, and Thomas S. Huang, "Real-Time Conversion from a Single 2D Face Image to a 3D Text-Driven Emotive Audio-Visual Avatar," 2008 IEEE International Conference on Multimedia & Expo (ICME'08), Hannover, Germany, June 2008
  14. Yuxiao Hu, Hao Tang, and Thomas S. Huang, "Camera and Microphone Array for 3D Audiovisual Face Data Collection," 2008 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'08), Las Vegas, Nevada, March, 2008
  15. Hao Tang, Zhixiong Chen, and Thomas S. Huang, "Comparison of Algorithms for Speaker Identification under Adverse Far-Field Recording Conditions with Extremely Short Utterances," Proc. 2008 IEEE International Conference On Networking, Sensing and Control (ICNSC'08), pp. 796 - 801, Sanya, China, April, 2008
  16. Hao Tang, Yun Fu, Jilin Tu, Thomas S. Huang, and Mark Hasegawa-Johnson, "EAVA: A 3D Emotive Audio-Visual Avatar," 2008 IEEE Workshop on Applications of Computer Vision (WACV'08), Copper Mountain, CO, January, 2008
  17. Xi Zhou, Xiaodan Zhuang, Ming Liu, Hao Tang, Mark Hasgeawa-Johnson, and Thomas Huang, "HMM-Based Acoustic Event Detection with AdaBoost Feature Selection," Proc. CLEAR Evaluation and Workshop (Classification of Events, Activities, and Relationships), Baltimore, MD, May, 2007
  18. Huazhong Ning, Ming Liu, Hao Tang, and Thomas Huang, "A Spectral Learning Approach to Speaker Diarization," Proc. Ninth International Conference on Spoken Language Processing (ICSLP'06), Pittsburgh, PA, September, 2006
  19. Ming Liu, Hao Tang, Huazhong Ning, and Thomas Huang, "Person Identification Based on Multichannel and Multimodality Fusion," Proc. CLEAR Evaluation and Workshop (Classification of Events, Activities, and Relationships), Southampton, UK, April, 2006
  20. Hao Tang and Thomas Huang, "Improved Graphical Model for Audiovisual Object Tracking," Proc. 2006 IEEE International Conference on Multimedia & Expo (ICME'06), pp. 997 - 1000, Toronto, Canada, July, 2006
  21. Zhixiong Chen and Hao Tang, "Sparse Bayesian Approach to Classification," Proc. 2005 IEEE International Conference on Networking, Sensing and Control (ICNSC'05), pp. 914 - 917, Tucson, AZ, March, 2005
  22. Hao Tang, Bo Yin, Renhua Wang, "Study on Distributed Speech Synthesis System," Proc. 2003 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'03), pp. 732-735, Hongkong, Hong Kong, April, 2003
  23. Yingying Xu, Hao Tang, Peiren Zhang, "An Advanced Text-to-Speech Server System Based on SOAP Protocol," Proc. 2003 International Conference on Acoustics, Speech, and Signal Processing (ICASSP'03), pp. 728-731, Hongkong, Hong Kong, April, 2003
  24. Hao Tang, Bo Yin, Renhua Wang, "Design of Embedded Application Oriented Distributed Speech Synthesis System with High Naturalness," Proc. 2002 International Symposium on Chinese Spoken Language Processing (ISCSLP'02), pp. 76-79, Taipei, Taiwan, August, 2002
  25. Tao Chen, Qingfeng Liu, Hao Tang, Renhua Wang, "An Intelligent Chinese Speech Platform," Proc. Fourth National Conference on Intelligent Computer Interface and Applications, Taiyuan, China, July, 1999

Patents

  • Hao Tang and Bo Yin, "Data Exchange Format for Speech Synthesis System," Patent number 02148666.2, China, 2005
  • Hao Tang and Bo Yin, "A Distributed Speech Synthesis System," Patent number 02108890.X, China, 2002
  • Hao Tang and Bo Yin, "A Distributed Speech Synthesis Method," Patent number 02116017.1, China, 2002

Honors and Awards

  • Best Student Paper Award, International Conference on Pattern Recognition, 2008
  • National Scientific and Technological Progress Award, 2nd Prize, China, 2002 (conferred by The State Council of China)
  • Scientific and Technological Progress Award of Anhui Province, 1st Prize, China, 2000
  • Certificate of Scientific and Technological Achievement of Anhui Province, China, 2000

Professional Activities

  • Reviewer, IEEE Trans. on Pattern Analysis and Machine Intelligence (T-PAMI)
  • Reviewer, IEEE Trans. on Audio, Speech and Language Processing (T-ASLP)
  • Reviewer, IEEE Trans. on Circuits and Systems for Video Technology (T-CSVT)
  • Reviewer, International Journal of Image and Graphics
  • Reviewer, Neurocomputing
  • Reviewer, IEEE Signal Processing Letters
  • Student Member, Institute of Mathematical Statistics (IMS)
  • Student Member, IEEE

Computer Skills

  • Over 15 years of programming experience in C and C++
  • 5 years of work experience as a senior software engineer at a high-tech company
  • Daily ( professional level) use of Windows and UNIX/Linux operating systems
  • Primary languages & tools: C, C++, C++/CLI, Java, Matlab, Python, MFC, .NET, OpenGL, HTK, Attila (IBM), etc.

Graduate Courses

University of Illinois at Urbana-Champaign

  • A+  Random Processes
  • A    Image Processing
  • A    Computer Vision
  • A+  Fundamentals of Speech Processing
  • A    Pattern Recognition
  • A    Machine Learning
  • A+  Digital Signal Processing I
  • B+  Digital Signal Processing II
  • A    Image and Neuroimage Processing
  • A    Fundamentals of Engineering Acoustics
  • A    Electromagnetic Waves and Radiating Systems

Rutgers University

  • A   Digital Signals and Filters
  • A   Optimum Signal Processing
  • A   Digital Speech Processing
  • A   Advanced Topics in DSP
  • A   Computer Vision
  • A   Parallel and Distributed Computing
  • A   Data Structures and Algorithms
  • A   Numerical Analysis
  • A   Linear Algebra and Applications
  • A   Digital Spectral Analysis
  • A   Random Signals and Systems

References

      Provided upon request.