Collecting 3D Dynamic Face
Data
We
propose to implement a synchronized video/audio data capture system (camera/microphone
array). The main work includes camera/microphone synchronization and
multi-camera calibration.

Documents:
1.
Technical Report (1MB)
2.
Presentation
(10MB)
Data:
1.
3D foam head scanned by
CyberWare, used as the ground truth for reconstruction (5MB) [download]
2.
Checker board for camera
calibration (4.7MB) [download]
3.
Sample raw avi video (Bayer Pattern)
(28MB) [download]
4.
Sample rgb24 color avi video
(12MB) [download]
5.
Demo videos for facial marker
tracking [too big, provided upon request]
6.
etc.
Codes:
1.
Multi-camera Data Collection in
C++ [download]
Input:
Output: raw (Bayer Pattern) avi files
Feature: synchronization, compression (Microsoft RLE8)
2.
Color Conversion in C++ [download]
Input: raw (Bayer Pattern) avi file
Output: rgb24 color avi file (not compressed)
Feature: rawàrgb,
select different de-mosaic algorithms, same the raw frame
3.
3D ground truth labeling tool
in Matlab[download]
Input:
texture and range data from Cyberware Scanner,
Output:
3D coordinates of the points labeled by the user
Feature:
Interactive, remove the scanning noise
4.
2D facial marker tracking in
Matlab [download]
5.
3D reconstruction by stereo
triangulation in Matlab [download]
6.
etc.
Useful Links:
Point Grey Research Inc. http://www.ptgrey.com/
About Bayer Pattern: http://en.wikipedia.org/wiki/Bayer_pattern
Camera Calibration Toolbox from CalTech http://www.vision.caltech.edu/bouguetj/calib_doc/
References:
1. Y.X. Hu, H. Tang, T.S. Huang, Camera
and Microphone Array for 3D Audio/Visual Face Data Collection, The 33rd
International Conference on Acoustics, Speech, and Signal Processing
(ICASSP2008)
2. Luhong Liang, Yu Luo, Feiyue Huang, and Ara V
Nefian, "A multi-stream audio-video large-vocabulary Mandarin Chinese
speech database", IEEE International Conference on Multimedia and Expo
(ICME 2004) June, 2004, Taiwan, China
3. Rodney Goh, Lihao Liu, Xiaoming Liu, and Tsuhan
Chen,"The CMU Face In Action (FIA) Database",
in the Proceedings of the 2005 IEEE International Workshop on Analysis and
Modeling of Face and Gestures, in conjunction with the 2005 International
Conference of Computer Vision (ICCV '05), Beijing, China, October 2005.