Graduate Students
Ferda Ofli
Ph.D. Koc University, 2010
Advisor: Murat Tekalp, Yucel Yemez, Engin Erzin
Ferda Ofli. Learning Statistical Music-to-Dance Mappings for Choreography Synthesis. PhD thesis, Koc University, 2010.
“We propose many-to-many statistical mappings from music measures (music segments) to dance figures (dance segments) towards generating plausible music-driven dance choreographies. We assume that dance figures (dance segment boundaries) coincide with music measures (music segment boundaries).”
- F. Ofli, E. Erzin, Y. Yemez, and A.M. Tekalp. Multi-modal analysis of dance performances for music-driven choreography synthesis. In ICASSP’10, Dallas, USA, 2010.
- F. Ofli, E. Erzin, Y. Yemez, A.M. Tekalp, A.T. Erdem, C. Erdem, T. Abaci, and M. Ozkan. Un-supervised dance figure analysis from video for dancing avatar animation. In ICIP’08, San Diego, USA, 2008.
- F. Ofli, C. Canton-Ferrer, J. Tilmanne, Y. Demir, E. Bozkurt, Y. Yemez, E. Erzin, and A.M. Tekalp. Audio-driven human body motion analysis and synthesis. In ICASSP’08, Las Vegas, USA, 2008.
- F. Ofli, Y. Demir, E. Erzin, Y. Yemez, , and A. M. Tekalp. Multicamera audio-visual analysis of dance figures. In IEEE Int. Conf. on Multimedia Expo, ICME-2007., 2007.
- F. Ofli, Y. Demir, C. Canton-Ferrer, J. Tilmanne, K. Balcı, E. Bozkurt, I. Kızıloglu, Y. Yemez, E. Erzin, A.M. Tekalp, L. Akarun, and A.T. Erdem. Cok bakıslı isitsel-gorsel dans verilerinin analizi ve sentezi (analysis and synthesis of multiview audio-visual dance figures). In SIU’08, Didim, Turkey, 2008.
Elif Bozkurt
M.S. Koc University, 2010
Advisor: Engin Erzin
Elif Bozkurt. A Formant Position based Weighted Spectral Features for Spontaneous Emotion Recognition. Master’s thesis, Koc University, 2010.
“We present formant position based weighted Mel Frequency Cepstral Coefficient (WMFCC) features for the emotion recognition problem and compare performance results with commonly used feature sets. Since, the Line Spectral Frequency (LSF) features are positioned close to each other around formant frequencies, we propose normalized inverse harmonic mean function to weight critical band energies for the extraction of MFCC features.”
- E. Bozkurt, C. Eroglu Erdem, T. Erdem, and E. Erzin. Formant position based weighted spectral features for emotion recognition. Submitted to Speech Communication, 2010.
- E. Bozkurt, E. Erzin, C. Eroglu Erdem, and T. Erdem. Improving automatic emotion recognition from speech signals. In INTERSPEECH’09, UK, 2009.
- F. Ofli, Y. Demir, C. Canton-Ferrer, J. Tilmanne, K. Balcı, E. Bozkurt, I. Kızıloglu, Y. Yemez, E. Erzin, A.M. Tekalp, L. Akarun, and A.T. Erdem. Cok bakıslı isitsel-gorsel dans verilerinin analizi ve sentezi (analysis and synthesis of multiview audio-visual dance figures). In SIU’08, Didim, Turkey, 2008.
Emre Öztürk
M.S. Koc University, 2010
Advisor: Engin Erzin
Emre Ozturk. Driver status identification from driving behavior signals. Master’s thesis, Koc University, 2010.
“Driving behavior signals differ in how and under which conditions the driver use vehicle control units, such as pedals, driving wheel, etc. In this study we investigate how the drivingbehavior signals differ among drivers and among different driving tasks. ”
- E. Ozturk and E. Erzin. Driving status identification under different distraction conditions from driving behaviour signals. In 4th Biennial Workshop on DSP for In-Vehicle Systems and Safety,UTD, TX, USA, 2009.
Yasemin Demir
Ph.D. student at University of California, Berkeley
M.S. Koc University, 2008
Advisor: Engin Erzin
Yasemin Demir. Music - driven dance synthesis by multimodal dance performance analysis. Master’s thesis, Koc University, 2008.
“We present a framework for evaluation of audio feature and dance figure correlation for audio - visual analysis and synthesis of dance figures. Dance figures are performed synchronously with the musical rhythm.”
- Y. Demir, E. Erzin, Y. Yemez, and A. M. Tekalp. Evaluation of audio features for audio-visual analysis of dance figures. In EUSIPCO’08, Lausanne, Switzerland, 2008.
- F. Ofli, C. Canton-Ferrer, J. Tilmanne, Y. Demir, E. Bozkurt, Y. Yemez, E. Erzin, and A.M. Tekalp. Audio-driven human body motion analysis and synthesis. In ICASSP’08, Las Vegas, USA, 2008.
- F. Ofli, Y. Demir, E. Erzin, Y. Yemez, , and A. M. Tekalp. Multicamera audio-visual analysis of dance figures. In IEEE Int. Conf. on Multimedia Expo, ICME-2007., 2007.
- F. Ofli, Y. Demir, C. Canton-Ferrer, J. Tilmanne, K. Balcı, E. Bozkurt, I. Kızıloglu, Y. Yemez, E. Erzin, A.M. Tekalp, L. Akarun, and A.T. Erdem. Cok bakıslı isitsel-gorsel dans verilerinin analizi ve sentezi (analysis and synthesis of multiview audio-visual dance figures). In SIU’08, Didim, Turkey, 2008.
Emre Sargın
MTS at Google
Ph.D. student at University of California, Santa Barbara
M.S. Koc University, 2006
Advisor: Murat Tekalp, Yucel Yemez, Engin Erzin
Emre Sargın. Audio-visual correlation modeling for speaker identification and synthesis. Mas-
ter’s thesis, Koc University, 2006.
“This thesis addresses two major problems of multimodal signal processing using audiovisual correlation modeling: speaker recognition and speaker synthesis. We address the first problem, i.e., the audiovisual speaker recognition problem within an open-set identification framework, where audio (speech) and lip texture (intensity) modalities are fused employing a combination of early and late integration techniques.”
- M. E. Sargın, Y. Yemez, E. Erzin, and A. M. Tekalp. Analysis of head gesture and prosody patterns for prosody-driven head-gesture animation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006.
- M. E. Sargın, Y. Yemez, and A.M. Tekalp. Audio-visual synchronization and fusion using canonical correlation analysis. IEEE Transactions on Multimedia, 9(7):1396–1403, November 2007.
- M. E. Sargın, Y. Yemez, E. Erzin, and A. M. Tekalp. Prosody-driven head-gesture animation. In IEEE Int. Conf. on Acoustic, Speech, Signal Proc. (ICASSP’07), 2007.
Ulas Bagcı
Ph.D. student at University of Nottingham, UK
M.S. Koc University, 2005
Advisor: Engin Erzin
Ulas Bagcı. Boosting classifiers for automatic music genre classification. Master’s thesis, Koc
University, 2005.
“Music genre classification is an important tool for music information retrieval systems and has been finding important applications in various media platforms. Two important problems of the automatic music genre classification are feature extraction and classifier design.”
- U. Bagcı and E. Erzin. Automatic classification of musical genres using inter-genre similarity. IEEESignal Processing Letters, Vol. 14, No. 8, pp. 521-524, August 2007.
- U. Bagcı and E. Erzin. Boosting classifiers for music genre classification. In 20th InternationalSymposium on Computer and Information Sciences (ISCIS 2005), Berlin, 2005.
- U. Bagcı and E. Erzin. Muzik turlerinin sınıflanmasında benzer kesisim bilgileri uygulamaları. InSIU 2006, Antalya, 2006.
Ertan Cetingul
Ph.D. student at Johns Hopkins University, Baltimore
M.S. Koc University, 2005
Advisor: Murat Tekalp, Engin Erzin, Yucel Yemez
Ertan Cetingul. Discrimination analysis of lip motion features for multimodal speaker identification and speech-reading. Master’s thesis, Koc University, 2005.
“In this thesis a new multimodal speaker/speech recognition system that integrates audio, lip texture, lip geometry, and lip motion modalities is presented. There have been several studies that jointly use audio, lip intensity and/or lip geometry information for speaker identification and speech recognition applications.”
- H.E. Cetingul, E. Erzin, Y. Yemez, and Tekalp A.M. Multimodal speaker/speech recognition using lip motion, lip texture and audio. Signal Processing, Special Section: Multimodal Human-Computer Interfaces, 86:3549–3558, December 2006.
- H.E. Cetingul, E. Erzin, Y. Yemez, and Tekalp A.M. Discriminative analysis of lip motion features for speaker identification and speech-reading. IEEE Transactions on Image Processing, 15:2879 – 2891, October 2006.
- H.E. Cetingul, E. Erzin, Y. Yemez, and Tekalp A.M. Robust lip-motion features for speaker identification. In IEEE Int. Conf. on Acoustic, Speech and Signal Processing, Philadelphia, March 2005.
Alper Kanak
TUBITAK-UEKAE
M.S. Koc University, 2004
Advisor: Murat Tekalp, Engin Erzin, Yucel Yemez
Alper Kanak. Multimodal speaker identification with audio-video processing. Master’s thesis, Koc University, 2004.
“In this these we present a multimodal text=dependent speaker identification system. The objective is to improve the recognition performance over conventional unimodal or bimodal schemes.”
- A. Kanak, E. Erzin, Y. Yemez, and A.M. Tekalp. Speaker identification using multimodal audio-video processing. IEEE Int. Conf. on Image Processing, 2003.
- A. Kanak, E. Erzin, Y. Yemez, and A.M. Tekalp. Joint audio-video processing for biometric speaker identification. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, 2003.
