Audio-visual imposture

Walid Karam; Chafic Mokbel; Hanna Greige; Gerard Chollet

doi:10.1117/12.665707

2 May 2006 Audio-visual imposture

Walid Karam, Chafic Mokbel, Hanna Greige, Gerard Chollet

Proceedings Volume 6250, Mobile Multimedia/Image Processing for Military and Security Applications; 62500B (2006) https://doi.org/10.1117/12.665707
Event: Defense and Security Symposium, 2006, Orlando (Kissimmee), Florida, United States

Abstract

A GMM based audio visual speaker verification system is described and an Active Appearance Model with a linear speaker transformation system is used to evaluate the robustness of the verification. An Active Appearance Model (AAM) is used to automatically locate and track a speaker's face in a video recording. A Gaussian Mixture Model (GMM) based classifier (BECARS) is used for face verification. GMM training and testing is accomplished on DCT based extracted features of the detected faces. On the audio side, speech features are extracted and used for speaker verification with the GMM based classifier. Fusion of both audio and video modalities for audio visual speaker verification is compared with face verification and speaker verification systems. To improve the robustness of the multimodal biometric identity verification system, an audio visual imposture system is envisioned. It consists of an automatic voice transformation technique that an impostor may use to assume the identity of an authorized client. Features of the transformed voice are then combined with the corresponding appearance features and fed into the GMM based system BECARS for training. An attempt is made to increase the acceptance rate of the impostor and to analyzing the robustness of the verification system. Experiments are being conducted on the BANCA database, with a prospect of experimenting on the newly developed PDAtabase developed within the scope of the SecurePhone project.

Citation Download Citation

Walid Karam, Chafic Mokbel, Hanna Greige, and Gerard Chollet "Audio-visual imposture", Proc. SPIE 6250, Mobile Multimedia/Image Processing for Military and Security Applications, 62500B (2 May 2006); https://doi.org/10.1117/12.665707

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
11 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Speaker recognition

Video

Feature extraction

3D modeling

Solid modeling

Facial recognition systems

Expectation maximization algorithms

Show All Keywords

Keywords/Phrases

Search In:

Publication Years