10:00
Oral Session 7-GZ – Facial Expression Recognition
Chair: Kostas Karpouzis
10:00
25 mins
|
Evaluating AAM Fitting Methods for Facial Expression Recognition
Akshay Asthana, Jason Mora Saragih, Michael Wagner, Roland Goecke
Abstract: The human face is a rich source of information for the viewer and facial expressions are a major component in judging a person's affective state, intention and personality. Facial expressions are an important part of human-human interaction and have the potential to play an equally important part in human-computer interaction. This paper evaluates various Active Appearance Model (AAM) fitting methods, including both the original formulation as well as several state-of-the-art methods, for the task of automatic facial expression recognition. The AAM is a powerful statistical model for modelling and registering deformable objects. The results of the fitting process are used in a facial expression recognition task using a region-based intermediate representation related to Action Units, with the expression classification task realised using a support vector machine. Experiments are performed for both person-dependent and person-independent setups. Overall, the best facial expression recognition results were obtained by using the Iterative Error Bound Minimisation method, which consistently resulted in accurate face model alignment and facial expression recognition even when the initial face detection used to initialise the fitting procedure was poor.
|
10:25
25 mins
|
Get The FACS Fast: Automated FACS face analysis benefits from the addition of velocity
Timothy R Brick, Michael D Hunter, Jeffrey F Cohn
Abstract: Much progress has been made in automated facial image analysis, yet current approaches still lag behind what is possible using manual labeling of facial actions. While many factors may contribute, a key one may be the limited attention to dynamics of facial action. Most approaches classify frames in terms of either displacement from a neutral, mean face or, less frequently, displacement between successive frames (i.e. velocity). In the current paper, we evaluated the hypothesis that attention to dynamics can boost recognition rates. Using the well-known Cohn-Kanade database and support vector machines, adding velocity and acceleration decreased the number of incorrectly classified results by 14.2% and 11.2%, respectively. Average classification accuracy for the displacement and velocity classifier system across all classifiers was 90.2%. Findings were replicated using linear discriminant analysis, and found a mean decrease of 16.4% in incorrect classifications across classifiers. These findings suggest that information about the dynamics of a movement, that is, the velocity and to a lesser extent the acceleration of a change, can helpfully inform classification of facial expressions.
|
10:50
25 mins
|
Dynamic Cascades with Bidirectional Bootstrapping for Spontaneous Facial Action Unit Detection
Yunfeng Zhu, Fernando De la Torre, Jeffrey Cohn, Yujin Zhang
Abstract: Automatic facial action unit recognition from video has
been a long standing problem in facial expression analysis.
Several issues contribute to the challenge of this task.
These include non-frontal pose, moderate to large out-ofplane
head motion, large variability in the temporal scale
of facial gestures, and the exponential nature of possible
facial action combinations. Most popular methods pose the
AU recognition as a binary classification problem using different
features (e.g. appearance, shape) and classifiers (e.g.
Boosting, support vector machines).
Although existing systems have reported improved results,
an important and relative unexplored problem in AU
learning-based recognition systems is how to select the positive
and negative samples optimal for AU recognition. Typically,
the peak frames of the AUs are selected as positive
class and the negative samples are randomly selected from
other AUs. However, this approach might suffer from two
main drawbacks: (1) if there is a large number of training
samples, some classifiers such as Support Vector Machines
(SVMs) do not scale well with the number of samples (e.g.
O(n3) for the worse case in SVM), (2) given the manual
FACS coding, it is not clear how to choose the positive and
negative samples. If we only label the peaks as positive
samples, there will be a large imbalance between positive
and negative samples, especially for infrequent AU. On the
other hand, if all the frames from onset to offset are labeled
as positive, the number of false positive samples will increase.
In this paper, we propose the Dynamic Cascades
with Bidirectional Bootstrapping method to automatically
select the positive and negative class samples in training.
The results reveal improved results in the recognition performance
of important AUs on the RU-FACS Spontaneous
Expression Database.
|
|