- •Stellingen
- •Propositions
- •List of Figures
- •List of Tables
- •1 Introduction
- •Introduction
- •Affect, emotion, and related constructs
- •Affective Computing: A concise overview
- •The closed loop model
- •Three disciplines
- •Human-Computer Interaction (HCI)
- •Health Informatics
- •Three disciplines, one family
- •Outline
- •2 A review of Affective Computing
- •Introduction
- •Vision
- •Speech
- •Biosignals
- •A review
- •Time for a change
- •3 Statistical moments as signal features
- •Introduction
- •Emotion
- •Measures of affect
- •Affective wearables
- •Experiment
- •Participants
- •Equipment and materials
- •Procedure
- •Data reduction
- •Results
- •Discussion
- •Comparison with the literature
- •Use in products
- •4 Time windows and event-related responses
- •Introduction
- •Data reduction
- •Results
- •Mapping events on signals
- •Discussion and conclusion
- •Interpreting the signals measured
- •Looking back and forth
- •5 Emotion models, environment, personality, and demographics
- •Introduction
- •Emotions
- •Modeling emotion
- •Ubiquitous signals of emotion
- •Method
- •Participants
- •International Affective Picture System (IAPS)
- •Digital Rating System (DRS)
- •Signal processing
- •Signal selection
- •Speech signal
- •Heart rate variability (HRV) extraction
- •Normalization
- •Results
- •Considerations with the analysis
- •The (dimensional) valence-arousal (VA) model
- •The six basic emotions
- •The valence-arousal (VA) model versus basic emotions
- •Discussion
- •Conclusion
- •6 Static versus dynamic stimuli
- •Introduction
- •Emotion
- •Method
- •Preparation for analysis
- •Results
- •Considerations with the analysis
- •The (dimensional) valence-arousal (VA) model
- •The six basic emotions
- •The valence-arousal (VA) model versus basic emotions
- •Static versus dynamic stimuli
- •Conclusion
- •IV. Towards affective computing
- •Introduction
- •Data set
- •Procedure
- •Preprocessing
- •Normalization
- •Baseline matrix
- •Feature selection
- •k-Nearest Neighbors (k-NN)
- •Support vector machines (SVM)
- •Multi-Layer Perceptron (MLP) neural network
- •Discussion
- •Conclusions
- •8 Two clinical case studies on bimodal health-related stress assessment
- •Introduction
- •Post-Traumatic Stress Disorder (PTSD)
- •Storytelling and reliving the past
- •Emotion detection by means of speech signal analysis
- •The Subjective Unit of Distress (SUD)
- •Design and procedure
- •Features extracted from the speech signal
- •Results
- •Results of the Stress-Provoking Story (SPS) sessions
- •Results of the Re-Living (RL) sessions
- •Overview of the features
- •Discussion
- •Stress-Provoking Stories (SPS) study
- •Re-Living (RL) study
- •Stress-Provoking Stories (SPS) versus Re-Living (RL)
- •Conclusions
- •9 Cross-validation of bimodal health-related stress assessment
- •Introduction
- •Speech signal processing
- •Outlier removal
- •Parameter selection
- •Dimensionality Reduction
- •k-Nearest Neighbors (k-NN)
- •Support vector machines (SVM)
- •Multi-Layer Perceptron (MLP) neural network
- •Results
- •Cross-validation
- •Assessment of the experimental design
- •Discussion
- •Conclusion
- •10 Guidelines for ASP
- •Introduction
- •Signal processing guidelines
- •Physical sensing characteristics
- •Temporal construction
- •Normalization
- •Context
- •Pattern recognition guidelines
- •Validation
- •Triangulation
- •Conclusion
- •11 Discussion
- •Introduction
- •Hot topics: On the value of this monograph
- •Applications: Here and now!
- •TV experience
- •Knowledge representations
- •Computer-Aided Diagnosis (CAD)
- •Visions of the future
- •Robot nannies
- •Digital Human Model
- •Conclusion
- •Bibliography
- •Summary
- •Samenvatting
- •Dankwoord
- •Curriculum Vitae
- •Publications and Patents: A selection
- •Publications
- •Patents
- •SIKS Dissertation Series
3.6 Data reduction
3.5.3 Procedure
After the subject was seated, the electrodes were attached to their chest, their fingers, and their face. Then, we checked the recording equipment and adjusted it if necessary. After a 5- minute rest period, the 16 video fragments were presented to the subject in pseudo-random order, so that positive and negative scenes were spread evenly over the session. Twelve subjects received that same pseudo-random order, though each started with a different scene in the list. The remaining 12 subjects were given the reverse pseudo-random order, again each starting with a different scene. We presented a plain blue screen for 120 seconds between two fragments, to allow the effects of the previous film fragment to fade out.
The entire viewing session lasted slightly over one hour, after which we removed the electrodes. Next, the subjects were asked to answer a few questions regarding each of the film fragments viewed. We deliberately did not ask these questions directly after each individual film fragment, since this would direct the participants’ attention to the questioned items in all subsequent viewing, which would have given the rest of the viewing session an unnatural character. In order to help them recall their feelings during the presentation of the film fragments, the participants were sequentially provided with representative print-outs of each fragment. For each film fragment, they were asked to rate, on a 7-point Likert scale, the intensity of positive feelings they had had while watching it, as well as the intensity of negative feelings, and the amount of arousal. With these three axes we expected to include the both axes of Russel’s valence-arousal model [372, 566, 647], as well as the possibility of mixed emotions [79, 92, 357, 379]. As we needed to present separate scales for positive and negative feelings in order to capture possible mixed emotions, we could not deploy the Self Assessment Mannequin (SAM) [372].
3.6 Data reduction
For each video fragment, we calculated the average positive rating as well as the average negative rating. Based on these averages, we could classify the fragments into 4 emotion classes: neutral, mixed, positive, and negative. In order to obtain an even distribution over emotion classes, we selected two fragments in each emotion category for further analysis. In each category, we chose the fragments with a duration closest to 120 seconds, so that time effects could more easily be compared (see Table 3.1). This resulted in the following set for further analysis: Color Bars and Abstract Figures (both ’neutral’, with both ratings below 2.5), The Bear and Tarzan (both ’positive’, with positive ratings above 5.0 and negative ratings below 2.0), Final Destination and Lion King (both ’mixed’, with both positive and negative ratings above 3.0), and Cry Freedom and Pink Flamingoes (both ’negative’, with negative ratings above 5.0 and positive ratings below 2.0).
49
3 Statistical moments as signal features
Not all biosignals data were fit for analysis: the EMG signals of 2 subjects were corrupted, probably due to loose contacts, and we decided not to include these data sets in further analyses. Moreover, for the same reason, the recordings of one subject, during the film scene of the “Pink flamingos”, were skipped. For the remaining 22 subjects, we processed the 4 biosignals to obtain the following measures: mean, absolute deviation, standard deviation, variance, skewness, and kurtosis.
Mean, absolute deviation and standard deviation are well-known dimensional quantities with the same units as the measured signal. Variance is also a frequently used parameter. The skewness and kurtosis, however, are expressed as non-dimensional quantities; see [197] for their introduction. [318] provide a comprehensive overview and a comparison of the skewness and kurtosis measures for both normal and non-normal distributed samples. In this overview, they state that it is suggested that “skewness and kurtosis should be viewed as ’vague concepts’, which can be formalized in many ways. Accordingly, many different definitions have been proposed.” For this research, we adopted the following descriptions: Skewness characterizes the degree of asymmetry of a distribution around its mean and kurtosis characterizes the relative peakedness and tail weight of a distribution.
Following the literature [81, 318, 534, 710], we define skewness and kurtosis for samples {x1, x2, . . . , xN } as:
1 2 |
N |
) = |
N j=1 |
|
|
|
3 |
(3.1) |
|
σ |
|||||||||
Skewness(x , x , . . . , x |
|
1 |
N |
xj − x |
|
||||
|
|
|
|
X |
|
|
|
|
|
and
|
|
|
1 N |
|
xj − |
|
4 |
|
|
|
|
Kurtosis(x , x , . . . , x |
|
) = |
|
x |
|
3 |
|
||||
N |
|
|
|
|
− |
(3.2) |
|||||
1 2 |
|
N j=1 |
σ |
|
|||||||
|
|
|
|
X |
|
|
|
|
|
|
|
with σ being the standard deviation and x being the mean of the data set. For a normal distribution, the third and fourth central moments are respectively 0 and 3 [197, 318]. Since our objective was to describe both skewness and kurtosis relative to that of a normal distribution, a correction of −3 was applied for kurtosis, as is often done.
3.7 Results
For each of the six statistical parameters, for each film fragment, and for each subject, the complete EDA and EMG signals were processed over the last 120 seconds of the film fragment. The duration of 120 seconds was chosen because it was available for the majority of the scenes. Two film fragments were shorter than that, and for them we included measurements taken during the blue screen following it in order to add up to a section of 120 seconds
50
