# aaq **Repository Path**: mirrors_microsoft/aaq ## Basic Information - **Project Name**: aaq - **Description**: Code and Data artifact for Affective Air Quality: A Dataset of Gas Sensor Data during Emotional Experiences - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-09-25 - **Last Updated**: 2025-09-27 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Affective Air Quality (AAQ) This repository hosts the official code and data artifact for the "Affective Air Quality" dataset. Details about the user study and data collection can be found in our [paper][paper-link]. The dataset released it's the first to include gas sensing data emitted by humans while experiencing different emotions using a wearable device. More specifically, the detection of volatile organic compounds, nitrogen dioxide, carbon monoxide and alcohol while watching 12 emotional videos (3 per valance-arousal quadrant). It contains anonymized user data from a psychophysiological experiment focused on understanding how video-elicited emotions impact chemical breath composition (and exuded body gases in general). All data collected is fully anonymized. ### Ethics The study and prototype was approved by the Certified Review Board and Compliance at Microsoft. No changes were made to the study protocol after approval. ## Repository Contents 1. [Participant Data](#1-participant-data): intake form that includes demographics, personality, self-reported mood and sleep, as well as session notes. 2. [Interviews](#2-exit-interview): transcripts of the audio recordings for all semi-structured exit interviews. 3. [Gas Sensor Data](#3-gas-sensor-data): includes sensor readings from participants exposed to emotional videos for 33 minutes, with some experiencing a mint fresh breath strip in the last 5 minutes, for analyzing reactions via wearable and desktop gas sensors. It features raw gas sensor data, including NO2, ethyl alcohol, VOC, and CO levels. 4. [Emotional Ratings](#4-emotional-ratings): psychophysiological experiment outcomes, including participant responses to stimuli via valence, arousal, and familiarity ratings, enriched with detailed timestamps for video segments and associated gas sensor data for precise analysis, capturing both subjective ratings and objective timestamps across various study segments. ## 1. Participant Data ### Intake Survey `Qualtrics_Intake.csv` compiles responses from participants to the initial intake form, which includes inquiries on: (1) the timing and nature of their last consumed beverage and meal, (2) sleep quality via the Richards-Campbell Sleep Questionnaire, (3) personality traits through the short Big Five personality test, (4) current and general anxiety levels measured by the short State-Trait Anxiety Inventory (STAI), (5) mood states assessed by the short form of the International Positive and Negative Affect Schedule (I-PANAS-SF) and (6) emotional regulation skills via the ERQ Questionnaire. **Key Columns:** * `ID`: Participant ID Number, a unique numerical identifier for each participant. * Demographics: * `Q1`: Age. * `Q2`: Gender. * Last meal: * `Q3`: Timing and content of last caffeine or taurine intake. * `Q4`: Timing and content of last meal. * Richards-Campbell Sleep Questionnaire (RCSQ): * `Q5_1`: RCSQ question 1 - Sleep depth. * `Q6_1`: RCSQ question 2 - Time to fall asleep. * `Q7_1`: RCSQ question 3 - Time awake. * `Q8_1`: RCSQ question 4 - Woke up in night. * `Q9_1`: RCSQ question 5 - Sleep quality. * Big Five Inventory (BFI) short version: `Q11_1` - `Q11_12`. * `Q11_1` - `Q11_12`: Self-perceptions across various dimensions. * State-Trait Anxiety Inventory (STAI) short version: * `Q12_1` - `Q12_10`: Current feelings of anxiety (state). * `Q13_1` - `Q13_10`: General feelings of anxiety (trait). * Short Form International Positive and Negative Affect Schedule (I-PANAS-SF): * `Q14_1` - `Q14_10`: Present emotional state. * Emotion Regulation Questionnaire (ERQ): * `Q15_1` - `Q15_10`: Emotional regulation. ### Participant Logs `Participant_Logs.xlsx` contains a log of participants who took part in the experiment. It provides insights into each participant's session, notes, and details about the data collected. Key columns: - `Participant`: Participant ID number, a unique numerical identifier for each participant. - `Listerine`: Indicates if a Listerine strip was used in the participant's final video viewing. - `Status`: Status of the participant's session. - `Date`: Date of the session. - `Notes`: Observations or notes from the session (e.g., coughs). - `Data_Status`: Session data status. - `Sampling_Freq`: Precise sampling frequency for raw gas data. - `Breathing_Type`: Predominant breathing method/type (nose or mouth) during the session, as determined in the exit interview. ## 2. Exit Interview The `Interview Transcripts` folder includes each participant's exit interview transcript as `
_InterviewTranscript.txt` where `
` denotes the participant ID (34 in total). Due to initial synchronization issues in our recording software, data from 11 sensor sessions were lost from [gas sensor data](#3-gas-sensor-data), but their exit interviews were kept. Participant IDs: 708048, 29054, 150054, 174036, 176572, 381770, 523122, 547253, 675300, 831769, 871253. ## 3. Gas Sensor Data This folder presents timestamped sensor readings from 23 participants while watching approximately 33 minutes of different emotional videos (randomized). During the last approximately 5 minutes of the recording, half of the participants were randomly assigned to dissolve a Listerine (mint) fresh breath strip in their mouth. (See Emotional_Rating_Data for exact timestamps to segment the Gas Sensor Data.). The files feature gas sensor data from both the wearable headset and standalone desktop prototypes for each participant with the following format: `
_Gas-Data_Raw.csv` where `
` denotes the participant ID. Both prototypes are equipped with Winsen metal oxide gas sensors (GM102B, GM302B, GM502B, GM702B), with specific sensor readings identifiable by column names: ` _Gas-Data_Raw.csv`) segmentation. ` ` is the participant number and `