CogPilot Data Challenge 2.0 Description

Mission
Background

Figure 1. Data collection via physiological measurements recorded during immersive VR flight training simulations.
Figure 2. Flight simulations were designed to span four levels of difficulty. Shown here are screenshots from the easiest (left) and most difficult (right) levels.

Figure 3. Measures of Flight Performance Error shown here were summed and aggregated over each flight simulation run to obtain Cumulative Flight Performance.
CogPilot Data Challenge 2.0 Tasks
  • Task 1: Classify the difficulty level (1-4) of a flight simulation run using only physiological metrics. This task will be evaluated using F1 score and Area Under the ROC Curve (AUC) to assess classification accuracy between predicted and actual difficulty level.
  • Task 2: Estimate the pilot’s Cumulative Flight Performance during a flight simulation run using only physiological metrics. This task will be evaluated using a combination of Root Mean Squared Error, Pearson Correlation, and Spearman Correlation to assess accuracy of estimated Cumulative Flight Performance values relative to actual values.
Process
Adjudication
TaskEvaluation MetricRationale
Task 1: Classify Difficulty LevelArea Under the ROC Curve (AUC
  • Capture tradeoffs between True Positive & False Positive Rates

  • Identify optimal decision threshold for class prediction scores
F1 Score = 2 (precision * recall) / (precision + recall)
  • Capture precision and recall in one score

  • Represent decision threshold of the actual submitted model
Root Mean Squared Error (RMSE)
  • Measure absolute differences of predicted vs. truth
Task 2: Predict Performance ErrorPearson Correlation
  • Capture strength of relationship between predicted vs. truth
Spearman Correlation
  • Assess robustness to outlier influence