Skip to content

Research Ideas

Open questions and unexplored directions using OGAL's 4.6M experiments.


10 Research Directions

Idea What OGAL Enables Key Files
1. Stopping point analysis Per-cycle metrics reveal when AL should stop _TS/weighted_f1-score.parquet
2. Strategy recommendation (meta-learning) Predict best strategy from dataset features _/<CATEGORIZER>/<DATASET>.npz
3. Sample selection patterns Which samples do strategies pick? When does it matter? selected_indices.csv.xz
4. Dataset difficulty taxonomy Cluster datasets by AL behavior plots/final_leaderboard/*.parquet
5. Hyperparameter sensitivity Which hyperparameters matter? Interactions? _TS/*.parquet + groupby
6. Failure mode discovery When/why do strategies fail? 05_failed_workloads.csv
7. Budget allocation Variable batch sizes during AL Per-cycle metrics
8. Early stopping prediction Can first 5 iterations predict final performance? Per-cycle CSVs
9. Learner-strategy interactions Do strategies behave differently with RF vs MLP vs SVM? Time series grouped by model
10. Cross-dataset generalization Which strategies transfer best? 92 datasets × 28 strategies

Quick Start Template

import pandas as pd

RESULTS_DIR = "/path/to/results"  # Should match OUTPUT_PATH in .server_access_credentials.cfg

# Load experiments
done = pd.read_csv(f"{RESULTS_DIR}/full_exp_jan/05_done_workload.csv")

# Load time series
ts = pd.read_parquet(f"{RESULTS_DIR}/full_exp_jan/_TS/full_auc_weighted_f1-score.parquet")

# Load leaderboard
lb = pd.read_parquet(f"{RESULTS_DIR}/full_exp_jan/plots/final_leaderboard/rank_sparse_zero_full_auc_weighted_f1-score.parquet")

# Your analysis here...

Citation

@misc{gonsior2025surveyactivelearninghyperparameters,
  title={{Survey of Active Learning Hyperparameters: Insights from a Large-Scale Experimental Grid}},
  author={Julius Gonsior and Tim Rie{\ss} and Anja Reusch and Claudio Hartmann and Maik Thiele and Wolfgang Lehner},
  year={2025},
  eprint={2506.03817},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2506.03817}
}

Dataset DOI: 10.25532/OPARA-862


Next Steps

Goal Page
Load the data Analyze OPARA
Run your own experiments Extend the Benchmark
Full pipeline & reproduction Reproduce & Run