Low Query Budget Active Learning for Classification and Regression
B. Jaster, A. Tharwat, E.M. Sheikh, M. Kohlhase, W. Schenck, in: I. Koprinska, J. Mendes-Moreira, P. Branco (Eds.), Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV, Springer Nature Switzerland, Cham, 2026, pp. 5–21.
Download (ext.)
Konferenzbeitrag
| Veröffentlicht
| Englisch
Autor*in
Herausgeber*in
Koprinska, Irena;
Mendes-Moreira, João;
Branco, Paula
Abstract
The labeling process for supervised learning is costly and time-consuming, and is often impractical to scale due to real-world constraints. Active learning (AL) addresses this challenge by strategically selecting representative and informative data points to reduce labeling efforts. This paper focuses on an AL scenario in which only a very limited number of labels can be acquired. We propose an algorithm operating in two phases: (1) an exploration phase that prioritizes representative and diverse data points using density-driven criteria, and (2) an exploitation phase that combines predictive uncertainty with density weighting to select informative samples from densely populated regions. This enhances both representativeness and informativeness. Our results demonstrate significant improvements in model quality compared to other algorithms typically employed for this scenario, across various scenarios involving imbalanced data in classification tasks and skewness in regression tasks. Through this work, we aim to provide a new algorithm for this scenario and investigate general principles for AL. While most AL studies focus on either classification or regression, our work applies the algorithms to both. Therefore, we can analyze the differences between classification and regression problems and their effects on AL strategies. Furthermore, we explore different categories of AL criteria and their effectiveness in the low-budget regime. These results also provide insight into the cold-start problem, which involves selecting an initial labeled set and is faced by many model-based AL methods.
Erscheinungsjahr
Titel des Konferenzbandes
Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV
Seite
5-21
Konferenz
ECML PKDD 2025
Konferenzort
Porto, Portugal
Konferenzdatum
2025-09-15 – 2025-09-19
ISBN
ISSN
eISSN
FH-PUB-ID
Zitieren
Jaster, Bjarne ; Tharwat, Alaa ; Sheikh, Eiram Mahera ; Kohlhase, Martin ; Schenck, Wolfram: Low Query Budget Active Learning for Classification and Regression. In: Koprinska, I. ; Mendes-Moreira, J. ; Branco, P. (Hrsg.): Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV, Communications in Computer and Information Science. Cham : Springer Nature Switzerland, 2026, S. 5–21
Jaster B, Tharwat A, Sheikh EM, Kohlhase M, Schenck W. Low Query Budget Active Learning for Classification and Regression. In: Koprinska I, Mendes-Moreira J, Branco P, eds. Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV. Communications in Computer and Information Science. Cham: Springer Nature Switzerland; 2026:5-21. doi:10.1007/978-3-032-19105-2_1
Jaster, B., Tharwat, A., Sheikh, E. M., Kohlhase, M., & Schenck, W. (2026). Low Query Budget Active Learning for Classification and Regression. In I. Koprinska, J. Mendes-Moreira, & P. Branco (Eds.), Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV (pp. 5–21). Cham: Springer Nature Switzerland. https://doi.org/10.1007/978-3-032-19105-2_1
@inproceedings{Jaster_Tharwat_Sheikh_Kohlhase_Schenck_2026, place={Cham}, series={Communications in Computer and Information Science}, title={Low Query Budget Active Learning for Classification and Regression}, DOI={10.1007/978-3-032-19105-2_1}, booktitle={Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV}, publisher={Springer Nature Switzerland}, author={Jaster, Bjarne and Tharwat, Alaa and Sheikh, Eiram Mahera and Kohlhase, Martin and Schenck, Wolfram}, editor={Koprinska, Irena and Mendes-Moreira, João and Branco, PaulaEditors}, year={2026}, pages={5–21}, collection={Communications in Computer and Information Science} }
Jaster, Bjarne, Alaa Tharwat, Eiram Mahera Sheikh, Martin Kohlhase, and Wolfram Schenck. “Low Query Budget Active Learning for Classification and Regression.” In Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV, edited by Irena Koprinska, João Mendes-Moreira, and Paula Branco, 5–21. Communications in Computer and Information Science. Cham: Springer Nature Switzerland, 2026. https://doi.org/10.1007/978-3-032-19105-2_1.
B. Jaster, A. Tharwat, E. M. Sheikh, M. Kohlhase, and W. Schenck, “Low Query Budget Active Learning for Classification and Regression,” in Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV, Porto, Portugal, 2026, pp. 5–21.
Jaster, Bjarne, et al. “Low Query Budget Active Learning for Classification and Regression.” Machine Learning and Principles and Practice of Knowledge Discovery in Databases. International Workshops of ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Revised Selected Papers, Part IV, edited by Irena Koprinska et al., Springer Nature Switzerland, 2026, pp. 5–21, doi:10.1007/978-3-032-19105-2_1.
Closed Access