The QCG-PilotJob system is designed to schedule and execute many small jobs inside one scheduling system allocation. Direct submission of a large group of jobs to a scheduling system can result in long aggregated time to finish as every single job is scheduled independently and waits in a queue. On the other hand, the submission of a group of jobs can be restricted or even forbidden by administrative policies defined on clusters. One can argue that there are available job array mechanisms in many systems, however, the traditional job array mechanism allows to run only a bunch of jobs having the same resource requirements while jobs being parts of larger workflows by nature vary in requirements and therefore need more flexible solutions.
- B. Bosak, T. Piontek, P. Karlshoefer, E. Raffin, J. Lakhlili, P. Kopta, “Verification, Validation and Uncertainty Quantification of Large-Scale Applications with QCG-PilotJob”, ICCS 2021, Lecture Notes in Computer Science, 12746 (2021), DOI: 10.1007/978-3-030-77977-1_39
- D. Groen, H. Arabnejad, V. Jancauskas, W. N. Edeling, F. Jansson, R. A. Richardson, J. Lakhlili, L. Veen, B. Bosak, P. Kopta, D. W. Wright, N. Monnier, P. Karlshoefer, D. Suleimenova, R. Sinclair, M. Vassaux, A. Nikishova, M. Bieniek, Onnie O. Luk, M. Kulczewski, E. Raffin, D. Crommelin, O. Hoenen, D. P. Coster, T. Piontek and P. V. Coveney, “VECMAtk: a scalable verification, validation and uncertainty quantification toolkit for scientific simulations”, Phil. Trans. R. Soc. A. 379, 20200221 (2021), DOI:10.1098/rsta.2020.0221
- D. W. Wright, R. A. Richardson, W. Edeling, J. Lakhlili, R. C. Sinclair, V. Jancauskas, D. Suleimenova, B. Bosak, M. Kulczewski, T. Piontek, P. Kopta, I. Chirca, H. Arabnejad, O. O. Luk, O. Hoenen, J. Weglarz, D. Crommelin, D. Groen, and P. V. Coveney, “Building confidence in simulation: Applications of EasyVVUQ”, Advanced Theory and Simulations, 1900246 (2020) DOI:10.1002/adts.201900246