2011
Towards Peer-to-Peer Scheduling Architecture for the Czech National Grid
TÓTH, Šimon; Miroslav RUDA a Luděk MATYSKAZákladní údaje
Originální název
Towards Peer-to-Peer Scheduling Architecture for the Czech National Grid
Autoři
Vydání
Krakow, Poland, od s. 92-101, 2011
Nakladatel
Academic Computer Centre CYFRONET AGH
Další údaje
Typ výsledku
Stať ve sborníku
Utajení
není předmětem státního či obchodního tajemství
Forma vydání
tištěná verze "print"
Označené pro přenos do RIV
Ne
ISBN
978-83-61433-03-3
Příznaky
Mezinárodní význam, Recenzováno
Změněno: 9. 11. 2014 16:08, RNDr. Šimon Tóth
Anotace
Anglicky
The Czech National Grid Infrastructure MetaCentrum has been using a central scheduler infrastructure for approximately the past 10 years. This facilitated simple administration and direct support for large jobs running across several geographical sites. The knowledge of complete state allowed the scheduler to provide high quality decision making incorporating features like fairshare. On the other hand, this central setup created a single point of failure issue and also reached its scalability limits. In this paper we describe our work towards a new distributed architecture that maintains high scheduling quality while solving most of the single server issues. Our new distributed architecture provides both local autonomy and high scheduling quality. Users can still submit jobs locally even when cross-site connectivity is lost. Individual schedulers work primarily with their local server but still maintain global state, that allows them to mimic centralised scheduling features. The architecture still supports central accounting and fairshare across the entire grid. Implementation is based on the open-source Torque batch system, which replaced the previous commercial PBSPro central server installation. Torque provides a similar codebase as it has a common ancestor with PBSPro in OpenPBS. Torque therefore provides familiar interface for both users and developers.