D 2023

Using Kubernetes in Academic Environment : Problems and Approaches

SPIŠAKOVÁ, Viktória, Dalibor KLUSÁČEK and Lukáš HEJTMÁNEK

Basic information

Original name

Using Kubernetes in Academic Environment : Problems and Approaches

Authors

SPIŠAKOVÁ, Viktória (703 Slovakia, belonging to the institution), Dalibor KLUSÁČEK (203 Czech Republic) and Lukáš HEJTMÁNEK (203 Czech Republic, belonging to the institution)

Edition

1. vyd. Cham (Switzerland), Job Scheduling Strategies for Parallel Processing, p. 235-253, 19 pp. 2023

Publisher

Springer

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10200 1.2 Computer and information sciences

Country of publisher

Switzerland

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

printed version "print"

Impact factor

Impact factor: 0.402 in 2005

RIV identification code

RIV/00216224:14610/23:00130019

Organization unit

Institute of Computer Science

ISBN

978-3-031-22697-7

ISSN

UT WoS

000972597400013

Keywords in English

cloud;HPC;scheduling;Kubernetes;resource management

Tags

Tags

International impact, Reviewed
Změněno: 8/4/2024 09:20, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

In this work, we discuss our experience when utilizing the Kubernetes orchestrator (K8s) to efficiently allocate resources in a heterogeneous and dynamic academic environment. In the commercial world, the "pay per use" model is a strong regulating factor for efficient resource usage. In the academic environment, resources are usually provided "for free" to the end-users, thus they often lack a clear motivation to plan their use efficiently. In this paper, we show three major sources of inefficiencies. One is the users' requirement to have interactive computing environments, where the users need resources for their application as soon as possible. Users do not appreciate waiting for interactive environments, but constantly keeping some resources available for interactive tasks is inefficient. The second phenomenon is observable in both interactive and batch workloads; users tend to overestimate necessary limits for their computations, thus wasting resources. Finally, Kubernetes does not support fair-sharing functionality (dynamic user priorities) which hampers the efforts when developing a fair scheme for Pod/job scheduling and/or eviction. We discuss various approaches to deal with these problems such as scavenger jobs, placeholder jobs, Kubernetes-specific resource allocation policies, separate clusters, priority classes, and novel hybrid cloud approach. We also show that all these proposals open interesting scheduling-related questions that are hard to answer with existing Kubernetes tools and policies. Last but not least, we provide a real workload trace from our installation to the scheduling community which captures these phenomena.

Links

LM2018140, research and development project
Name: e-Infrastruktura CZ (Acronym: e-INFRA CZ)
Investor: Ministry of Education, Youth and Sports of the CR