GPU clusters underpin modern deep learning, yet studies across industry and academia consistently report widespread GPU underutilization. Prior work and our own analysis indicate that inefficiency often stems from recurring patterns in code, job scripts, and runtime behaviour that users rarely detect on their own. We argue that addressing this issue is an emerging challenge for software engineering (SE) and mining software repositories (MSR) research: it requires mining inefficiency patterns, combining static and dynamic signals into actionable feedback, validating job-submission artefacts, and building privacy-aware datasets that link code, configuration, and runtime metrics. Together, these directions point toward user-centred tools that prevent underutilization before jobs ever reach the queue.
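To make the idea of validating job-submission artefacts concrete, the following is a minimal, hypothetical sketch (not an implementation from this work) of a static check over a SLURM-style batch script: it flags one recurring inefficiency pattern, where the scheduler allocates more GPUs than the job actually exposes to the process via `CUDA_VISIBLE_DEVICES`. The directive names and regexes are illustrative assumptions.

```python
import re

def check_gpu_request(script_text: str) -> list[str]:
    """Hypothetical linter rule: warn when a job script requests more
    GPUs from the scheduler than it makes visible to the process,
    leaving the remainder allocated but idle."""
    warnings = []
    # Illustrative pattern for a SLURM GPU request, e.g. "#SBATCH --gres=gpu:4".
    m = re.search(r"#SBATCH\s+--gres=gpu:(\d+)", script_text)
    requested = int(m.group(1)) if m else 0
    # Illustrative pattern for a device mask, e.g. "CUDA_VISIBLE_DEVICES=0,1".
    v = re.search(r"CUDA_VISIBLE_DEVICES=([\d,]*)", script_text)
    if v is not None:
        visible = len([d for d in v.group(1).split(",") if d])
        if requested > visible:
            warnings.append(
                f"script requests {requested} GPU(s) but exposes only "
                f"{visible} via CUDA_VISIBLE_DEVICES; the rest will sit idle"
            )
    return warnings
```

A check of this kind runs before submission, which is what makes the feedback preventive rather than a post-hoc utilization report; a real tool would combine many such static rules with runtime signals mined from past jobs.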