MSR 2026
Mon 13 - Tue 14 April 2026 Rio de Janeiro, Brazil
co-located with ICSE 2026

In the digital era, accidental exposure of sensitive information such as API keys, tokens, and credentials is a growing security threat. While most prior work focuses on detecting secrets in source code, leakage in software issue reports remains largely unexplored. This study fills that gap through a large-scale analysis and a practical detection pipeline for exposed secrets in GitHub issues. Our pipeline combines regular expression–based extraction with large language model (LLM)–based contextual classification to detect real secrets and reduce false positives. We build a benchmark of 54,148 instances from public GitHub issues, including 5,881 manually verified true secrets. Using this dataset, we evaluate entropy-based baselines and keyword heuristics used by prior secret detection tools, classical machine learning, deep learning, and LLM-based methods. Regex and entropy based approaches achieve high recall but poor precision, while smaller models such as RoBERTa and CodeBERT greatly improve performance (F1 = 92.70%). Proprietary models like GPT-4o perform moderately in few-shot settings (F1 = 80.13%), and fine-tuned open-source larger LLMs such as Qwen and LLaMA reach up to 94.49% F1. Finally, we also validate our approach on 178 real-world GitHub repositories, achieving an F1-score of 81.6% which demonstrates our approach’s strong ability to generalize to in-the-wild scenarios.

Tue 14 Apr

Displayed time zone: Brasilia, Distrito Federal, Brazil change

11:00 - 12:30
Session 1-B: Maintenance, Evolution & ProcessesTechnical Papers / MSR Program at Oceania IV
Chair(s): Gregorio Robles Universidad Rey Juan Carlos
11:00
10m
Talk
Source Code Hotspots: A Diagnostic Method for Quality Issues
Technical Papers
Saleha Muzammil University of Virginia, Mughees Ur Rehman Virginia Tech, Zoe Kotti AUEB & DeepSea Technologies, Diomidis Spinellis AUEB & TU Delft
Pre-print
11:10
10m
Talk
The Value of Effective Pull Request Description
Technical Papers
Shirin Pirouzkhah University of Zurich, Pavlina Wurzel Goncalves University of Zurich, Alberto Bacchelli IfI, University of Zurich
Pre-print
11:20
10m
Talk
How do third-party Python libraries use type annotations?
Technical Papers
Eric Asare New York University Abu Dhabi, Sarah Nadi New York University Abu Dhabi
Pre-print
11:30
10m
Talk
Coordination at Scale in Large Distributed Development: The Case of Kubernetes
Technical Papers
Sabrina Aufiero University College London (UCL), Matteo Vaccargiu University of Cagliari, Silvia Bartolucci University College London, Fabio Caccioli University College London (UCL), Giuseppe Destefanis University College London
11:40
10m
Talk
Combining Example-Based and Rule-Based Program Transformations to Resolve Build Conflicts
Technical Papers
Sheikh Shadab Towqir Virginia Tech, Fei He Tsinghua University, Todd Mytkowicz Google, Na Meng Virginia Tech
Pre-print
11:50
10m
Talk
Mining Quantum Software Patterns in Open-Source Projects
Technical Papers
Neilson Carlos Leite Ramalho Universidade de São Paulo, Erico Augusto Da Silva Universidade de São Paulo, Higor Amario de Souza University of São Paulo, Marcos Lordello Chaim University of São Paulo
12:00
10m
Talk
Analyzing Dependency Distribution Changes Arising from Code Smell InteractionsVirtual Attendance
Technical Papers
Zushuai Zhang University of Auckland, Elliott Wen , Ewan Tempero The University of Auckland
Pre-print Media Attached File Attached
12:10
10m
Talk
Evolving Kubernetes: A Technical Debt Perspective
Technical Papers
Jesse Maarleveld University of Groningen, Giuseppe Destefanis University College London, Daniel Feitosa University of Groningen
12:20
10m
Talk
Secret Leak Detection in Software Issue Reports using LLMs: A Comprehensive Evaluation
Technical Papers
Sadif Ahmed Bangladesh University of Engineering and Techonology, Md Nafiu Rahman Bangladesh University of Engineering and Technology, Zahin Wahab The University of British Columbia, Gias Uddin York University, Canada, Rifat Shahriyar Bangladesh University of Engineering and Technology Dhaka, Bangladesh
Pre-print Media Attached File Attached