ICSE 2025
Sat 26 April - Sun 4 May 2025 Ottawa, Ontario, Canada
Fri 2 May 2025 11:30 - 11:45 at 212 - AI for Analysis 4 Chair(s): Maliheh Izadi, Ali Al-Kaswan, Jonathan Katzy

As Large-Scale Cloud Systems (LCS) become increasingly complex, effective anomaly detection is critical for ensuring system reliability and performance. However, there is a shortage of large-scale, real-world datasets available for benchmarking anomaly detection methods.

To address this gap, we introduce a new high-dimensional dataset from IBM Cloud, collected over 4.5 months from the IBM Cloud Console. This dataset comprises 39,365 rows and 117,448 columns of telemetry data. Additionally, we demonstrate the application of machine learning models for anomaly detection and discuss the key challenges faced in this process.

This study and the accompanying dataset provide a resource for researchers and practitioners in cloud system monitoring. It facilitates more efficient testing of anomaly detection methods in real-world data, helping to advance the development of robust solutions to maintain the health and performance of large-scale cloud infrastructures.

Fri 2 May

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:30
AI for Analysis 4Research Track / New Ideas and Emerging Results (NIER) / SE In Practice (SEIP) at 212
Chair(s): Maliheh Izadi Delft University of Technology, Ali Al-Kaswan Delft University of Technology, Netherlands, Jonathan Katzy Delft University of Technology
11:00
15m
Talk
RepairAgent: An Autonomous, LLM-Based Agent for Program RepairArtifact-FunctionalArtifact-AvailableArtifact-Reusable
Research Track
Islem BOUZENIA University of Stuttgart, Prem Devanbu University of California at Davis, Michael Pradel University of Stuttgart
Pre-print
11:15
15m
Talk
Evaluating Agent-based Program Repair at Google
SE In Practice (SEIP)
Patrick Rondon Google, Renyao Wei Google, José Pablo Cambronero Google, USA, Jürgen Cito TU Wien, Aaron Sun Google, Siddhant Sanyam Google, Michele Tufano Google, Satish Chandra Google, Inc
11:30
15m
Talk
Anomaly Detection in Large-Scale Cloud Systems: An Industry Case and DatasetArtifact-AvailableArtifact-FunctionalArtifact-Reusable
SE In Practice (SEIP)
Mohammad Saiful Islam Toronto Metropolitan University, Toronto, Canada, Mohamed Sami Rakha Toronto Metropolitan University, Toronto, Canada, William Pourmajidi Toronto Metropolitan University, Toronto, Canada, Janakan Sivaloganathan Toronto Metropolitan University, Toronto, Canada, John Steinbacher IBM, Andriy Miranskyy Toronto Metropolitan University (formerly Ryerson University)
Pre-print
11:45
15m
Talk
Crash Report Prioritization for Large-Scale Scheduled Launches
SE In Practice (SEIP)
Nimmi Rashinika Weeraddana University of Waterloo, Sarra Habchi Ubisoft Montréal, Shane McIntosh University of Waterloo
12:00
15m
Talk
LogLM: From Task-based to Instruction-based Automated Log Analysis
SE In Practice (SEIP)
Yilun Liu Huawei co. LTD, Yuhe Ji Huawei co. LTD, Shimin Tao University of Science and Technology of China; Huawei co. LTD, Minggui He Huawei co. LTD, Weibin Meng Huawei co. LTD, Shenglin Zhang Nankai University, Yongqian Sun Nankai University, Yuming Xie Huawei co. LTD, Boxing Chen Huawei Canada, Hao Yang Huawei co. LTD
Pre-print
12:15
7m
Talk
Using ML filters to help automated vulnerability repairs: when it helps and when it doesn’tSecurity
New Ideas and Emerging Results (NIER)
Maria Camporese University of Trento, Fabio Massacci University of Trento; Vrije Universiteit Amsterdam
Pre-print
OSZAR »