MLCS 2025 - The seventh Workshop on Machine Learning for CyberSecurity

About MLCS 2025

Short description

The last decade has been a critical one regarding cybersecurity, with studies predicting the worldwide cost of global cybercrime damage to hit 0.5 trillion dollars annually by 2025. Detecting, analysing, and defending against threats in (near) real-time conditions is impossible without employing machine learning techniques and big data infrastructures. This gives rise to cyberthreat intelligence and analytic solutions, such as (informed) machine learning on big data and open-source intelligence, to perceive, reason, learn, and act against cyber adversary techniques and actions. Moreover, organisations’ security analysts have to manage and protect systems and deal with the privacy and security of all personal and institutional data under their control. This workshop aims to provide researchers with a forum to exchange and discuss scientific contributions, open challenges and recent achievements in machine learning and their role in developing secure systems. If it will be considered, as in the previous years, we would join an LNCS proceedings volume.

Relevance to the Machine Learning Community

Cybersecurity is of the utmost importance for computing systems. The ethics guidelines for trustworthy artificial intelligence authored by the European Commission’s Independent High Level Expert Group on Artificial Intelligence on April 2019 have highlighted that machine learning-based artificial intelligence developments in various fields, including cybersecurity, are improving the quality of our lives every day, that AI systems should be resilient to attacks and security, and that they should consider security-by-design principles.

Due to the scale and complexity of current systems, it is a permanent and growing concern in industry and academia. On the one hand, the volume and diversity of functional and non-functional data, including open source information, along with increasingly dynamic operating environments, create additional obstacles to the security of systems and the privacy and security of data. On the other hand, it creates an information-rich environment that, leveraged by techniques in the crossing of modern machine learning, data science and visualisation fields, will contribute to improving systems and data security and privacy. This poses significant, industry-relevant challenges to the machine learning and cybersecurity communities, as the main problems arise in contexts of dynamic operating environments and unexpected operating conditions, motivating the demand for production-ready systems able to improve and adaptively maintain the security of computing systems as well as the security and privacy of data.

Based on the recent history, we plan to organise this workshop as a European forum for cybersecurity researchers and practitioners who wish to discuss the recent developments of machine learning for developing cybersecurity, by paying special attention to solutions rooted in adversarial learning, pattern mining, neural networks and deep learning, probabilistic inference, anomaly detection, stream learning and mining, and big data analytics.

Motivation

The last decade has been a critical one regarding cybersecurity, with studies predicting that the worldwide cost of global cybercrime damage will hit 0.5 trillion dollars annually by 2025. Cyberthreats have increased dramatically, exposing sensitive personal and business information, disrupting critical operations and imposing high economic costs. The number, frequency, and sophistication of threats will only increase and will become more targeted in nature. Furthermore, today’s computing systems operate under increasing scales and dynamic environments, ingesting and generating more and more functional and non-functional data. Detecting, analysing, and defending against threats in (near) real-time conditions is impossible without employing machine learning techniques and big data infrastructure. This gives rise to cyber threat intelligence and analytic solutions, such as (informed) machine learning on big data and open-source intelligence, to perceive, reason, learn, and act against cyber adversary techniques and actions. Moreover, organisations’ security analysts must manage and protect these systems and deal with the privacy and security of all personal and institutional data under their control. This calls for tools and solutions combining the latest advances in data science, visualisation, and machine learning. We strongly believe that the significant advance of state-of-the-art machine learning over the last few years has not been fully exploited to harness the potential of available data for the benefit of systems, data security, and privacy. While machine learning algorithms have already proven beneficial for the cybersecurity industry, they have also highlighted several shortcomings that impact their reliability and safety. As proved multiple times in the literature, machine learning models are vulnerable to adversarial machine learning attacks, which cause target algorithms to misbehave, provide unethical answers to users’ prompts, or reveal sensitive information about their inner workings. As machine learning-based capabilities become incorporated more frequently into cyber assets, the urgency of understanding these threats' impact and mitigations rises exponentially. On the other hand, the community must not ignore that it is still challenging to mine, interpret and extract knowledge from security-related data harvested from multiple sources and encoded in different formats. Hence, the need for learning techniques that can provide accurate predictions while withstanding adversarial attacks is of paramount importance. On the other side, although the priority of machine learning methods today is to perform accurate detection strengthening their robustness to adversarial attacks, explainability of security systems has recently emerged as a very active research field. Explaining the effect of certain data features on security decisions can contribute to allowing security systems to benefit better from the trust of security stakeholders.

Goals

The workshop follows the success of the four previous editions( MLCS 2019, MLCS 2020, MLCS 2021, MLCS 2022, MLCS 2023, and MLCS 2024) co-located with ECML-PKDD. In all the previous editions, the workshop gained strong interest, with attendance of between 20 and 30 participants, lively post-presentation discussions, amazing invited talks in all the editions and a vibrant panel discussion in the 2019 and 2021 editions. We strive to provide researchers with a forum to exchange and discuss scientific contributions and open challenges, both theoretical and practical, related to using machine-learning approaches in cybersecurity. We want to foster joint work and knowledge exchange in the cybersecurity community by increasing the connection between researchers and practitioners from not only the machine learning area but also experts from the big data, data science, and visualisation areas. The workshop shall provide a forum for discussing novel trends and achievements in machine learning and their role in developing secure systems. We want to highlight the latest research trends in machine learning, privacy of data, big data, deep learning, incremental and stream learning, and adversarial machine learning. In particular, the goal of this workshop is to promote the application of these emerging machine learning techniques to cybersecurity by measuring how they improve the field with their predictive capabilities.

The workshop shall contribute to identifying new application areas as well as open and future research problems related to the application of machine learning in the cybersecurity field.

Call for papers

MLCS welcomes both research papers reporting results from mature work, recently published work, as well as more speculative papers describing new ideas or preliminary exploratory work. Papers reporting industry experiences and case studies will also be encouraged. However, it should be noticed that papers based on recently published work will not be considered for publication in the proceedings.

Workshop Topics

All topics related to the contribution of machine learning approaches to the security of organisations’ systems and data are welcome. These include, but are not limited to:

Machine learning for:

the security and dependability of networks, systems, and software
open-source threat intelligence and cybersecurity situational awareness
data security and privacy
cybersecurity forensic analysis
the development of smarter security control
the fight against (cyber)crime, e.g., biometrics, audio/image/video analytics
vulnerability analysis
the analysis of distributed ledgers
malware, anomaly, intrusion and spam detection

Adversarial machine learning and the robustness of AI models against malicious actions
Interpretability and Explainability of machine learning models in cybersecurity
Privacy preserving machine learning
Trusted machine learning
Data-centric security
Scalable / big data approaches for cybersecurity
Deep learning for automated recognition of novel threats
Graph representation learning in cybersecurity
Continuous and one-shot learning
Informed machine learning for cybersecurity
User and entity behavior modeling and analysis

Paper submission

MLCS welcomes both research papers reporting results from mature work, recently published work, as well as more speculative papers describing new ideas or preliminary exploratory work. Papers reporting industry experiences and case studies will also be encouraged. However, it should be noted that papers based on recently published work will not be considered for publication in the proceedings.

Submissions are accepted in two formats:

Regular research papers with 12 to 16 pages including references. To be published in the proceedings, research papers must be original, not published previously, and not submitted concurrently elsewhere.
Short research statements of at most 6 pages. Research statements aim at fostering discussion and collaboration. They may review research published previously or outline new emerging ideas.

All submissions should be made in PDF using the Microsoft CMT and must adhere to the Springer LNCS style.

Based on the quality and number of accepted regular papers, regular workshop papers (except papers reporting recently published work or preliminary work) will be “tentatively” published in the workshop post-proceedings.

At least one author of each accepted paper must have a full registration and be in Vilnius to present the paper . Papers without a full registration or in-presence presentation won't be included in the post-workshop Springer proceedings.

To submit your paper, kindly refer to the instructions provided on the Microsoft Conference Management Tool (CMT) platform. You can access these instructions by visiting the following link: https://cmt3.research.microsoft.com/docs/help/author/author- submission-form.html. Once on the platform, utilize the filter option to search for the "European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - Workshop and Tutorial Track". Then, select "Machine Learning for Cybersecurity (MLCS 2025)" under the "Create new submission" tab. Alternatively, you may use this link to access the submission page directly.

Organizing Committee

Web and publicity chairs

Nuno Dionísio

Universidade de Lisboa
Faculdade de Ciências
LASIGE
Portugal

Samaneh Shafee

Universidade de Lisboa
Faculdade de Ciências
LASIGE
Portugal

Program Committee

Alan Oliveira, University of Lisbon, Portugal

Altair Santin, Pontifical Catholic University of Parana, Brazil

Al-Essa Malik , Al-Ahliyya Amman University, Jordan

Annalisa Appice, University of Bari Aldo Moro, Italy

Antonio Pecchia, University of Sannio, Italy

Bruno Zarpelao, State University of Londrina, Brazil

Francesco Mercaldo, University of Molise, Italy

Gianluigi Folino, ICAR-CNR, Italy

Giorgio Piras, University of Cagliari, Italy

Giovanni Apruzzese , University of Liechtenstein, Liechtenstein

Leonardo Aniello, University of Southampton, United Kingdom

Maura Pintor , University of Cagliari, Italy

Nuno Neves, University of Lisbon, Portugal

Tommaso Zoppi, University of Florence, Italy

Vera Rimmer, Katholieke Universiteit Leuven, Belgium

Program

Keynote speaker

Maura Pintor

University of Cagliari
Italy

Maura Pintor is an Assistant Professor at the PRA Lab, in the Department of Electrical and Electronic Engineering of the University of Cagliari, Italy. She received her PhD in Electronic and Computer Engineering from the University of Cagliari in 2022. Her research interests mostly focus on optimizing and debugging adversarial robustness evaluations

She was a visiting student at the University of Tuebingen, Germany, from March to June 2020 and at the Software Competence Center Hagenberg (SCCH), Austria, from May to August 2021, and at the Universitat Autònoma de Barcelona (UAB), in the Computer Vision Center (CVC), from July to October 2024.

She is area chair for NeurIPS, Associate Editor for Pattern Recognition, and reviewer for ACM CCS, ECCV, ICLR, ICCV, and for the journals IEEE TIFS, IEEE TIP, IEEE TDSC, IEEE TNNLS, TOPS. She is co-chair of the ACM Workshop on Artificial Intelligence and Security (AISec), co-located with ACM CCS.

MLCS 2025 programme

15/09/2025 14:00 - 17:30

14:00	Opening remarks: Welcome to MLCS2025 !
14:10	Keynote speech Session chair: Luca Demetrio Reliable Evaluation and Benchmarking of Machine Learning Models for Real-World Deployments Maura Pintor (University of Cagliari, Italy) Abstract: Rigorous evaluation of machine learning (ML) models is essential before deployment. To understand ML’s sensitivity to attacks and real-world challenges, ML model designers craft worst-case perturbations and test them against their products. However, many of the proposed defenses have been shown to provide a false sense of security due to failures of the attacks rather than actual robustness. To this end, it’s important to set up trustworthy evaluation tools. In this talk, we will investigate existing benchmarking tools and we will highlight their issues, avoiding known mistakes to ensure high-quality evaluations. Moreover, current ML benchmarks are a first step, but they only offer an in-vitro evaluation. Addressing practical aspects like how predictions react to data drift over time and model updates is also important in real-world applications. For this reason, we will provide insights on analyzing how both performance and robustness evolve over time. Finally, we will discuss new testing and benchmarking guidelines to develop novel techniques to ensure models behave robustly in real-world scenarios, where not only are they the target of attacks, but they are also subject to data drifts and situations unseen in training.
	Paper session: Adversarial Learning Session chair: Giuseppina Andresini
15:00	Towards an Adversarial Model for Fraud Detection Systems Chloé Przemyski, Geoffray Bonnin, Julien Polge, Armelle Brun
15:15	An Object-Level Entropy-Based Adversarial Attack for Image Privacy Wasaif Alsolami, Raul Santos-Rodriguez, Zahraa S Abdallah, James Pop
	Paper session: Intrusion Detection Session chair: Giuseppina Andresini
15:30	From One to Many: Few-Shot Deep Ensembles for Slow DoS Attack Detection Alberto Falcone, Massimo Guarascio,Angelica Liguori, Francesco Sergio Pisani, Francesco Scala
15:45	RAMPART-FL: Federated Learning for Intrusion Detection in the Edge through Reinforcement-based Multi-Criteria Participant Selection Lucas Sousa, Daniel Castro Silva, Sinan Wannous, Isabel Praça
16:00	Coffee break
16:30	Paper session: LLMs in Cybersecurity Session chair: Pedro Ferreira
16:30	Small but Dangerous: Evaluating and Mitigating Jailbreak Vulnerabilities in Small Language Models Leonardo Piano, Claudia Battistin, Jeriek Van den Abeele, Livio Pompianu
16:45	AnomalyExplainerBot: Explainable AI for LLM-based anomaly detection using BERTViz & Captum Prasasthy Balasubramanian, Dumindu Kankanamge, Ekaterina Gilman, Mourad Oussala
17:00	The Bitter Lesson Might Also Apply to Misuse Detection Robustness Hadrien Mariaccia, Charbel-Raphaël Segerie
	Paper session: Cyber threat detection Session chair: Pedro Ferreira
17:15	ATTAXML: Behaviour-Based Prediction of MITRE ATT&CK Techniques in Ransomware with Extreme Multi-Label Learning Faithful Onwuegbuche, Anca Delia Jurcut, Liliana Pasquale
17:30	Conclusions : Giuseppina Andresini, Luca Demetrio, Pedro Ferreira

About MLCS 2025

Short description

Relevance to the Machine Learning Community

Motivation

Goals

Call for papers

Workshop Topics

Paper submission

Important dates

Regular and research statement papers

June 14

23th Jun

14th July

25th July

Organizing Committee

Giuseppina Andresini

Luca Demetrio

Pedro M. Ferreira