New Publications | Number |
---|---|
Monographs and Edited Volumes | 0 |
PhD Theses | 0 |
Journal Articles | 5 |
Book Chapters | 0 |
Conference Publications | 5 |
Technical Reports | 0 |
White Papers | 0 |
Magazine Articles | 0 |
Working Papers | 0 |
Datasets | 0 |
Total New Publications | 10 |
Projects | |
New Projects | 1 |
Ongoing Projects | 1 |
Completed Projects | 0 |
Members | |
Faculty Members | 5 |
Senior Researchers | 7 |
Associate Researchers | 8 |
Researchers | 25 |
Total Members | 45 |
New Members | 7 |
PhDs | |
Ongoing PhDs | 5 |
Completed PhDs | 0 |
New Seminars | |
New Seminars | 10 |
Date: 09 January 2023
Presenter: Zoe Kotti
Abstract
Existing work on the practical impact of software engineering (SE) research examines industrial relevance rather than adoption of study results, hence the question of how results have been practically applied remains open. To answer this and investigate the outcomes of impactful research, we performed a quantitative and qualitative analysis of 4,354 SE patents citing 1,690 SE papers published in four leading SE venues between 1975-2017. Moreover, we conducted a survey on 475 authors of 593 top-cited and awarded publications, achieving 26% response rate. Overall, researchers have equipped practitioners with various tools, processes, and methods, and improved many existing products. SE practice values knowledge-seeking research and is impacted by diverse cross-disciplinary SE areas. Practitioner-oriented publication venues appear more impactful than researcher-oriented ones, while industry-related tracks in conferences could enhance their impact. Some research works did not reach a wide footprint due to limited funding resources or unfavorable cost-benefit trade-off of the proposed solutions. The need for higher SE research funding could be corroborated through a dedicated empirical study. In general, the assessment of impact is subject to its definition. Therefore, academia and industry could jointly agree on a formal description to set a common ground for subsequent research on the topic.
Date: 01 March 2023
Presenter: Georgia Kapitsaki
Abstract
Existing work on the practical impact of software engineering (SE) research examines industrial relevance rather than adoption of study results, hence the question of how results have been practically applied remains open. To answer this and investigate the outcomes of impactful research, we performed a quantitative and qualitative analysis of 4,354 SE patents citing 1,690 SE papers published in four leading SE venues between 1975-2017. Moreover, we conducted a survey on 475 authors of 593 top-cited and awarded publications, achieving 26% response rate. Overall, researchers have equipped practitioners with various tools, processes, and methods, and improved many existing products. SE practice values knowledge-seeking research and is impacted by diverse cross-disciplinary SE areas. Practitioner-oriented publication venues appear more impactful than researcher-oriented ones, while industry-related tracks in conferences could enhance their impact. Some research works did not reach a wide footprint due to limited funding resources or unfavorable cost-benefit trade-off of the proposed solutions. The need for higher SE research funding could be corroborated through a dedicated empirical study. In general, the assessment of impact is subject to its definition. Therefore, academia and industry could jointly agree on a formal description to set a common ground for subsequent research on the topic.
Georgia (Zeta) M. Kapitsaki is an Associate Professor at the Department of Computer Science, UCY. She received her Ph.D. degree in Electrical and Computer Engineering from the School of Electrical and Computer Engineering of National Technical University of Athens (NTUA), her M.Sc. in Technoeconomical Systems and her Diploma in Electrical and Computer Engineering. She has worked as a software engineer in Germany, as a research associate at NTUA and as a laboratory assistant at the Technical Institute of Piraeus. She has been the principal investigator of national and EU funded research projects (e.g. SocioCoast, CYberSafetyIII) and has worked on EU projects (e.g. PaaSage, VALS). She has been involved in the organization of international conferences (e.g. ICSME 2022, SAC 2019). She has served as member of the program committee of international conferences (e.g. WISE 2022, ICWS 2022, ENASE 2022). She is a member of the editorial board of ERCIM News. She has published over 60 papers in peer-reviewed journals, and scientific conferences and workshops. Her research interests include Software Engineering, Open source software and reuse, Privacy Enhancing Technologies and Context-aware Applications. She is a faculty member of the Software Engineering and Internet Technologies Laboratory (SEIT).
Date: 29 May 2023
Presenter: Georgios Liargkovas
Abstract
The research and academic environment in the United States is a topic of great interest to students and researchers worldwide. In this presentation, as an exchange student who recently returned from a program at Brown University, I will share my experience and provide (I think valuable) insights regarding the research environment, life in the US, and significant lessons I learned during this period.
The presentation will begin with an introduction, covering the basic details of the program. Details about the US research environment, student life, challenges faced, and lessons learned will be then discussed. Finally, a brief reference will be made to the recently-accepted paper at HotOS '23 "Executing Shell Scripts in the Wrong Order, Correctly", which will be presented in June.
Date: 20 September 2023
Presenter: Charalambos Mitropoulos
Abstract
We introduce Fuzzol, the first syntax-aware mutation fuzzer for systematically testing the security and reliability of solc, the standard Solidity compiler. Fuzzol addresses a challenge of existing fuzzers when dealing with structured inputs: the generation of inputs that get past the parser checks of the system under test. To do so, Fuzzol introduces a novel syntax-aware mutation that breaks into three strategies, each of them making different kind of changes in the inputs. Moreover, to explore new paths in the compiler’s codebase faster, we introduce a mutation strategy prioritization algorithm that allows Fuzzol to identify and apply only those mutation strategies that are most effective in exercising new inter- esting paths. To evaluate Fuzzol, we test 33 of the latest solc stable re- leases, and compare fuzzol with (1) Superion, a grammar-aware fuzzer, (2) AFL-compiler-fuzzer, a text-mutation fuzzer and (3) two grammar-blind fuzzers with advanced test input generation schedules: AFLFast and MOpt-AFL. fuzzol identified 19 bugs in total (7 of which were previously unknown to Solidity developers), while the other fuzzers missed half of these bugs.
This work has been accepted at ESORICS '23 and will be presented in September 2023.
Date: 04 October 2023
Presenter: Apostolos Glenis
Abstract
Time-Series Classification (TSC) is an important problem in many fields across sciences. Many algorithms for TSC use symbolic representation to combat noise. In this paper we propose a framework, namely SCALE-BOSS, to build TSC algorithms that exploit time-series models based on symbolic representations. While alternative symbolic representations can be incorporated, we have opted to use the Bag-Of-SFA (BOSS) approach, and thus SFA, as a state-of-the-art symbolic time series representation. We investigate the efficiency of several instantiations of this framework based on two main variations, where the TSC model is built either by a time-series classification or by a clustering algorithm. The objective is to advance the computational efficiency of TSC classification algorithms without sacrificing their accuracy. We evaluate the instantiations of the SCALE-BOSS framework on those datasets in the UCR time-series repository that include the largest training sets. Comparisons with state of the art methods on TSC show the balance between computational efficiency and accuracy on predictions achieved.
Date: 16 October 2023
Presenter: Tushar Sharma, Dalhousie University
Abstract
Machine Learning (ML) has emerged as a transformative force across various domains, and Software Engineering is no exception. In this talk, we will explore how ML is currently being applied in software engineering, delve into the exciting opportunities it offers, and discuss the key challenges that must be overcome to unlock its full potential. Using a recently completed survey paper as a reference, Tushar will discuss the fine-grained tasks, tools, and techniques for applying ML in various software engineering tasks. Tushar will provide an overview of the recent research studies from his lab in this direction. The talk will offer an opportunity to discuss the rapidly changing software engineering landscape due to recent disruptive advancements in ML.
Tushar Sharma is an assistant professor at Dalhousie University, Canada. The topics related to software design and architecture, refactoring, software code quality, technical debt, and machine learning for software engineering (ML4SE) define his career interests. He earned a PhD from Athens University of Economics and Business, Athens, Greece, specializing in software engineering in May 2019. Earlier, he obtained an MS in Computer Science from the Indian Institute of Technology-Madras, Chennai, India. His professional experience includes working with Siemens Research (formally, Siemens Technology), Charlotte, USA for approximately two years (2019-2021) as well as Siemens Corporate Technology, Bangalore, India for more than seven years (2008-2015). He was the principal investigator for MINDSIGHT team of DARPA AMP program consisting of researchers from Siemens, JHU/APL, BAE systems, and UC Irvine. He co-authored Refactoring for Software Design Smells: Managing Technical Debt and two Oracle Java certification books. He founded and developed Designite which is a software design quality assessment tool used by many practitioners and researchers worldwide. He is an IEEE Senior Member.
Date: 01 November 2023
Presenter: Panos Louridas
Abstract
Sequential recommendation problems have received increasing attention in research during the past few years, leading to the inception of a large variety of algorithmic approaches. In this work, we explore how large language models (LLMs), which are nowadays introducing disruptive effects in many AI-based applications, can be used to build or improve sequential recommendation approaches. Specifically, we devise and evaluate three approaches to leverage the power of LLMs in different ways. Our results from experiments on two datasets show that initializing the state-of-the-art sequential recommendation model BERT4Rec with embeddings obtained from an LLM improves NDCG by 15-20% compared to the vanilla BERT4Rec model. Furthermore, we find that a simple approach that leverages LLM embeddings for producing recommendations, can provide competitive performance by highlighting semantically related items. We publicly share the code and data of our experiments to ensure reproducibility.
Date: 13 November 2023
Presenter: Diomidis Spinellis
Abstract
Developers, system administrators, and data scientists often struggle with the powerful yet often cryptic command-line interfaces. The solution? ai-cli-lib, an open-source library that converts natural language prompts into executable commands for diverse command-line tools. Its operation is based on dynamic linking, configurable AI API interfaces, and dynamic prompt engineering. The talk introduces ai-cli-lib as an AI-based productivity booster for software developers, presents an overview of building an AI-enabled software product, and discusses the use of AI in software development informed through the 261 ChatGPT interactions that aided ai-cli-lib's development.
Date: 13 December 2023
Presenter: Ilias Balampanis
Abstract
Intrusion Detection Systems (IDS) based on Machine Learning (ML) techniques are essential for cybersecurity, utilizing datasets to detect and mitigate malicious network or system activities. The efficacy of IDSs is contingent on datasets that are both extensive and representative of actual cyber threats, ensuring accurate and robust system performance evaluation. This paper reports on a systematic literature review (SLR) that examines the landscape of datasets for IDS training and evaluation. The SLR explores the traits, creation methods, and constraints of these datasets, alongside the identification of challenges in their generation and utility. Highlighted is the variance in dataset quality, the gradual pace of development, and the ongoing deficit of datasets in the intrusion detection domain. With the continuous evolution of cyber threats, a persistent reassessment of these datasets is imperative for maintaining their pertinence and efficacy. Our SLR aggregates and dissects the extant research to elucidate the strengths and weaknesses of current datasets, determining their aptness for varied IDS contexts. The objective is to delineate the present state of IDS datasets, suggest measures for their enhancement, and propose future research directions. We emphasize the need for standardized, comprehensive, and ethically sound datasets that reflect the evolving threat landscape. Future research should focus on data augmentation to cover a broader spectrum of attack scenarios, improving the robustness of IDS against diverse threats. Additionally, fostering open-source datasets and cross-sector collaboration is crucial for integrating practical, real-world cybersecurity challenges into academic research, thereby democratizing access and promoting innovation in IDS development.
Date: 15 December 2023
Presenter: D. Karlis, G. Vouros, and G. Giannakopoulos
Abstract