Skip to main navigation Skip to search Skip to main content

LLM-Guided Genetic Improvement: Envisioning Semantic Aware Automated Software Evolution (arXiv version)

  • Karine Even-Mendoza
  • , Alexander Brownlee
  • , Alina Geiger
  • , Carol Hanna
  • , Justyna Petke
  • , Federica Sarro
  • , Dominik Sobania
  • University of Stirling
  • Johannes Gutenberg University Mainz
  • Johannes Gutenberg-Universität Mainz
  • University College London, UK.
  • UCL University College London

Research output: Working paper/PreprintPreprint

Abstract

Genetic Improvement (GI) of software automatically creates alternative software versions that are improved according to certain properties of interests (e.g., running-time). Search-based GI excels at navigating large program spaces, but operates primarily at the syntactic level. In contrast, Large Language Models (LLMs) offer semantic-aware edits, yet lack goal-directed feedback and control (which is instead a strength of GI). As such, we propose the investigation of a new research line on AI-powered GI aimed at incorporating semantic aware search. We take a first step at it by augmenting GI with the use of automated clustering of LLM edits. We provide initial empirical evidence that our proposal, dubbed PatchCat, allows us to automatically and effectively categorize LLM-suggested patches. PatchCat identified 18 different types of software patches and categorized newly suggested patches with high accuracy. It also enabled detecting NoOp edits in advance and, prospectively, to skip test suite execution to save resources in many cases. These results, coupled with the fact that PatchCat works with small, local LLMs, are a promising step toward interpretable, efficient, and green GI. We outline a rich agenda of future work and call for the community to join our vision of building a principled understanding of LLM-driven mutations, guiding the GI search process with semantic signals.
Original languageEnglish
PublisherarXiv
Number of pages5
DOIs
Publication statusPublished - 25 Aug 2025

Fingerprint

Dive into the research topics of 'LLM-Guided Genetic Improvement: Envisioning Semantic Aware Automated Software Evolution (arXiv version)'. Together they form a unique fingerprint.

Cite this