Michael Niebisch
Leveraging Large Language Models for Legacy Code Translation: Challenges and Solutions
#1 · about 5 minutes
Motivations for translating legacy MATLAB code to Python
The project explored using LLMs to modernize a large legacy MATLAB codebase, motivated by the scarcity of MATLAB developers and the rise of Python.
#2 · about 4 minutes
Using a semi-automatic workflow with ChatGPT for translation
The initial approach involved a manual copy-paste workflow using the ChatGPT web interface, which saved time on boilerplate but struggled with large code chunks and introduced errors.
#3 · about 4 minutes
Overcoming language-specific challenges in code translation
Key translation challenges arose from fundamental differences between MATLAB and Python, such as array indexing and memory layout, requiring a divide-and-conquer approach and robust unit tests.
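The indexing and memory-layout differences mentioned above can be illustrated with a small NumPy sketch. It is not taken from the talk; the array values and variable names are assumptions chosen for illustration.

```python
import numpy as np

# In MATLAB, A = reshape(1:6, 2, 3) fills the matrix column by column:
#   A = [1 3 5
#        2 4 6]
values = np.arange(1, 7)

# NumPy's default reshape is row-major (C order), which gives a different matrix.
row_major = values.reshape(2, 3)

# Passing order="F" (Fortran / column-major) reproduces the MATLAB layout.
col_major = values.reshape(2, 3, order="F")

# MATLAB indices start at 1, NumPy indices start at 0.
# MATLAB A(2, 1) corresponds to NumPy [1, 0].
matlab_a_2_1 = col_major[1, 0]

# MATLAB linear indexing A(4) walks the matrix in column-major order.
matlab_a_4 = col_major.flatten(order="F")[3]

print(row_major)
print(col_major)
print(matlab_a_2_1, matlab_a_4)
```

A naive translation that keeps the default `reshape` and the original index values would run without error but return different numbers, which is one reason unit tests matter for this kind of work.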
#4 · about 5 minutes
Developing an automated pipeline for translation and auto-fixing
To improve efficiency, an automated pipeline was built to first annotate code with type and shape information before translation and then use an agent-based tool to automatically fix bugs based on test failures.
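The test-driven auto-fixing step described above might look roughly like the loop below. This is a minimal sketch, not the pipeline from the talk: `auto_fix`, `run_tests`, and `stub_fix` are hypothetical names, and the LLM call is replaced by a hard-coded stub so the example runs on its own.

```python
from typing import Callable, Optional, Tuple


def auto_fix(
    code: str,
    run_tests: Callable[[str], Optional[str]],
    propose_fix: Callable[[str, str], str],
    max_rounds: int = 3,
) -> Tuple[str, bool]:
    """Repeatedly test the code and ask for a fix until tests pass or rounds run out."""
    for _ in range(max_rounds):
        failure = run_tests(code)
        if failure is None:
            return code, True
        # In a real pipeline this is where the failure message would go to an LLM.
        code = propose_fix(code, failure)
    return code, run_tests(code) is None


def run_tests(code: str) -> Optional[str]:
    """Hypothetical test runner: returns None on success, else a failure message."""
    namespace: dict = {}
    exec(code, namespace)
    result = namespace["first"]([10, 20, 30])
    return None if result == 10 else f"expected 10, got {result}"


def stub_fix(code: str, failure: str) -> str:
    """Stand-in for an LLM call (assumption): corrects a leftover 1-based index."""
    return code.replace("xs[1]", "xs[0]")


# A translated function that still uses MATLAB-style 1-based indexing.
buggy = "def first(xs):\n    return xs[1]\n"
fixed, ok = auto_fix(buggy, run_tests, stub_fix)
print(ok)
print(fixed)
```

Capping the number of rounds keeps a failing fix from looping forever, and returning the pass/fail flag lets a human take over when the automated attempt does not converge.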
#5 · about 4 minutes
Evaluating LLM performance and providing debugging support
A framework was developed to evaluate translation quality by testing against known failure cases, and a debugging tool was built that uses LLMs to compare execution logs from both languages and pinpoint errors.
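The idea of comparing execution logs from both languages can be sketched with a plain numeric comparison. The talk describes using LLMs for this step; the deterministic check below is a swapped-in simplification, and the log format (a list of variable name and value pairs) and the function name `first_divergence` are assumptions.

```python
import math
from typing import List, Optional, Tuple

LogEntry = Tuple[str, float]


def first_divergence(
    matlab_log: List[LogEntry],
    python_log: List[LogEntry],
    rel_tol: float = 1e-9,
) -> Optional[Tuple[int, str, float, float]]:
    """Return (step, name, matlab_value, python_value) for the first mismatch, else None.

    Only the overlapping prefix of the two logs is compared.
    """
    for step, ((name_m, val_m), (name_p, val_p)) in enumerate(zip(matlab_log, python_log)):
        if name_m != name_p or not math.isclose(val_m, val_p, rel_tol=rel_tol):
            return step, name_m, val_m, val_p
    return None


# Hypothetical logs recorded while running the original and the translated code.
matlab_log = [("x", 1.0), ("y", 2.5), ("z", 4.0)]
python_log = [("x", 1.0), ("y", 2.5), ("z", 3.0)]

divergence = first_divergence(matlab_log, python_log)
print(divergence)
```

Locating the first step where the two runs disagree narrows the search to the statement that produced that variable, which is the part of the code worth inspecting first.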
#6 · about 3 minutes
Considering local LLMs for security and summarizing key learnings
Because of IP and security concerns with cloud APIs, local models such as Llama 2 were explored. The project concluded that LLMs are promising tools, but fully automated, error-free translation remains a significant challenge.