Markus Harrer
Data Science on Software Data
#1about 4 minutes
The challenge of justifying legacy system improvements
Technical debt in legacy systems is difficult to communicate to management because its impact is less visible than new features or bugs.
#2about 4 minutes
The promise and failure of universal software quality metrics
Early software analytics aimed to create universal quality dashboards but failed because metrics and models are not transferable between unique projects.
#3about 5 minutes
Adopting analytics approaches for project-specific questions
Instead of reusing non-transferable results, teams can adapt the methodologies and tools from software analytics to answer their own unique, high-impact questions.
#4about 5 minutes
Using data science as a foundation for software analytics
Reproducible data science provides the necessary methodologies and tools for open and automated analysis, leveraging skills developers already possess.
#5about 6 minutes
Exploring software data types and practical analysis use cases
Analyzing static, runtime, chronological, and community data can reveal code ownership gaps, performance bottlenecks, and opportunities for modularization.
#6about 13 minutes
Analyzing code coverage with Python, pandas, and Jupyter
A live coding demo shows how to use Python, pandas, and Jupyter notebooks to analyze production code coverage data and visualize unused code packages.
#7about 3 minutes
An introduction to graph analytics for software systems
Graph analytics with tools like jQAssistant and Neo4j helps visualize and query interconnected software data like class dependencies and method calls.
#8about 1 minute
Key principles for effective software data analysis
Successful software data analysis requires focusing on solving specific problems, working openly, automating processes, and deriving actionable next steps.
#9about 8 minutes
Q&A on production code analysis and performance bottlenecks
The speaker answers questions about analyzing production codebases, sharing examples of identifying performance bottlenecks and justifying technology choices with data.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
08:36 MIN
Why data engineering needs software engineering discipline
Modern Data Architectures need Software Engineering
17:41 MIN
Presenting live web scraping demos at a developer conference
Tech with Tim at WeAreDevelopers World Congress 2024
26:28 MIN
Practical tools and education for developers and users
Responsible AI in Practice: Real-World Examples and Challenges
39:20 MIN
Using AI to manage legacy code and technical debt
Transforming Software Development: The Role of AI and Developer Tools
23:19 MIN
Building a company-wide data culture and literacy
Modern Data Architectures need Software Engineering
18:14 MIN
Building a self-service tool for domain experts
Solving the puzzle: Leveraging machine learning for effective root cause analysis
14:10 MIN
Leveraging open software and AI for code development
The Future of Computing: AI Technologies in the Exascale Era
22:14 MIN
Integrating ethics and data governance into development
The Future of Developer Experience with GenAI: Driving Engineering Excellence
Featured Partners
Related Videos
Getting to Know Your Legacy (System) with AI-Driven Software Archeology
Markus Harrer
Modern Data Architectures need Software Engineering
Matthias Niehoff
Grappling With Clunky Old Software? Start by Understanding What’s Inside!
Luc Perard
Enjoying SQL data pipelines with dbt
Matthias Niehoff
From Monolith Tinkering to Modern Software Development
Lars Gentsch
The Road to One Billion Developers
Thomas Dohmke & Demetris Cheatham
New AI-Centric SDLC: Rethinking Software Development with Knowledge Graphs
Gregor Schumacher, Sujay Joshy & Marcel Gocke
The Clean as You Code Imperative
Olivier Gaudin
From learning to earning
Jobs that call for the skills explored in this talk.


Senior Systems/DevOps Developer (f/m/d)
Bonial International GmbH
Berlin, Germany
Senior
Python
Terraform
Kubernetes
Elasticsearch
Amazon Web Services (AWS)


Team Lead and Senior Software Engineer with focus on AI
Dynatrace
Linz, Austria
Senior
Java
Team Leadership



Software Developer with Data Science Expertise in Energy System Modelling
ETH Zürich
€208K
C++
GIT
Python
Software Architecture

Student assistant: Software Developer for Data-Driven Applications
Fraunhofer-Gesellschaft
API
.NET
React
Python
Next.js
+3

Data Scientist / ML Engineer
Collaboration Betters The World GmbH
€68-84K
Intermediate
Azure
Scrum
Python
PyTorch
+7