Bringing AI Model Testing and Prompt Management to Your Codebase with GitHub Models
Is your AI development just 'vibes-based'? Learn how to run automated prompt evaluations as a blocking check on every pull request.
#1about 3 minutes
The challenge of testing non-deterministic AI features
Traditional development relies on rigorous testing, but AI features are often implemented based on intuition without a structured evaluation process.
#2about 5 minutes
Managing prompts as code with GitHub Models
GitHub Models integrates AI development into your repository by defining prompts, models, and parameters in a version-controlled YAML file.
#3about 6 minutes
Using evaluators to compare AI model variants
The platform allows you to run multiple prompt and model variations against a test dataset to compare outputs on metrics like latency, coherence, and similarity.
#4about 5 minutes
Consuming prompt files in your application code
Use the GitHub Models inference API or the Azure AI Inference SDK to load your version-controlled prompt files and integrate AI calls directly into your application.
#5about 2 minutes
Local development and testing with the CLI
The GitHub CLI extension allows you to run prompts and execute model evaluations directly from your terminal for rapid, local iteration before committing changes.
#6about 4 minutes
Automating repository tasks with AI-powered actions
Use GitHub Actions to automate common repository tasks like generating changelogs from pull requests, triaging bug reports, or creating weekly issue summaries.
#7about 1 minute
Implementing CI/CD for AI prompt changes
Integrate prompt evaluations into your CI/CD pipeline using GitHub Actions to automatically run tests and block pull requests that degrade model performance.
#8about 2 minutes
Adopting GitHub Models in existing projects
You can quickly convert existing prompt files to the GitHub Models format to gain access to powerful evaluation, comparison, and automation capabilities.
Related jobs
Jobs that call for the skills explored in this talk.
GitHub Copilot: Beyond the Basics – 10 Ways to Elevate Your CodingWelcome to an in-depth exploration of GitHub Copilot and its capabilities. If you're a software developer or someone intrigued by AI's potential to revolutionize coding, this post is for you. GitHub Copilot, an AI-powered code completion tool, offers...
Daniel Cranney
GitHub's Copilot Ads and Opt-out for AI Training DataOur newsletter - The Dev Digest - is packed with links to all kinds of tech content, but we just can’t cover everything. That’s why we put together the Overflow, where we share some of our favourites in bonus posts and videos, and this time we’re ta...
Daniel Cranney
Panel Discussion: Responsible AI in Practice - Real-World Examples and ChallengesIntroductionIn the ever-evolving landscape of artificial intelligence, the concept of "responsible AI" has emerged as a cornerstone for ethical and practical AI implementation. During the WWC24 Panel discussion, three eminent experts—Mina, Bjorn Brin...
Chris Heilmann
Exploring AI: Opportunities and Risks for DevelopersIn today's rapidly evolving tech landscape, the integration of Artificial Intelligence (AI) in development presents both exciting opportunities and notable risks. This dynamic was the focus of a recent panel discussion featuring industry experts Kent...
From learning to earning
Jobs that call for the skills explored in this talk.