I build quality assurance systems for Generative AI — from evaluation frameworks for enterprise deployment validation.
- LLM Evaluation & Validation — behavioral testing, output quality scoring, safety, and bias assessment for large language models
- AI Implementation Consulting — requirements-to-deployment pipelines for enterprise AI systems (IBM Watsonx, Healthcare AI)
- Automated QA Engineering — Python-based test automation, regression frameworks, edge case detection for AI outputs
- GRADE Framework — author of a structured methodology for grading and evaluating generative AI model outputs
Post-Discharge Patient Care Agent built with IBM Watsonx.
Stack: watsonx.ai · watsonx Assistant · PostgreSQL/DB2 ·
Node.js · Flask · React · MLflow · Tableau
Role: Full delivery cycle — research design, KPI dashboards,
conversation flows, escalation logic, and MLflow performance tracking.
Locally-hosted single-page application for IP and latency metrics.
Designed for field technicians. Built with React.
Full-stack web development portfolio:
JavaScript · Database Foundations · Mobile App Development ·
Web Design & Development
AI / ML: IBM Watsonx · MLflow · LLM Evaluation · Prompt Engineering
Languages: Python · JavaScript · C# · HTML/CSS
Frameworks: React · ASP.NET Core MVC · Entity Framework Core
Databases: PostgreSQL · DB2 · SQL
Tools: Git · Tableau · Power BI · Enterprise Design Thinking
Domains: Healthcare AI · QA Automation · Data Quality
📍 Boise, Idaho, USA
🏆 Idaho CTE AI Panel Speaker — July 2026
🔗 LinkedIn
Open to AI QA roles, LLM evaluation contracts, and implementation consulting engagements.


