Craig Stueber
Applied AI Engineer
Builds and ships production LLM systems end to end. Doctoral researcher in AI safety.
“Work hard and be nice to people.”

experience
Work History
Senior Full Stack Engineer
AI Systems Integration- Tech lead and people lead for a team of 6 engineers building Dekaflow 2.0, transitioning $100B+ in annual energy movement from on-prem to cloud.
- Led early-stage AI agent R&D designing a six-agent LangGraph pipeline for enterprise data understanding, decreasing business stakeholder analysis time by 90%.
- Built and owned full-stack features end to end across Next.js, Java, MongoDB, and Azure supporting gas flow scheduling, hourly quantity tracking, and a cross-cutting user preferences system, saving 10K+ hours monthly on operations.
- Led enterprise-wide GitHub Copilot deployment across 200+ engineers, establishing behavioral guardrails and governance practices, reducing AI-introduced defects in production codebases.
Senior Full Stack Developer
LLM-Integrated Systems- Sole engineer across 6 independent brand teams, building all customer-facing applications from 0 to 1, delivering 140+ features across all brands.
- Developed customer service tool used daily by 10 reps, reducing critical issue resolution time from 36+ hours to 6-8 hours.
- Automated priority classification of service requests, routing emergency-tier messages without manual triage and eliminating bottlenecks across 100+ daily incoming requests.
- Built LLM-integrated pipelines for classification, summarization, and automated routing with controlled prompt A/B evaluations, saving the IT team 180+ hours monthly on customer processing.
Full Stack Engineer
ML-Enhanced IoT Systems- Integrated ML models for time-series anomaly detection and classification into backend services to identify sensor abnormalities and operational risks.
- Built real-time IoT monitoring dashboards using React, Python, and WebSockets, translating ML outputs into actionable insights for field operators.
Earlier Experience
2017 – 2021Delivered 50+ client projects across React, PHP, WordPress, Shopify, and early AR prototypes. Managed requirements and delivery timelines directly with clients, delivering 100% of projects on time.
Frontend modernization, email system consolidation, mobile UX improvements, and accessibility remediation.
Full WCAG and ADA audit and remediation, Wix platform migration, and staff accessibility training.
Built and delivered 10+ full-stack web applications for clients in publishing, real estate, and nonprofit industries. Owned requirements, scoping, and delivery -- secured 100% of clients through word-of-mouth referrals.
projects
Notable Work
CodeRisk Advisor
AI Safety / LLM SystemsMulti-agent AI security review system for Python, JavaScript, and TypeScript code. Combines OWASP Top 10 vulnerability scanning with AI-specific behavioral risk detection using a panel of specialized LLM agents that synthesize findings into conversational developer guidance.
- LangGraph pipeline orchestrating five specialized agents: VulnScanner, BehavioralRisk, Skeptic, Remediation, Synthesizer
- Skeptic agent actively disputes low-confidence findings to reduce false positives
- Token-by-token SSE streaming with real-time agent status updates in the UI
- Deployed on Google Cloud Run with LangSmith tracing for full observability
DanceCard
Agentic System / Mobile ApplicationCo-founded and led all engineering for a cross-platform social mobile application. Built an agentic onboarding system using CrewAI alongside a full React Native application -- owned architecture, data modeling, and delivery independently from concept to App Store.
- Agentic onboarding system using CrewAI with constrained generation patterns to maintain consistent, safe outputs in a consumer-facing context
- Full cross-platform React Native application with real-time chat, event scheduling, and location-aware discovery across iOS and Android
- Full App Store and Google Play submission including TestFlight and Play Console policy compliance
Dekaflow 2.0
High-stakes enterprise platform managing natural gas scheduling workflows supporting billions in annual east coast energy movement. Built on a modern React and cloud stack integrating with a 25-year-old Java and SQL legacy system.
Hot Tomato Summer
Multi-city restaurant voting platform reaching 30,000+ users in two weeks with rule-based fraud detection and voting anomaly dashboards.
skills
Technical Skills
Languages
AI & LLM Systems
Frameworks & Libraries
Infrastructure & Cloud
Data & Backend
Testing & Quality
Accessibility
Enterprise Tooling
research
Doctoral Research
Evaluating the Security of AI-Generated Code: A Quantitative Study Using a Custom Scoring Framework
Designs and validates a reproducible hybrid vulnerability scoring framework to detect and measure security risks in AI-generated code before deployment. Addresses a validated gap in the literature -- no systematic evaluation framework existed for assessing AI-generated code security across diverse programming tasks and contexts.
writings
Published Work
The Comfortable Apocalypse
When Survival Isn't the Problem — Irrelevance Is
The central risk of the AI age is not domination or rebellion, but displacement. As automation removes friction from daily life, it quietly erodes the cognitive and emotional capacities that effort once built — memory, judgment, curiosity, creativity, identity, and agency. The danger is not hostile AI, but a world where thinking becomes optional and human participation fades without resistance.
education
Academic Background
Doctor of Philosophy
Computer Science- AI safety and behavioral reliability
- Security risks in AI-generated code
- Hybrid vulnerability scoring framework combining OWASP, CVSS, and AI-specific pattern detection
Master of Science
Information Technology- IT management and information security management
- System design and architecture