Director, Site Reliability Engineering
Led distributed SRE teams across North America, expanded operations coverage, and advanced infrastructure as code, security posture, compliance readiness, incident management, and reliability measurement.
DevOps, SRE, production operations
Technologist building secure, resilient, and cost-efficient platforms across cloud and data center environments.
I help organizations modernize how technology is built, operated, secured, and improved. My work spans platform engineering, production reliability, cloud economics, AI-assisted delivery, incident response, and the leadership practices that keep teams moving with purpose.
A lifelong technologist with a career centered on improving, securing, and scaling the systems businesses depend on.
I bring hands-on credibility and senior leadership across DevOps, SRE, IT, cloud operations, compliance readiness, infrastructure automation, incident response, and business continuity. I am strongest where technical depth, strategic execution, calm judgment, and persistent follow-through are all required.
I also know how to make AI useful in real operating environments: accelerating engineering workflows, helping teams reason through complex systems, and implementing cloud-scale AI capabilities without losing sight of reliability, governance, or cost.
Focused on my platform, cloud, operations, security, and leadership impact.
Led distributed SRE teams across North America, expanded operations coverage, and advanced infrastructure as code, security posture, compliance readiness, incident management, and reliability measurement.
Led cloud operations and site reliability teams supporting security products, release stability, business continuity, employee growth, scalable platforms, and secure cloud infrastructure.
Led DevOps teams across AWS operations, media-stack cloud buildouts, Kubernetes and EKS migrations, cloud networking, compliance, and cost efficiency initiatives.
Designed and automated scalable AWS services for GoPro Plus, including logging, infrastructure provisioning, configuration management, and platform migration work.
Helped move production, staging, and development environments from data centers to AWS, then advanced delivery automation, disaster recovery, monitoring, support tools, and cost optimization.
Modern platform leadership across reliability, automation, AI, security, and cost discipline.
Practical use of GitHub Copilot, Cursor AI, ChatGPT Codex, Lovable, AWS Bedrock, Google Gemini, and related AI systems to improve engineering speed, documentation, operations, and cloud-scale implementation.
Known for controlling operating costs through EDPs, PPAs, autoscaling, Spot Instances, Spot Fleets, Savings Plans, Reserved Instances, architectural changes, Spotinst.io, Cloudability, CloudZero, and careful platform design.
Incident management, severity processes, observability, on-call maturity, reliability KPIs, business continuity, and production systems designed for recovery.
Cloud and data center experience across AWS, GCP, Azure, Oracle Cloud, Kubernetes, infrastructure as code, networking, migration, and production operations.
Security-minded platform work with exposure to SOC 2, HIPAA, PCI, SOX, FedRAMP, change control, access patterns, audit readiness, and resilient operational practices.
Leadership for distributed teams, engineering managers, operational coverage, executive communication, delivery planning, mentorship, and sustainable team practices.
A practical view of the tools, platforms, and disciplines I use to build and operate technology.
Hands-on enough to understand the system, senior enough to shape the organization. Technology leadership with a builder's instincts.