Joshua P. Mervine
Principal Site Reliability Engineer | Infrastructure Architect | Engineering Leader
Technical leader with 25+ years of experience in reliability engineering, infrastructure architecture, observability, compliance, and automation. Proven ability to scale systems, lead distributed teams, and align technical initiatives with business objectives. Strengths include systems design, cross-functional collaboration, compliance-focused operations, and mentoring technical staff. Known for building tooling that empowers engineers and shipping reliable platforms at scale.
Core Competencies
- SRE & Infrastructure: Observability, Incident Response, High Availability, Logging Pipelines, Monitoring at Scale
- DevOps & Automation: CI/CD, GitOps, Infrastructure as Code (Terraform, Salt, Ansible, Chef, Puppet, etc.), Kubernetes, Docker
- Leadership & Strategy: Team Building, Technical Roadmaps, Project Planning, Mentorship, Executive Communication
- Security & Compliance: FedRAMP, HIPAA, PCI, SOX, Secure Logging, Access Control
- Languages: Go, Ruby, Python, JavaScript (Node.js), Bash
- Cloud & Tools: AWS, GCP, Splunk, Git, NGINX, Prometheus, Grafana
Professional Experience
Principal Service Reliability Engineer
MuleSoft / Salesforce May 2021 – Present
- Lead architect and technical owner for large-scale observability initiatives spanning both commercial and FedRAMP Moderate government environments across global deployments.
- Design and implement centralized logging pipelines and metrics-based monitoring platforms that balance cost, scalability, and operational coverage.
- Drive cross-functional projects to modernize deployment strategies and improve service reliability across diverse infrastructure stacks.
- Define and lead incident response practices, frameworks, and tooling used by distributed teams worldwide.
- Mentor engineers and collaborate with leadership to align reliability strategy with business and compliance goals.
Sr. Manager, Production Services
Heroku / Salesforce May 2017 – May 2021
- Managed two teams across Shared Service Operations and Compliance & Integration Engineering.
- Acted as Incident Commander for high-severity production incidents.
- Designed and maintained Splunk infrastructure in AWS to handle over 80TB/day of log ingestion.
- Built tools for access control, AWS account automation, and compliance workflows supporting HIPAA, PCI, and SOX standards.
Lead Service Reliability Engineer
Heroku / Salesforce May 2015 – May 2017
- Operated large-scale observability platforms (Splunk, Prometheus) with zero downtime and strict data loss SLAs.
- Developed internal tooling and infrastructure to support SRE functions and developer self-service.
- Led compliance-driven initiatives and improved reliability practices across teams.
Site Reliability Architect
YP.com (YellowPages.com) Aug 2014 – Apr 2015
- Designed and prototyped container-based deployment systems using Docker and MesOS.
- Introduced CI/CD pipelines based on GitOps principles.
- Refactored legacy automation and mentored junior SREs on best practices.
Associate Director of DevOps
YP.com May 2011 – Aug 2014
- Led a high-impact DevOps team for a top 50 US web property with over 30M monthly users.
- Migrated core applications from Ruby on Rails to Node.js with zero downtime.
- Reduced release times by 75% through deployment pipeline optimization.
- Established automated performance testing pipelines to improve release confidence.
Team Manager, Operations
Walt Disney Parks & Resorts Online Jan 2010 – May 2011
- Managed a globally distributed web operations team supporting Disneyland.com and related sites.
- Oversaw production operations, high-availability deployments, and release engineering in a 24/7 environment.
Sr. Release Engineering Manager
EarthLink, Inc. Dec 2006 – Dec 2009
- Owned build and deployment pipelines for 50+ web applications and 15 client apps.
- Created alignment between Product, QA, and Engineering for smoother releases.
Sr. Development Tools Engineer
EarthLink, Inc. Jul 2005 – Nov 2006
- Built a 130+ server internal development data center from scratch—tripling development throughput.
Engineering Leadership Roles
EarthLink, Inc. 1998 – 2005
- Led engineering and maintenance of EarthLink’s customer portal and associated tools.
- Progressed from intern to team lead, managing production web properties and core infrastructure.
Open Source Highlights
BootstrapCDN
Longtime contributor to the public content delivery network for the Bootstrap framework. Worked closely with maintainers of Bootstrap, Font Awesome, and Bootswatch. Contributed to infrastructure, update automation, and site reliability.
shml (Shell Markup Language)
Author of a widely adopted shell scripting library (~445 stars) providing structured, styled terminal output for Bash and Zsh. Previously maintained and used in numerous deployment and CI pipelines. Available via npm and Homebrew.
shunt
Created a lightweight Bash unit-testing framework designed for testing system and deployment scripts where traditional test frameworks are unavailable. Now community-supported and referenced in TDD scripting tutorials.
gojson-http
Built a developer utility for converting JSON to Go structs with a web interface. Supports fast prototyping and API design. Powers json2struct.mervine.net, which continues to see regular usage.
splunking
Developed a Go library to simplify log ingestion into Splunk via HTTP Event Collector (HEC). Lightweight and reusable in observability tooling or internal automation pipelines.
GitHub: github.com/jmervine | github.com/odb |
Additional Information
- Location: Portland, OR Metro
- Work Authorization: U.S. Citizen
- Remote/Hybrid/Onsite: Open to all
- Clearance: Previously supported FedRAMP Moderate systems