WORK EXPERIENCE:
Staff Software Engineer, Platform
Atelio (by FIS) (via acq. of Bond Financial Technologies), USA (Remote)
06/2021 - 02/2025
- Responsible for the operation and maintenance of our cloud infrastructure, reliability, observability, and monitoring
- Oversaw and managed (player/coach methodology) the ongoing maintenance of Kubernetes, terraform, and cloud resources
- Owned organization-wide incident response plans and procedures. Primarily escalation point for all P1/P0 incidents
- Provide mentorship and guidance to junior and mid-level engineers for effective management of projects with appropriate prioritization and communication
- Conduct research and comparative analysis of potential software vendors, and making build vs buy decisions
- Manage relationships and contracts with external software vendors (AWS, Fastly, Datadog, VGS, StrongDM, Vanta)
- Represented the Engineering organization at organization-wide leadership meetings during the post-acqusition period
- Cross-functional collaboration with product teams to deprecate redundent systems and duplicated functionality to reduce operational complexity.
- Reconciled existing infrastructure and tooling into appropriate Terraform projects
- Assisted and Supplemented Product Engineering with new feature development based on priorities and required timelines
Senior Software Engineer - Site Reliability Engineering
Fullstory, Austin, TX, USA (Remote)
02/2019 - 06/2021
- Responsible for the maintainance and functionality of internally-build depployment orchestration system
- Managed production and pre-production Kubernetes environments
- Managed day-to-day operational issues and scaling of our internal Prometheus-based monitoring systems
Senior Software Engineer - Site Reliability Engineering
Yonder (formally New Knowledge), Austin, Texas, USA
02/2019 - 02-2020
- Owned the prioritization and execution of all Devops, Infrastructure, and Site Reliability requirements
- Maintained multiple Kubernetes clusters for both production and staging workloads
- Worked with individual Product Engineering leads to reduce operational complexity and streamline our engineering process
- Actively worked to reduce existing overengineered solutions and improve engineering productivity
Senior Software Engineer - Infrastructure
Pixlee, Austin, Texas, USA
11/2018 - 02/2019
- Updated the development workflow of core applications to include modern and professional software engineering practices
- Designed and developed reproducable and automated developer environments based in a Kubernetes environment
- Identified and communicated fundamental issues in the existing configuration management, and developed a safe migration plan to correct the issues
- Identified and communicated issues in the current production infrastructure which negatively impact system cost, reliabilty, and operational insight
- Delivered a safe, long term plan to migrate to Kubernetes in order to reduce the infrastructure bloat, consolidate services, improve reliability, and ease operational burden
Staff Software Engineer
Cratejoy, Austin, Texas, USA
01/2018 - 10/2018
- Managed our production Kubernetes infrastructure, staging environments, and CI/CD pipelines
- Interfaced with individual product teams in order to plan for upcoming deployment, monitoring, and tooling needs
- Migrated our central application deployments to team-specific automated deployments
- Developed internal services to aid in the ease of development of user facing products
Senior Software Engineer
Cratejoy, Austin, Texas, USA
02/2015 - 01/2018
- Formed and led our Site Reliability Engineering team in order to prioritize stability, reliability, performance, and ease of development
- Identify, investigate, and resolve platform-wide performance and reliability issues
- Developed and released a reliable internal Traffic Analysis system (with full grainularity), used throughout the company to make business critical decisions
- Developed and maintained features for the Merchant Tools section of the Cratejoy Platform