Rodrigo Brim Monteiro Braga

DevOps / SRE Engineer
São Paulo, São Paulo

Summary

As an experienced engineer with a proven track record across diverse industries including IT, finance, e-commerce, and government, I bring a wealth of knowledge and a passion for technology to every challenge. I am a dedicated tech enthusiast with a commitment to continuous learning, constantly exploring and mastering new tools and technologies, from the latest advancements in monorepos to the exciting possibilities of AI assistance. Driven by a value-delivery mindset, I prioritize measurable outcomes and data-driven decision-making to ensure maximum impact. My leadership approach fosters high-performing teams built on mutual trust and a collaborative, non-opinionated problem-solving environment. While technically proficient, I also possess excellent soft skills that enable effective communication and collaboration. With extensive experience managing cloud, on-premises, and hybrid environments, I bring a comprehensive perspective to infrastructure challenges.

Overview

12 years of professional experience
5 years of post-secondary education
6 Certifications
4 Languages

Work History

Platform Engineer

X-Team
Chelsea - Victoria
09.2021 - Current

3rd project: a major pharmaceutical company

  • Goal: modernize the client's IT operations by migrating existing, disparate automation scripts and manual processes to a standardized, Ansible-driven automation framework. The initiative aimed to improve operational efficiency, reduce human error, and simplify management of the hybrid Windows and Linux server environment.
  • As a key member of the automation team, I was responsible for the analysis, design, development, and implementation of Ansible-based solutions to automate a wide range of IT operational tasks.
  • I meticulously analyzed existing automation scripts (PowerShell, Bash, and various internal tools) and manual workflows for server management, patching, compliance audits, and software deployments. I identified areas for improvement and designed streamlined, Ansible-based solutions.
  • I developed, tested, and deployed Ansible playbooks and workflows (YAML) to automate key operational tasks, including OS patching, compliance audits, software deployment, and observability integration.
  • Leveraged Ansible's API integration capabilities to connect with ServiceNow for change management and incident tracking. This ensured that all automated changes were properly documented and tracked within the company's existing IT service management processes.
  • Developed dynamic inventories in Ansible by pulling server information from external databases via API calls. This allowed for accurate and up-to-date inventory management in a constantly evolving environment.
  • Worked closely with cross-functional IT teams (server administrators, security engineers, and application developers) to validate workflows, gather requirements, and ensure seamless integration with existing systems. Conducted comprehensive knowledge transfer sessions and created detailed documentation to empower the internal IT team to manage and maintain the Ansible automation framework.
  • The project delivered significant improvements for the company, including increased operational efficiency, improved compliance and security, enhanced collaboration, and reduced operational costs.

2nd project: Internal project

  • As a member of X-Team's Platform team, I work to provide the best experience for current X-Teamers (co-workers) and future ones
  • That means removing blockers that hinder the staff's work and adding new features that improve the job experience
  • For example, failed SQS messages triggered no action until they expired
  • To solve it, I created a Terraform module to easily document and create CloudFormation metrics, alerts, and thresholds, and to send those alerts via a Slack bot to the desired groups
  • This fixed the issue and enforced compliance for new and replacement cloud resources
  • Another example: developers complained about having to switch VPNs when moving between the Dev and Prod environments
  • To address it, I created a VPC peering connection and concentrated access through a single VPN instance, controlling traffic via iptables and segregating the network groups (ACLs)
  • Additionally, I resolved a long-standing request to route non-VPC traffic outside the VPN, improving browsing speed, reducing AWS traffic costs, and allowing developers to stay connected for extended periods
  • I also automated VPN account management via a Slack bot function
  • I also increased staff autonomy by extending the bot with self-service credential resets (AWS and VPN) and a set of user-account reports, freeing engineers to focus on value-delivery tasks
  • Combined, these improvements reduced toil, both by eliminating ops tickets and by increasing delivery quality and speed
  • Stack tools: AWS (CodeBuild, CodePipeline, API Gateway, CloudFormation, CloudWatch, SQS/SNS, Lambda, SSM, VPC), Terraform/Terragrunt, Go (Terratest), Serverless (JavaScript), Slack (API/Bots/Modals)
  • I have successfully undertaken several impactful projects that significantly enhanced the company's infrastructure and operations
  • These experiences showcase my ability to tackle complex challenges and deliver innovative solutions

1st project: RStudio / Posit.co

  • Package Manager Migration: I spearheaded the migration of the company's critical and highly lucrative application, the Package Manager, from EC2 instances to microservices
  • This migration involved handling millions of requests per day
  • Leveraging my expertise in AWS managed services such as EKS, S3, RDS, ALB, EFS, and ECR, I orchestrated the migration using Pulumi's Go SDK
  • Additionally, I integrated the deployment process with GitHub Actions and FluxCD, enabling developers to make changes independently without requiring interactions with the operations team
  • This initiative resulted in improved scalability, reliability, and streamlined development processes
  • Binary Build Service Migration: In another significant project, I successfully migrated the Binary Build service from Google Cloud (GKE) to AWS (EKS)
  • To optimize performance and resource utilization, I implemented a highly elastic cluster with multiple node groups, including Linux, Linux ARM, Win2019, Win2022, Core2019, and Core2022 instances
  • Leveraging autoscaling capabilities, the cluster dynamically scaled from 1 to 600 nodes within seconds, efficiently blending on-demand and spot instances
  • This migration enabled improved resource allocation, cost optimization, and enhanced overall performance
  • Observability and Efficiency: I emphasized the importance of observability using Datadog
  • By harnessing the power of observability insights, I facilitated proactive monitoring, efficient troubleshooting, and streamlined maintenance
  • This approach resulted in a significant reduction in toil, enabling the team to focus more on developing new features and driving innovation
  • Stack tools: Amazon EKS, AWS Cloud, Go (Programming Language), FluxCD, GitHub Actions, Pulumi, Platform engineering / DevOps.

DevOps Lead

Banfico
London
04.2021 - 09.2021
  • DevOps Leadership and Team Management: Led and organized the DevOps team, including facilitating Scrum routines (planning, daily stand-ups, reviews, and retrospectives). Provided comprehensive support to the team in all critical activities, from complex solution development to troubleshooting.
  • Deployment Automation Optimization: Designed and implemented improvements to the central system deployment automation, reducing deployment time from 2 hours to 30 minutes, with further optimizations planned to achieve a 5-minute deployment target.
  • Data Synchronization Architecture: Designed and implemented an architecture plan to ensure seamless data synchronization between the São Paulo and London zones, enabling global operational efficiency.
  • Sales and Customer Support: Provided technical expertise and support to the sales team, assisting with customer interactions and addressing technical questions during meetings with financial institutions.
  • Open Banking Solutions Delivery: Played a key role in architecting, planning, and implementing DevOps best practices to support the delivery of Open Banking solutions as a service.
  • Technical Leadership and Mentorship: As a technical lead, guided and mentored the team to achieve optimal results, prioritizing code stability, reliability, agility, and supportability. Worked with Java and Node.js applications deployed on AWS and Azure, using OpenShift (Kubernetes) for container orchestration, Ansible for automation, and GitLab for CI/CD pipelines.

DevOps Engineer

BairesDev
San Francisco, California
03.2021 - 04.2021

Peptilogics - Early-Stage Product Development: Contributed to the design and implementation of DevOps practices for Peptilogics' early-stage product development. This involved:

  • Designing the Kanban board to facilitate workflow visualization and management.
  • Assisting in the design of the DevOps architecture to ensure efficient software delivery.
  • Providing support to the Scrum Master and development team in establishing agile routines and processes.
  • Developing AWS architecture solutions to support the application's infrastructure needs.
  • Removing roadblocks for the development team, granting necessary AWS permissions, and creating S3 buckets with appropriate policies, backups, and cost optimization strategies.
  • Orchestrating ECS deployments using Terraform for infrastructure provisioning and Ansible for application deployment.
  • Developing a Python integration with S3 buckets to support application functionality.

DevOps Lead

Dell
São Paulo - São Paulo
09.2017 - 03.2021
  • Carrefour Digital Transformation: Led a successful digital transformation project at Carrefour, implementing DevOps practices to streamline software delivery and enhance operational efficiency. Developed and implemented Ansible playbooks to automate container deployments within an OpenShift (Kubernetes) environment, significantly accelerating deployment speed and consistency. Further automated VM deployments in vRealize Automation (vRA) using pre-built Packer ISO templates, optimizing infrastructure provisioning.
  • Vivo/Telefonica Automation and DevOps Enablement: Drove automation initiatives at Vivo/Telefonica, ranging from simple tasks like DNS management and file parsing to complex solutions such as automating Oracle GRID cluster installations with Ansible. Fostered a DevOps culture by introducing and implementing automation tools (Terraform, Ansible), enhancing observability, establishing CI/CD pipelines, and guiding the team through Kubernetes adoption. The Oracle GRID automation reduced the work time from 40 man-hours to just 2 machine-hours, resulting in a 95% reduction in deployment time and significant cost savings.
  • Getnet DevOps Implementation: Provided comprehensive support for Getnet's adoption of DevOps practices, focusing on automation, monitoring, CI/CD, and cultural change. Streamlined processes, implemented efficient monitoring mechanisms, and built CI/CD pipelines to optimize software delivery. Played a pivotal role in driving cultural shifts to embrace collaboration and DevOps principles.

Senior DevOps Engineer

B3 (Brazil Stock Exchange)
São Paulo - São Paulo
04.2016 - 09.2017
  • DevOps Implementation and JBoss Modernization: Established a robust DevOps framework using Docker, streamlining development and deployment processes. Led the successful upgrade of internal systems from JBoss 5 to JBoss 6, enhancing performance and stability.
  • Technical Leadership and Incident Response: Played a key role in making critical technology decisions and managing daily operational tasks, including responding to critical incidents, implementing ITIL-based change management processes, and maintaining comprehensive documentation.
  • Automation and Kubernetes Adoption: Automated repetitive processes using Ansible and Terraform, significantly improving operational efficiency. Spearheaded the deployment of the company's first Kubernetes cluster, laying the foundation for containerized applications and microservices architecture.

DevOps Consultant

Santander Bank
São Paulo - São Paulo
06.2015 - 04.2016
  • Automated Critical Infrastructure: Spearheaded the automation of mission-critical Linux environments (Red Hat, Solaris) and VMware virtualization infrastructure (1000+ VMs) using Terraform and Ansible, significantly improving provisioning speed and consistency.
  • Strategic Infrastructure Influence: Actively participated in strategic decision-making for key infrastructure projects, contributing expertise to ensure optimal solutions and alignment with business goals.
  • VMware vCenter Modernization: Led the successful modernization of the VMware vCenter environment, including upgrading 1000+ virtual machines, implementing vCOPs monitoring, configuring Disaster Recovery (SRM), and restructuring critical LDAP services.
  • Cross-Functional Leadership and Vendor Collaboration: Collaborated effectively with project managers and senior leadership to ensure project visibility and success. Worked directly with major technology vendors to integrate and optimize solutions within the infrastructure.
  • LDAP Environment Optimization: Analyzed, planned, and implemented performance enhancements for a complex, high-availability LDAP environment consisting of 30+ servers across multiple network segments. Achieved a remarkable 954% improvement in write performance through strategic optimization and tuning.
  • Performance Tuning and Problem Resolution: Successfully addressed chronic performance bottlenecks and replication delays within the LDAP environment, significantly enhancing stability and reliability. Minimized points of failure to reduce the impact of external system issues on LDAP services.
  • Resource Optimization: Through meticulous tuning and optimization efforts, reduced required compute resources by 50%, resulting in significant cost savings and improved operational efficiency. Decreased the monthly incident rate from 7.5 to 0.5, demonstrating a dramatic improvement in system stability.

Senior DevOps Engineer

Cast Informática
São Paulo - São Paulo
11.2012 - 06.2015
  • Automated Critical Infrastructure: Spearheaded the automation of mission-critical Linux environments (Red Hat, Solaris) and VMware virtualization infrastructure (1000+ VMs) using Terraform and Ansible, significantly improving provisioning speed and consistency.
  • Strategic Infrastructure Influence: Actively participated in strategic decision-making for key infrastructure projects, contributing expertise to ensure optimal solutions and alignment with business goals.
  • VMware vCenter Modernization: Led the successful modernization of the VMware vCenter environment, including upgrading 1000+ virtual machines, implementing vCOPs monitoring, configuring Disaster Recovery (SRM), and restructuring critical LDAP services, using Ansible playbooks. This resulted in improved uptime, reduced operational overhead and enhanced disaster recovery capabilities.
  • Cross-Functional Leadership and Vendor Collaboration: Collaborated effectively with project managers and senior leadership to ensure project visibility and success. Worked directly with major technology vendors (HP, Oracle, Sun, Cisco, EMC, and Fujitsu) to integrate and optimize solutions within the infrastructure.

Education

Master of Science - Computer Network Management

Faculdade De Informática E Administração Paulista (FIAP)
São Paulo, São Paulo, Brazil
01.2004 - 12.2005

Bachelor of Science - Data Processing Technology

Faculdade De Informática E Administração Paulista
São Paulo, São Paulo, Brazil
01.2001 - 12.2003

Skills

Infrastructure automation

Configuration management

Software development

Technology integration

DevOps principles

Continuous integration systems

Problem-solving abilities

Ansible

Certification

Red Hat Certified System Administrator (RHCE) - Red Hat, Inc.

Timeline

Platform Engineer

X-Team
09.2021 - Current

DevOps Lead

Banfico
04.2021 - 09.2021

DevOps Engineer

BairesDev
03.2021 - 04.2021

GCP - Google Certified Professional Architect

01.2021

Certified ScrumMaster (CSM) - Scrum Alliance

01.2021

Red Hat Certified Specialist in OpenShift Developer - Red Hat, Inc.

01.2020

Red Hat Certified Specialist in OpenShift Administration - Red Hat, Inc.

01.2020

Red Hat Certified Specialist in Ansible Automation - Red Hat, Inc.

01.2018

DevOps Lead

Dell
09.2017 - 03.2021

Senior DevOps Engineer

B3 (Brazil Stock Exchange)
04.2016 - 09.2017

DevOps Consultant

Santander Bank
06.2015 - 04.2016

Red Hat Certified System Administrator (RHCE) - Red Hat, Inc.

01.2015

Senior DevOps Engineer

Cast Informática
11.2012 - 06.2015

Master of Science - Computer Network Management

Faculdade De Informática E Administração Paulista (FIAP)
01.2004 - 12.2005

Bachelor of Science - Data Processing Technology

Faculdade De Informática E Administração Paulista
01.2001 - 12.2003