[Remote] Intermediate Site Reliability Engineer, Environment Automation

Remote, USA Full-time
Note: The job is a remote job and is open to candidates in USA. GitLab is an open-core software company that develops a comprehensive AI-powered DevSecOps Platform. The Site Reliability Engineer will focus on operating and automating hundreds of GitLab environments, ensuring they remain secure, consistent, and reliable at scale while debugging production issues and contributing to infrastructure automation. Responsibilities • Support Environment Automation at Scale: Contribute to automating the provisioning, configuration, and management of GitLab environments using Terraform, Ansible, and Kubernetes. Follow best practices to support infrastructure across many tenants with guidance from senior team members. • Assist in Debugging Production Issues: Investigate and troubleshoot issues in Kubernetes clusters and GitLab services. Help resolve common problems such as failed deployments, pod crashes, and scheduling conflicts using tools like kubectl. • Contribute to IaC and CI/CD Workflows: Write and maintain Terraform modules and scripts to automate routine operations. Participate in improving CI/CD pipelines for safe and repeatable infrastructure changes. • Participate in Monitoring and Maintenance: Help monitor environment health using tools like Prometheus, ELK, and Grafana. Assist in improving observability and capacity tracking for tenant environments. • Respond to Incidents and Alerts: Take part in the incident response process, helping triage alerts, document issues, and support resolution efforts under the guidance of senior engineers. • Collaborate Across Teams: Work with Infrastructure and Development teams to contribute to solutions that improve platform reliability and operational efficiency. Skills • Experience with Infrastructure as Code: Familiarity with Terraform and Ansible to manage cloud infrastructure. Able to work with modules and understand the basics of state and variable use. • Kubernetes Fundamentals: Experience using kubectl, Helm, or Kustomize to interact with Kubernetes clusters. Understands core concepts such as pods, deployments, and rollouts. • Basic Programming Skills: Able to read and modify infrastructure tooling written in Go, Ruby, or similar languages. • Exposure to Multi-Environment Operations: Experience working with multiple environments or customer setups, even if not at full scale. Understands the challenges of managing consistency and isolation. • Monitoring and Troubleshooting Skills: Familiar with basic observability tools and logs. Can identify service issues using dashboards or metrics and escalate appropriately. • Collaborative Mindset: Works well in cross-functional teams. Eager to learn from others, share knowledge, and contribute to team success. • On-Call Experience: Has participated in on-call rotations for production systems and is comfortable responding to alerts, triaging incidents, and collaborating during recovery efforts. Benefits • Flexible Paid Time Off • Team Member Resource Groups • Equity Compensation & Employee Stock Purchase Plan • Growth and Development Fund • Parental leave • Home office support Company Overview • GitLab is a web-based Git repository manager that offers a variety of features for software development teams. It was founded in 2014, and is headquartered in San Francisco, California, USA, with a workforce of 1001-5000 employees. Its website is Apply tot his job
Apply Now

Similar Jobs

Site Reliability Engineer, Sr. Consultant level

Remote, USA Full-time

Senior Site Reliability Engineer

Remote, USA Full-time

[Remote] Senior Data Engineer - Snowflake (Remote)

Remote, USA Full-time

Snowflake Data Engineer

Remote, USA Full-time

Manager, Solutions Engineering

Remote, USA Full-time

Snowflake Data Engineer

Remote, USA Full-time

DTICI Snowflake Data engineer T8 @ Daimler Truck

Remote, USA Full-time

[Remote] Lead Cloud Data Engineer (Azure + Snowflake)

Remote, USA Full-time

Snowflake Data Engineer - Hybrid in Seattle

Remote, USA Full-time

Principal Data Engineer (Snowflake)

Remote, USA Full-time

Maintenance Technician - Emerald Groves

Remote, USA Full-time

Nurse - Perioperative

Remote, USA Full-time

Product Manager – Neo (PingOne Verify + PingOne Credentials)

Remote, USA Full-time

Earn Extra Cash:Delta Airlines Flight Attendant(Taylorsville)

Remote, USA Full-time

**Experienced Full Stack Customer Support Specialist – Live Chat & Remote Work Opportunities**

Remote, USA Full-time

Surgery Scheduler job at HCA - Hospital Corporation of America in Nashville, TN

Remote, USA Full-time

Audiologist - Cape Cod

Remote, USA Full-time

**Experienced Live Chat Assistant – Delivering Exceptional Customer Service in a Dynamic Remote Environment at blithequark**

Remote, USA Full-time

(CVS HEALTH CAREER) Remote Customer Service Rep – WFH

Remote, USA Full-time

Flexible Remote Junior Content & Support Associate – Engaging Online Opportunities for 15‑Year‑Olds (Work‑From‑Home, Surveys, Social Media, Customer Care)

Remote, USA Full-time
Back to Home