Senior Site Reliability Engineer (SRE)

Remote, USA Full-time
LOCATION : LATAM, ERUOPE CloudDevs works with fast-moving, venture-backed startups across the US. We’re building a pool of world-class Site Reliability Engineers for current roles and for upcoming opportunities. You will either be placed directly into one of our partner startups or added to our vetted SRE network for future projects. This role is ideal for engineers who care about reliability, metrics, performance, and building simple, scalable systems. If you enjoy designing for scale and improving how teams ship software, you’ll fit right in. Key Responsibilities Work as a hands-on engineer focused on system reliability, performance, and observability. Define and track SLIs, SLOs, and error budgets. Optimize monitoring cost and signal quality across metrics, logs, and traces. Improve deployment safety, canary rollouts, and UAT pipelines. Build tools for automated and local performance testing and track benchmarks. Lead resilience work like failover drills, chaos tests, and redundancy checks. Partner with engineering teams to improve scaling patterns and architecture as the product grows. Support incident response processes and help reduce operational noise. Write clean, maintainable code in Go, Python, or Node.js. Contribute to CI/CD improvements and automation efforts. Collaborate with engineers across teams to raise reliability standards. Requirements 5+ years in SRE, DevOps, or Platform Engineering roles. Strong experience with cloud infrastructure (AWS preferred), Terraform, and Kubernetes. Deep knowledge of observability tools like DataDog, Prometheus, or OpenTelemetry. Strong debugging skills across services, networking, and data layers. Hands-on experience designing and monitoring SLIs/SLOs. Experience with CI/CD tools such as GitHub Actions, Jenkins, or ArgoCD. Ability to write production-grade code in Go, Python, or Node.js. Comfort working independently in fast-paced environments. Nice to Have Experience tuning observability costs and optimizing data ingestion. Exposure to chaos engineering and progressive deployments. Background with high-throughput or latency-sensitive systems. AWS at scale (EKS, Lambda, DynamoDB, S3). Experience in regulated industries like fintech, payments, or SOC2 environments. Performance testing pipelines or load-testing automation. Experience handling systems processing tens of millions of API calls. Open Pool for SREs Even if you don’t meet every requirement or aren’t a fit for the current role, strong SREs with real production experience are welcome to join our talent pool. We regularly place engineers with different strengths across reliability, DevOps, platform, observability, backend, and infrastructure engineering. Apply tot his job
Apply Now

Similar Jobs

[Remote] Site Reliability Engineer (Contract outside of IR35)

Remote, USA Full-time

Site Reliability Engineer - SRE

Remote, USA Full-time

IoT/ESP32 Smart City Specialist

Remote, USA Full-time

Data Engineer ? (Oracle / Snowflake / Informatica)

Remote, USA Full-time

Senior Solution Engineer

Remote, USA Full-time

Social Media Coordinator - Fully Remote

Remote, USA Full-time

[Remote] Senior Media Analyst, Paid Social

Remote, USA Full-time

Jr. Social Media Ads and Analytics Specialist

Remote, USA Full-time

Social Media & Content Manager, Remote Job

Remote, USA Full-time

Social Analyst, Brand Engagement job at Clearlink in Draper, UT

Remote, USA Full-time

ECommerce Analyst, Quill Lincolnshire, IL

Remote, USA Full-time

**Experienced Remote Call Center Specialist – Customer Service Representative for Walgreens**

Remote, USA Full-time

Entry Level Auto Claims Adjuster | Colorado Springs, CO, USA

Remote, USA Full-time

Remote, Contract-based Florida Family Law Paralegal Opportunity

Remote, USA Full-time

Full Stack Developer job at Token Metrics in Austin, TX

Remote, USA Full-time

**Experienced Customer Support Representative - Remote Opportunity at blithequark**

Remote, USA Full-time

Experienced Remote Data Entry Specialist – Full-Time Opportunity for Accurate and Detail-Oriented Individuals at arenaflex

Remote, USA Full-time

[Remote] VP Oncology Clinical Solutions

Remote, USA Full-time

**Experienced Full Stack Software Engineer – Web & Cloud Application Development with Disney Encounters at blithequark**

Remote, USA Full-time

**Experienced Data Entry Customer Care Specialist – Remote Opportunity at arenaflex**

Remote, USA Full-time
Back to Home