Site Reliability Engineer (GKE + GCP)

Remote, USA Full-time
Reponsibilities: • Work on a team of extremely talented platform engineers to help maintain and scale the current and future state services platform. • Help architect and develop the future state compute platform by leveraging industry best practices as well as embracing new technologies to support the future growth as a business • Help influence the product roadmaps of GCP (our primary cloud provider) to better suit our future state architecture • Work collaboratively with business and technical stakeholders to develop and architect enhancements to the compute platform capabilities that enable them to develop and iterate applications to power the business • Identify opportunities to introduce automation, improvements to avoid repetitive operational tasks (DRY) • Participate in the on-call rotation to ensure operational excellence and overall platform health Requirements: • 5+ years of experience in platform engineering/SRE roles using an object oriented language (Python, Golang, etc) • Bachelor’s degree in Computer Science, Computer Engineering or equivalent combination of education and experience • Extensive experience working with Kubernetes in a public cloud (GKE, EKS, AKS, etc) • Experience working with Istio/Service Mesh • Experience working with IaC (Terraform, Pulumi, etc) • Experience working within a Public Cloud environment (GCP, AWS, Azure, etc) • Experience working with CI/CD tools such as Argo, Buildkite, TravisCI, Jenkins, Spinnaker, etc • Experience working with platform observability tools (Prometheus, Thanos, Grafana, Fluentbit, Cloud Monitoring, Google Cloud Logging, Datadog, Pagerduty, Cloudwatch, Kibana, Elastic Search, Splunk, VictorOps, etc) • Experience with Networking • Experience and desire to work in an agile environment • Analytical mindset and passion for solving business problems with technology Nice To Haves: • Experience working with Dev Testing tools and patterns such as Garden, Flagger, Canary Deployments, Blue/Green Testing, A/B Testing • Experience setting up and working with Kubernetes Admission Control (Kyverno, OPA, etc) • Experience working with workload scaling (HPA, VPA, Capacity Planning/Reservations, etc) Apply tot his job
Apply Now

Similar Jobs

[Remote] Intermediate Site Reliability Engineer, Environment Automation

Remote, USA Full-time

Site Reliability Engineer, Sr. Consultant level

Remote, USA Full-time

Senior Site Reliability Engineer

Remote, USA Full-time

[Remote] Senior Data Engineer - Snowflake (Remote)

Remote, USA Full-time

Snowflake Data Engineer

Remote, USA Full-time

Manager, Solutions Engineering

Remote, USA Full-time

Snowflake Data Engineer

Remote, USA Full-time

DTICI Snowflake Data engineer T8 @ Daimler Truck

Remote, USA Full-time

[Remote] Lead Cloud Data Engineer (Azure + Snowflake)

Remote, USA Full-time

Snowflake Data Engineer - Hybrid in Seattle

Remote, USA Full-time

Experienced Student Recruitment Agent for Online Mandarin School (Remote) in Los Angeles, CA - Career Growth Opportunity in Education Industry

Remote, USA Full-time

Accounts Payable Specialist – Sonsray Inc. – Torrance, CA

Remote, USA Full-time

Fractional COO & Coach

Remote, USA Full-time

**Experienced Remote Data Entry Specialist – Flexible Work Arrangement for arenaflex**

Remote, USA Full-time

Service Desk Representative – Call Center Based

Remote, USA Full-time

Experienced Remote Customer Support Specialist – Delivering Exceptional Service for blithequark Products from the Comfort of Your Home

Remote, USA Full-time

BI Analyst / BI Developer / BI Reporting Analyst

Remote, USA Full-time

Freestyle Ski and Snowboarding Coach – Amazon Store

Remote, USA Full-time

**Experienced Full Stack Customer Support Specialist – Delivering Exceptional Experiences in a Fully Remote Environment**

Remote, USA Full-time

Client Services & Customer Support - Inwood, NY

Remote, USA Full-time
Back to Home