Senior Site Reliability Engineer (SRE)

Remote, USA | Full-time
We’re a globally distributed team building a secure, standards-based OAuth/OIDC engine used by global businesses, digital banks, and regulated industries. Our API-first approach enables organizations to implement OAuth 2.0 and OpenID Connect with ease.

We’re hiring a Senior Site Reliability Engineer (SRE) to improve the reliability, scalability, and performance of our platform and cloud offerings. This role is hands-on and focuses primarily on software engineering to solve reliability challenges across our stack. You will work closely with engineering teams to build tools, fix issues in production code, and keep our services running smoothly.

What You’ll Do

This is a hands-on engineering role. You’ll:

• Write and debug production code in Java, Go, or TypeScript, including fixes during incidents.
• Investigate application issues across test, staging, and production environments.
• Design, maintain, and optimize Kubernetes-based deployments across Shared Cloud, Dedicated Cloud, and Self-Managed deployment models.
• Develop and improve Helm charts as the standard deployment method across all supported environments.
• Manage and automate GitLab CI/CD pipelines, including container image packaging and release processes.
• Enhance monitoring, alerting, and observability using Google Cloud Monitoring, Prometheus, and Grafana.
• Review and improve cloud functions and internal tooling written in Go, Ruby, and Bash.
• Participate in on-call rotations to maintain uptime and respond rapidly to incidents.
• Lead post-incident reviews and drive long-term reliability improvements.
• Collaborate with Engineering and Support teams to diagnose customer issues and optimize service quality.

What We’re Looking For

• Strong hands-on software engineering background and ability to write high-quality code.
• Experience debugging distributed systems and operating Kubernetes in production (preferably on GKE).
• Deep understanding of Kubernetes networking, security, Helm charts, and storage management.
• Proficiency in one or more programming languages such as Java, Go, TypeScript, or Bash.
• Experience managing GitLab CI/CD pipelines and container image workflows.
• Ability to write PromQL alerting rules and interpret key reliability metrics.
• Familiarity with Redis, Liquibase, and TLS/mTLS certificate management.
• Experience with observability, incident management, and performance testing.
• Clear communication skills in English; Japanese language proficiency is a plus.
• Comfortable working independently in a distributed team across time zones.

Why Join Us

• Work closely with experienced engineers building a high-security, standards-compliant OAuth/OIDC engine.
• Solve complex reliability challenges across multi-cloud and self-managed environments.
• Be part of a lean global team where your contributions have direct product impact.
• Enjoy flexibility, autonomy, and the opportunity to shape infrastructure best practices.
• Competitive compensation, global collaboration, and meaningful technical challenges.