Remote Site Reliability Engineer

Remote, USA Full-time
We are seeking a Site Reliability Engineer (SRE) to support the reliability, observability, and performance of our backend data platform. This platform ingests high-volume data from hotel systems, ticketing, and MagicBand readers, flowing through custom pipelines into our data warehouse. The ideal candidate will have a strong background in Python, cloud-native technologies, and observability tools, with a focus on ensuring data integrity and system reliability across multiple touchpoints. Responsibilities: Monitor and maintain a custom data pipeline from ingestion to delivery, ensuring data integrity and performance. Instrument and observe systems using cloud serverless technologies, including: - AWS Lambda - Amazon S3 - Amazon Kinesis - Snowflake - Docker containers on ECS Migrate observability workflows from AWS CloudWatch to Datadog, centralizing metrics, dashboards, and alerts. Build and tune Datadog dashboards and alerts to support SLAs and system health. Graph and analyze metrics to ensure pipeline reliability and performance. Investigate and resolve issues in the pipeline, ensuring expected behavior across all stages. Work within the Python codebase (~2040% of time) to: - Create coherent tickets for issues - Fix bugs and improve instrumentation Perform click-ops tasks (~6080% of time) in Datadog, including: - Dashboard creation and maintenance - Access request handling - Alert tuning and incident response We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal. com. To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: Experience with Snowflake and data warehouse integrations. Strong proficiency in Python, especially in backend and infrastructure contexts. Experience with AWS services (Lambda, S3, Kinesis, CloudWatch). Familiarity with Datadog for monitoring, alerting, and dashboarding. Understanding of data pipelines, data integrity, and observability best practices. Experience with Docker and ECS in production environments. Familiarity with infrastructure as code (e. g., Terraform, CloudFormation). Exposure to SLAs, incident response, and data reliability engineering. Apply tot his job
Apply Now

Similar Jobs

Senior Site Reliability Engineer | G Federal Reserve Bank of Chicago | Remote (United States)

Remote, USA Full-time

Site Reliability Engineer, Eng Support - USDS

Remote, USA Full-time

Site Reliability Engineer—Data Platform

Remote, USA Full-time

Lead Site Reliability Engineer, Observability - Remote

Remote, USA Full-time

[Remote] Software Engineer - Customer Experience Engineering

Remote, USA Full-time

Snowflake Data Engineer – Remote

Remote, USA Full-time

[Hiring] EY Parthenon Strategy Senior / Manager – Smart Cities @EY

Remote, USA Full-time

Hiring Now: Senior Snowflake Database Engineer - Delta Dental of

Remote, USA Full-time

Social Media Evaluator – Remote Online Work

Remote, USA Full-time

Part-Time Virtual Assistant (Social Media & Content Management)

Remote, USA Full-time

Experienced Remote Software Engineer for Ads Data Clean Rooms – Data Privacy, Analytics, and Advertising Technology Expert

Remote, USA Full-time

Experienced Data Entry Specialist for Amazon - Flexible Day & Night Shifts with Competitive Hourly Rates

Remote, USA Full-time

Experienced Part-Time Remote Data Entry Clerk – Entry-Level Opportunity with Comprehensive Paid Training at blithequark

Remote, USA Full-time

Software Engineer Intern, Front-End, PMS

Remote, USA Full-time

Appointment Setter

Remote, USA Full-time

Data Analyst - Labeling - Fully Remote

Remote, USA Full-time

**Experienced Remote Customer Support Associate – Deliver Exceptional Customer Experiences with arenaflex**

Remote, USA Full-time

**Experienced Customer Service Representative – Fredericksburg, VA Office**

Remote, USA Full-time

Math Instructor / Tutor - El Cajon, CA - Join Our Team and Empower Students to Master Mathematics

Remote, USA Full-time

Lead Training Specialist, IT - Remote

Remote, USA Full-time
Back to Home