[Remote] Site Reliability Developer 2
Note: The job is a remote job and is open to candidates in USA. Oracle is a world leader in cloud solutions, seeking a Site Reliability Engineer to support daily operations for a secure, large-scale OCI-based cloud environment. The role focuses on maintaining infrastructure, implementing improvements, and ensuring operational health under the guidance of senior engineers. Responsibilities Perform routine operational tasks such as deployments, patching, fleet maintenance, and basic troubleshooting for cloud-based systems Tune team-specific alarms and thresholds, escalate incidents appropriately, and support the management of metrics, KPIs, and system health dashboards Participate in incident response by quickly triaging and escalating incidents, executing operational playbooks, and documenting issues for senior review. You will follow established procedures under supervision and contribute to root-cause analysis by gathering data and providing initial troubleshooting support Serve as a technical support point of contact, troubleshooting and resolving technical issues, assisting customers with environment setup and debugging, and providing timely communication and status updates to customers and internal teams Own, maintain, and improve runbooks to ensure consistency and clarity for operational processes Implement defined enhancements to existing tools, documentation, and monitoring solutions Collaborate closely with other team members and escalate complex issues for further investigation and resolution Participate in on-call rotations with support from senior engineers, ensuring continuity of coverage and timely response Ensure compliance with all security, operational, and documentation standards Skills U.S. Citizenship and possess and maintains TS/SCI w/Poly security clearance Hands-on experience with Linux systems administration Scripting ability with Python or Bash Understanding of basic cloud concepts (networking, compute, identity, observability) Strong problem-solving skills and willingness to learn complex systems Ability to work collaboratively with technical teams and communicate effectively Exposure to Oracle Cloud Infrastructure (OCI) or other major cloud platforms Familiarity with Infrastructure-as-Code tools such as Terraform or Ansible Experience supporting production systems or participating in on-call rotations Understanding of security best practices within classified environments Benefits Medical, dental, and vision insurance, including expert medical opinion Short term disability and long term disability Life insurance and AD&D Supplemental life insurance (Employee/Spouse/Child) Health care and dependent care Flexible Spending Accounts Pre-tax commuter and parking benefits 401(k) Savings and Investment Plan with company match Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation. 11 paid holidays Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours. Paid parental leave Adoption assistance Employee Stock Purchase Plan Financial planning and group legal Voluntary benefits including auto, homeowner and pet insurance Company Overview Oracle is an integrated cloud application and platform services that sells a range of enterprise information technology solutions. It was founded in 1977, and is headquartered in Austin, Texas, USA, with a workforce of 10001+ employees. Its website is