Senior Cloud Infrastructure Engineer

Remote, USA Full-time
About LanceDB LanceDB is a developer-friendly, open-source data lake for multimodal AI. From hyper-scalable vector search to advanced retrieval for RAG, from streaming training data to interactive exploration of large-scale AI datasets, LanceDB is the best foundation for your AI application, and powers some of the most groundbreaking applications and challenging requirements today. About the role We’re seeking a seasoned Cloud Infrastructure Engineer with deep expertise in automation, infrastructure-as-code (IaC), and cloud platform management. You’ll design, deploy, and maintain robust cloud environments while collaborating with cross-functional teams to streamline CI/CD pipelines, enhance system reliability, and drive operational excellence. As a Cloud Infrastructure Engineer at LanceDB, your responsibilities will include: • Design & Build Cloud Infrastructure: Architect and manage secure, scalable cloud environments (AWS, Azure, GCP) using IaC tools like Terraform and CloudFormation. • Automate Everything: Develop and maintain automation scripts to streamline deployments, monitoring, and system operations. • Systems Reliability: Implement monitoring/alerting solutions (Prometheus, Grafana, Datadog) to proactively address performance bottlenecks and ensure 99.9% uptime. • Security & Compliance: Enforce security policies, manage secrets (Vault, AWS KMS), and ensure compliance with industry standards (GDPR, SOC2). • Troubleshoot & Optimize: Resolve complex infrastructure issues and lead cost-optimization initiatives for cloud resources. • Collaborate & Mentor: Partner with software engineering teams to integrate DevOps practices into SDLC and mentor junior engineers on IaC and cloud best practices. Requirements • 10+ years in DevOps, Cloud Infrastructure, or SRE roles, with hands-on experience in public cloud platforms (AWS, Azure, GCP, Heroku). • Strong experience operating and supporting production distributed systems and/or databases-as-a-service in a public cloud service provider, where it was the primary product for the company. This excludes being a user of an cloud service provider's database such as RDS or BigQuery. Bonus points for experience crafting multitenant solutions. This is a hard requirement; applicants without this experience will not qualify for this role. • Experience designing and managing complex production environments using Kubernetes and Helm. This is a hard requirement; applicants without this experience will not qualify for this role. • Expertise in IaC tools (Puppet, Terraform, Ansible, CloudFormation) and configuration management. • Deep understanding of networking, security, and cloud architecture best practices. • Experience with monitoring tools (Prometheus, Grafana) and logging systems (ELK, Splunk). • Strong knowledge of CI/CD tools (GitHub Actions) and containerization (Docker, Kubernetes). • You like working with a small, high-caliber team with a lot of autonomy and drive, and you can iterate fast Nice to have • You’ve made substantial contributions to open-source projects (e.g., Puppet modules, Terraform providers). • You design and automate single-command deployments for complex, globally distributed systems to ensure consistency, reliability, and scalability across multi-cloud or hybrid environments. • You fearlessly challenge the status quo and dismiss mediocre engineering as unacceptable. • You have worked on distributed large-scale systems, with a good understanding of how to using tracing tools to identify bottlenecks. • Experience building large-scale semantic search and/or caching systems is especially relevant. Why Join Us You’ll join a world-class team of open-source builders (co-authors of pandas, and contributors to HDFS, Arrow, Iceberg, and HBase) working on cutting-edge AI infrastructure. You’ll collaborate on systems that power next-generation AI workloads while shaping how LanceDB operates and scales production environments. Apply tot his job
Apply Now

Similar Jobs

Senior Cloud Operations Engineer I

Remote, USA Full-time

[Remote] Principal Cloud Infrastructure Engineer, Engineering Excellence

Remote, USA Full-time

Urgent!! Azure Cloud Operations Engineer - TX/NC/GA/NY/MA/OH (Just 5 days Onsite in a Month)

Remote, USA Full-time

[Remote] AWS Cloud/Infrastructure Engineer/Architect

Remote, USA Full-time

Senior Cloud Infrastructure Engineer (OpenStack / Virtualization)

Remote, USA Full-time

Junior Cloud Infrastructure Engineer

Remote, USA Full-time

CloudOps Engineer (Pacific Time)

Remote, USA Full-time

[Remote] Senior Cloud Operations Engineer - Remote / US Citizens Only

Remote, USA Full-time

L1 Cloud Operations Engineer (remote South-East Europe)

Remote, USA Full-time

Staff Product Manager - Cloud Infrastructure

Remote, USA Full-time

Mediaberater / Vertriebsmitarbeiter Außendienst (m/w/d)

Remote, USA Full-time

**Experienced Live Chat Assistant - Call Center (Work from Home) at blithequark**

Remote, USA Full-time

Associate Operations Director - Remote - Nationwide

Remote, USA Full-time

Customer/l Billing Tester

Remote, USA Full-time

Jetblue careers

Remote, USA Full-time

[Remote] Customer Success Representative

Remote, USA Full-time

CULINARY CONSULTANT - REMOTE - WESTERN AND CENTRAL US

Remote, USA Full-time

Senior Technical Curriculum Developer - Workflows

Remote, USA Full-time

[Remote] Clinical Pharmacist (MN License Required)

Remote, USA Full-time

**Virtual Receptionist – Remote Customer Service and Administrative Support Specialist**

Remote, USA Full-time
Back to Home