Data Pipeline & AI Infrastructure Developer

Remote, USA Full-time
We're looking for an experienced machine learning and data engineer to build the systems that power our embodied AI research and production. In this role, you'll own the build-out of critical components of our data pipelines and compute infrastructure, ensuring our research team has reliable, high-performance platforms to train and deploy advanced robotics models. Data Pipelines You'll build and maintain large-scale data ingestion systems that capture multimodal robotics data (video, point clouds, proprioception, and action trajectories), handling the end-to-end flow from ingestion through transformation, quality assurance, and delivery to training systems. You'll ensure data reliability, versioning, and reproducibility across terabytes of embodied data while building observability and dataset management tooling. Your work directly determines the quality and scale of data our AI systems learn from. AI Cluster Infrastructure You'll architect and operate our training infrastructure—Kubernetes-based HPC clusters, GPU orchestration, distributed training, and model deployment—optimizing resource allocation, monitoring cluster health, and ensuring high availability. You'll build automation and tooling that makes research code production-ready, enables efficient multi-tenant experiments, and lets the team move fast. Your infrastructure enables breakthroughs in robotic intelligence. What you bring You're fluent in Python and comfortable with systems languages (C, C++, Rust, or Go). You have deep experience building data pipelines or infrastructure at scale. You know Kubernetes, distributed systems, and HPC environments well. You've worked with large-scale data storage, workflow orchestration, and compute resource management. You understand Linux systems, networking, and real-time constraints. You bridge the gap between research and production. You debug across layers and value reliability, observability, and clean abstractions. You're excited to work in a fast-moving environment where your infrastructure directly enables cutting-edge AI research and real-world robotic deployments. Apply tot his job
Apply Now

Similar Jobs

Software Engineer, Data Platform-Slack (Senior SWE/Staff SWE)

Remote, USA Full-time

Data Platform Support Engineer

Remote, USA Full-time

Analytics Platform Engineer Associate

Remote, USA Full-time

Senior Software Engineer (Data Platform)

Remote, USA Full-time

Senior/Staff Software Engineer, Data

Remote, USA Full-time

Data Platform Engineer

Remote, USA Full-time

Senior Privacy Analyst, FedRAMP

Remote, USA Full-time

Data Loss Prevention (DLP) Analyst

Remote, USA Full-time

Cyber Security Analyst @ Texas Remote in USA

Remote, USA Full-time

IT Security Analyst 3 - IS - Data Security - FT - Day - Remote SoCal

Remote, USA Full-time

Clinical Operations and Care Coordination Associate

Remote, USA Full-time

Junior Prompt Engineer; Remote

Remote, USA Full-time

**Experienced Entry-level Virtual Data Entry Clerk – Remote Opportunity for Career Growth and Development at blithequark**

Remote, USA Full-time

Specialist Direct Medical Programs College Counselor – Remote, Part-Time

Remote, USA Full-time

Technical Writer- Project Hire/Temporary Assignment

Remote, USA Full-time

**Experienced Customer Care Quality Manager B2C - Join the Energy Revolution at blithequark**

Remote, USA Full-time

Medical Scribe, Wage, Student Health and Wellness (Staff Wage)

Remote, USA Full-time

Delta Airlines Support Job On Phone, Email, Social Networking Sites Remote US

Remote, USA Full-time

Real Estate Agent — Grow Faster With SPACE’s Coaching-First Model

Remote, USA Full-time

Lead UX Writer

Remote, USA Full-time
Back to Home