Machine Learning Engineering Manager – LLM Serving, Infrastructure
• Lead a high-performing engineering team to design, build, and deploy high-scale, low-latency LLM serving infrastructure.
• Drive the implementation of a unified serving layer that supports multiple LLM models and inference types (batch, offline evaluation flows, and real-time/streaming).
• Lead all aspects of development of the Model Registry for deploying, versioning, and running LLMs across production environments.
• Ensure successful integration with the core Personalization and Recommendation systems to deliver LLM-powered features.
• Define and champion standardized technical interfaces and protocols for efficient model deployment and scaling.
• Establish and monitor the serving infrastructure's performance, cost, and reliability, including load balancing, autoscaling, and failure recovery.
• Collaborate closely with data science, machine learning research, and feature teams (Autoplay, Home, Search, etc.) to drive active adoption of the serving infrastructure.
• Scale the serving architecture to handle hundreds of millions of users and high-volume inference requests for internal domain-specific LLMs.
• Drive Latency and Cost Optimization: partner with SRE and ML teams to apply techniques such as quantization, pruning, and efficient batching to minimize serving latency and cloud compute costs.
• Develop Observability and Monitoring: build dashboards and alerting for service health, tracing, A/B test traffic, and latency trends to ensure adherence to defined SLAs.
• Contribute to Core LPM Serving: own the technical strategy for deploying and maintaining the core Large Personalization Model (LPM).