Software Engineer, Inference AI/ML

Remote, USA Full-time
CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost.

Responsibilities
Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve)
Write tests, code comments, and short design docs; participate in code reviews
Add basic metrics and dashboards; assist with alarms and runbooks
Follow on-call runbooks and learn incident response in a guided rotation
Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance

Skills
BS/MS in CS, EE, or related field, or equivalent practical experience
Foundations in data structures, algorithms, and networked services
Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics
Exposure to containers and Kubernetes (coursework or projects welcome)
Curiosity about GPU inference concepts (micro-batching, KV cache, streaming)
Internship or project that deployed a microservice or ML inference demo
Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus
Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling

Benefits
Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company Overview
CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads. It was founded in 2017, and is headquartered in Livingston, New Jersey, USA, with a workforce of 1001-5000 employees. Its website is
