[Remote] Applied AI Inference Engineer
Note: The job is a remote job and is open to candidates in USA. Baseten powers mission-critical inference for leading AI companies and is seeking an Applied AI Inference Engineer. In this role, you will partner with customers to architect, build, and deploy high-scale production AI applications, driving impact throughout the customer journey from initial exploration to production deployment. Responsibilities • Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects • Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey including: sales, implementation, and expansion • Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers • Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs • Own products and customer projects end-to-end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution • Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity • Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates Skills • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field • 1+ years of professional work experience in a fast-paced, high-growth environment • Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python • Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment • Strong communication skills, particularly on complex technical topics • Experience in building or optimizing AI/ML projects is highly valued Benefits • Competitive compensation, including meaningful equity. • 100% coverage of medical, dental, and vision insurance for employee and dependents • Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!) • Paid parental leave • Company-facilitated 401(k) • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities. Company Overview • Baseten is an AI infrastructure company that integrates machine learning into business operations, production, and processes. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees. Its website is Company H1B Sponsorship • Baseten has a track record of offering H1B sponsorships, with 6 in 2025, 8 in 2024, 1 in 2023, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role. Apply tot his job