AI Software Engineer (Platform Software)

Remote, USA Full-time
About the Job
• FuriosaAI is looking for passionate AI Software Engineers to join our Platform Team. You will participate in the research and development of models optimized for our NPU accelerator.
• Our team builds the production-grade, streamlined AI software that makes up our SDK. This includes the runtime, LLM serving framework, and PyTorch models/extensions.
• Your work on these critical parts of the SDK will directly enable AI developers to efficiently deploy optimized AI models on FuriosaAI NPUs.

Responsibilities
• Develop and optimize DNN model implementations in PyTorch for FuriosaAI's Tensor Contraction Processor (TCP) architecture
• Analyze the features, implementations, and CUDA and Triton kernels of existing AI model inference frameworks such as vLLM, TensorRT-LLM, and DeepSpeed-MII
• Research and implement generative AI models, parallelism strategies, and inference techniques to improve performance and efficiency
• Collaborate closely with the compiler team to optimize and enable models

Minimum Qualifications
• BS degree in Computer Science, Engineering, or a related field, or equivalent industry experience
• Proficiency in Python programming
• Experience developing AI models in DNN frameworks (e.g., PyTorch)
• Solid understanding of machine learning, deep learning, natural language processing (NLP), and/or generative AI models
• Strong communication skills with the ability to collaborate effectively across cross-functional teams

Preferred Qualifications
• Hands-on experience with PyTorch 2.0 technologies (e.g., TorchDynamo) or DNN compiler technologies, such as Triton and MLIR
• Proficiency in C++/CUDA or Rust programming
• Hands-on experience deploying and optimizing large-scale ML models in production
• Hands-on experience in model training and fine-tuning of pre-trained models
• Experience with LLM inference frameworks: vLLM, TensorRT-LLM, and DeepSpeed-MII
• Strong background in model quantization and model evaluation
• Strong background in machine learning, generative AI, and model evaluation techniques
• Proven track record of contributing to open-source projects

Contact
• [email protected]
