100% Remote Golang Developer with DevOps/LLM exp. | W2 Consultant

Remote, USA Full-time
Hi,

I hope you are doing well. Please find the job description below and let me know if you are interested.

Position: 100% Remote Golang Developer with DevOps/LLM exp. || W2 Consultant
Location: Remote
Duration: 6-12 months
Visa: Only USC, GC

Job Description

We are looking for developers with general cloud services / distributed services experience, with LLM experience as a secondary skill. GPU experience is now low on the list of preferred skills.

Dedicated Inference Service

Required Skills

- Proficiency in Golang for building scalable and performant backend services.
- Deep experience building services on distributed systems in modern cloud environments (e.g., containerization with Kubernetes and Docker, infrastructure as code, CI/CD pipelines, APIs, authentication and authorization, data storage, deployment, logging, monitoring, and alerting).
- Experience working with Large Language Models (LLMs), particularly hosting them to run inference.
- Strong verbal and written communication skills. Your job will involve communicating with local and remote colleagues about technical subjects and writing detailed documentation.
- Experience building or using benchmarking tools for evaluating LLM inference across various model, engine, and GPU combinations.
- Familiarity with LLM performance metrics such as prefill throughput, decode throughput, TPOT, and TTFT.
- Experience with one or more inference engines, e.g., vLLM, SGLang, and Modular Max.
- Familiarity with one or more distributed inference serving frameworks, e.g., llm-d, NVIDIA Dynamo, and Ray Serve.
- Experience with AMD and NVIDIA GPUs, using software such as CUDA, ROCm, AITER, NCCL, and RCCL.
- Knowledge of distributed inference optimization techniques: tensor/data parallelism, KV cache optimizations, smart routing, etc.

What You'll Be Working On

- Develop and maintain an inference platform for serving large language models, optimized for the various GPU platforms they will run on.
- Work on complex AI and cloud engineering projects through the entire product development lifecycle (PDLC): ideation, product definition, experimentation, prototyping, development, testing, release, and operations.
- Build tooling and observability to monitor system health, and build auto-tuning capabilities.
- Build benchmarking frameworks that test model serving performance to guide system and infrastructure tuning efforts.
- Build native cross-platform inference support across NVIDIA and AMD GPUs for a variety of model architectures.
- Contribute to open source inference engines to make them perform better on DigitalOcean cloud.

Gaurav Gaur
DMS Vision, Inc.
4645 Avon Lane, Suite 210
Frisco, TX 75033
Apply Now

Similar Jobs

Senior Marketing Strategist

Remote, USA Full-time

Golang Developer /Freelance/ Remote/

Remote, USA Full-time

Paid Search Specialist (Google Ads) + $300 Sign-On Bonus!

Remote, USA Full-time

Google Ads Specialist | Remote | LATAM Only 83131

Remote, USA Full-time

Paid Ads Specialist (Meta + Google) – Strategic, High-Performance, Startup Role

Remote, USA Full-time

PART-TIME EVENT ADMINISTRATIVE ASSISTANT - Remote

Remote, USA Full-time

bolthires Part-Time (Customer Service Remote) Jobs – Hiring Now – Indeed Jobs US

Remote, USA Full-time

Part-Time Evening Data Entry Specialist at The Elite Job

Remote, USA Full-time

Part-time Server - Ria

Remote, USA Full-time

[Remote] Senior Claims Representative (Remote)

Remote, USA Full-time

Experienced eDiscovery Technologist and Consultant – Remote Opportunity for a Seasoned Professional in Electronic Discovery, Information Governance, and Data Recovery Services

Remote, USA Full-time

[Remote] Infrastructure Engineer(12+ years need)

Remote, USA Full-time

Experienced Remote Part Time Flexible Customer Care Representative – Delivering Exceptional Service and Driving Sales Growth in a Dynamic E-Learning Environment

Remote, USA Full-time

Accounts Receivable Specialist (REMOTE)

Remote, USA Full-time

Experienced Part-Time Remote Data Entry Clerk – Unlock a World of Flexibility and Growth Opportunities at blithequark

Remote, USA Full-time

Experienced Part-Time Data Entry Specialist – Remote Opportunity at arenaflex

Remote, USA Full-time

Experienced Remote Customer Service Representative – Part-Time Home-Based Opportunity with arenaflex

Remote, USA Full-time

[Remote] Associate - Revenue & Payment Ops

Remote, USA Full-time

Amazon Delivery Driver – Amazon Store

Remote, USA Full-time

Contractor, Partnerships Manager, Tech Week

Remote, USA Full-time