100% Remote Golang Developer with Devops/LLM exp. || W2 Consultant

Remote, USA Full-time
Hi, Hope you are doing well, Please find the job description given below and let me know your interest. Position: 100% Remote Golang Developer with Devops/LLM exp. || W2 Consultant Location: Remote Duration: 6-12 months Visa: Only USC, GC Job Description We are looking for devs with general cloud services / distributed services experience, with LLM experience as a secondary skill. GPU experience is now low on the list of preferred skills: Dedicated Inference Service Required Skills- • Proficiency in Golang for building scalable and performant backend services. • Deep experience building services in modern cloud environments on distributed systems (i.e., containerization (Kubernetes, Docker), infrastructure as code, bolthires/CD pipelines, APIs, authentication and authorization, data storage, deployment, logging, monitoring, alerting, etc.) • Experience working with Large Language Models (LLMs), particularly hosting them to run inference • Strong verbal and written communication skills. Your job will involve communicating with local and remote colleagues about technical subjects and writing detailed documentation. • Experience with building or using benchmarking tools for evaluating LLM inference for various models, engine, and GPU combinations. • Familiarity with various LLM performance metrics such as prefill throughput, decode throughput, TPOT, and TTFT • Experience with one or more inference engines: e.g., vLLM, SGLang, and Modular Max • Familiarity with one or more distributed inference serving frameworks: e.g., llm-d, NVIDIA Dynamo, and Ray Serve etc. • Experience with AMD and NVIDIA GPUs, using software like CUDA, ROCm, AITER, NCCL, RCCL, etc. • Knowledge of distributed inference optimization techniques - tensor/data parallelism, KV cache optimizations, smart routing etc. What You'll Be Working On- • Develop and maintain an inference platform for serving large language models optimized for the various GPU platforms they will be run on. • Work on complex AI and cloud engineering projects through the entire product development lifecycle (PDLC) - ideation, product definition, experimentation, prototyping, development, testing, release, and operations. • Build tooling and observability to monitor system health, and build auto tuning capabilities. • Build benchmarking frameworks to test model serving performance to guide system and infrastructure tuning efforts. • Build native cross platform inference support across NVIDIA and AMD GPUs for a variety of model architectures. • Contribute to open source inference engines to make them perform better on DigitalOcean cloud. Thanks & Regards, Gaurav Gaur Email: [email protected] | Phone : 972-645-9280 LinkedIn: DMS Vision ,INC 4645 Avon Lane, Suite 210 Frisco, TX 75033 Apply tot his job Apply tot his job
Apply Now

Similar Jobs

Sr. Golang Developer | Remote | W2 Only |

Remote, USA Full-time

Sr. Golang Developer | REMOTE | W2 ; No OPT's

Remote, USA Full-time

B-CPT-10226 SEO & Google Ads Specialist (Construction Industry Focus) at 20four7VA

Remote, USA Full-time

Paid Search Specialist (Google Ads) + $300 Sign-On Bonus!

Remote, USA Full-time

Paid Ad Specialist (Freelance)

Remote, USA Full-time

Technical Aide - Maplewood, MN

Remote, USA Full-time

[Remote] Entry Level Sales Associate | Drive Revenue from anywhere with Full Flexibility

Remote, USA Full-time

Client Solutions Rep - Entry Level OK

Remote, USA Full-time

Entry-Level Customer Sales Representative; Remote

Remote, USA Full-time

Social Media Jobs – Work from Home | No Degree or Experience Needed

Remote, USA Full-time

Casualty Claims Examiner ($2,500 Sign-On Bonus)

Remote, USA Full-time

Medicare Cost Report Auditor I (Birmingham)

Remote, USA Full-time

[Remote] Global Immigration Manager

Remote, USA Full-time

Education Support Specialist

Remote, USA Full-time

**Experienced Pathology Support Specialist – Remote Administrative Assistant for Medical Transcription and Office Operations**

Remote, USA Full-time

[Entry Level/Remote] bolthires At Home Advisor Part-Time Jobs $33/H (Hiring Now)

Remote, USA Full-time

Senior Manager, Data Science - Leading AI/ML Innovation in Retail & E-commerce at Walmart Global Tech

Remote, USA Full-time

Quality Assurance (QA) Pharmacist

Remote, USA Full-time

Experienced Customer Service Representative for Remote Work Opportunity at blithequark – Delivering Exceptional Client Experiences through Empathy, Knowledge, and Efficiency

Remote, USA Full-time

Director Network Operations and Engineering

Remote, USA Full-time
Back to Home