Senior Python / LLM Engineer Needed – Router MVP & Predictive Model Loader

Remote, USA | Full-time
✅ In-Scope Work (Remaining)

Milestone 1: Router MVP Implementation

Deliverables
• Embedding pipeline (query + document embeddings)
• Vector storage using FAISS or Chroma
• Projection module for query feature extraction
• Configurable scoring & model-selection strategies
• Router MVP with pluggable LLM backends
• Router validation tests (routing correctness & mis-routing analysis)

Acceptance Criteria
• End-to-end routing demonstrated
• Deterministic and explainable routing decisions
• Pytest unit tests included
• All code committed to repository

⸻

Milestone 2: Predictive Loader & Integration

Deliverables
• Predictive model loader (LLM-based classifier)
• Warm-start caching & preload logic
• Cache management strategy
• Full integration with Router MVP
• FastAPI backend exposing routing endpoints
• Structured JSON logging
• End-to-end testing + documented stress testing
• Final validation & testing report

Acceptance Criteria
• Predictive loading working correctly
• Fully integrated end-to-end system
• Stress-testing methodology clearly documented
• Final report delivered

⸻

Technical Requirements
• Strong Python backend experience
• FastAPI
• LangChain or equivalent LLM orchestration framework
• FAISS or Chroma vector stores
• Dockerized services
• Structured JSON logging
• Pytest for unit & integration testing

⸻

Ideal Candidate
• 3+ years of Python backend experience
• Proven experience with LLMs, embeddings, and routing systems
• Hands-on with vector databases and retrieval pipelines
• Comfortable writing clean, testable, production-ready code
• Experience with performance testing and system validation
• Strong communication and documentation skills

⸻

Deliverables & Collaboration
• All work delivered via Git repository
• Clean, readable, well-tested code
• Clear documentation for setup, testing, and usage
• Milestone-based payments

⸻

To Apply

Please include:
1. Relevant experience with LLM routing, embeddings, or RAG systems
2. GitHub or code samples (if available)
3. Brief explanation of how you would approach Router MVP + predictive loading
