AI/ML Consultant – LLM Deployment

Remote, USA Full-time

to set up and configure a self-hosted large language model (Llama 3.1) on our Linux server infrastructure for automated report generation. Primary Objective: Deploy and configure Llama 3.1 (or equivalent) 8B on our hosted Linux server (CPU-only) and create an API service that our SafetyNet Platform can call for AI-powered report generation. Specific Deliverables: Server Environment Setup Configure Linux (Ubuntu 22.04) server environment Install Python 3.11, dependencies, and required libraries Set up virtual environment and security configurations AI Model Installation & Configuration Download and install Llama 3.1 8B Instruct model Optimize model configuration for CPU-only inference Implement quantization if needed for performance Test model functionality and response quality API Service Development (Nice to have) Create REST API service (Flask/FastAPI) for report generation Implement secure endpoints for our SafetyNet Platform to call Add error handling, logging, and health check endpoints Configure service to auto-start on server reboot (systemd) Security & Performance Configure firewall rules (allow only our application server) Implement authentication/API key system Optimize for 30-60 second response times Set up monitoring and logging Documentation & Training Comprehensive setup documentation API usage guide with examples Troubleshooting guide 2-hour knowledge transfer session with our development team Testing & Validation Generate 10+ test reports with sample data Validate output quality and format Performance testing under load Integration testing with our platform (we'll provide API endpoints) Technical Requirements Must Have: 3+ years experience with Python and machine learning frameworks (PyTorch, Transformers) Experience deploying and running large language models (Llama, GPT, Mistral, etc.) Strong Linux system administration skills (Ubuntu/Debian) Experience with API development (Flask, FastAPI, or similar) Understanding of CPU-based ML inference and optimization Experience with Hugging Face model hub Knowledge of systemd service configuration Security best practices for production systems Nice to Have: Experience with model quantization and optimization (bitsandbytes, ONNX) DevOps experience (Docker, monitoring tools) Previous work with government or healthcare systems (HIPAA/FERPA compliance) Experience with justice system or social services applications Apply tot his job

Apply Now

Experienced Full Stack Data Entry and Packaging Strategy Lead - Remote Work Opportunity with Competitive Hourly Rate and Comprehensive Benefits

Remote, USA Full-time

AI/ML Consultant – LLM Deployment

Similar Jobs

AI / ML Engineer OR Data Scientist

Machine Learning Engineer -Remote Job at YO IT CONSULTING in United

Junior AI/NLP/Machine Learning Engineer 2

Research Scientist 5/6 – AI for Member Systems

Principal Machine Learning Infrastructure Researcher

ML/AI Data Scientist for AI-Powered NEXRAD Tracking API - Contract to Hire

Machine Learning Scientist, Pricing/Personalization (Open to Remote) (New York,

Change Manager Consultant

Sr. Threat Researcher II (Remote)

Management Analyst (Part-time) (4772)

Experienced Full Stack Customer Service Representative – Remote Health and Wellness Support

Staff Product Analyst (Remote, USA)

Experienced Full Stack Data Entry and Packaging Strategy Lead - Remote Work Opportunity with Competitive Hourly Rate and Comprehensive Benefits

Solution Architect

Experienced Full Stack Customer Support Representative – E-commerce Chat Support in Canada

Vacation Advisor, Disney Vacation Club (Celebration) Bench, Full Time at Walt Disney World Resort Celebration, FL

Experienced Remote Data Entry Clerk – Product Review and Data Management Specialist

[Remote] Analyst, Claims Research (Remote)

Experienced Data Entry Clerk – Remote Work From Home (Part-Time / Full-Time) Opportunity with arenaflex

Remote Opportunity Product Tester Jobs

AI/ML Consultant – LLM Deployment

Similar Jobs

AI / ML Engineer OR Data Scientist

Machine Learning Engineer -Remote Job at YO IT CONSULTING in United

Junior AI/NLP/Machine Learning Engineer 2

Research Scientist 5/6 – AI for Member Systems

Principal Machine Learning Infrastructure Researcher

ML/AI Data Scientist for AI-Powered NEXRAD Tracking API - Contract to Hire

Machine Learning Scientist, Pricing/Personalization (Open to Remote) (New York,

Change Manager Consultant

Sr. Threat Researcher II (Remote)

Management Analyst (Part-time) (4772)

**Experienced Full Stack Customer Service Representative – Remote Health and Wellness Support**

Staff Product Analyst (Remote, USA)

Experienced Full Stack Data Entry and Packaging Strategy Lead - Remote Work Opportunity with Competitive Hourly Rate and Comprehensive Benefits

Solution Architect

**Experienced Full Stack Customer Support Representative – E-commerce Chat Support in Canada**

Vacation Advisor, Disney Vacation Club (Celebration) Bench, Full Time at Walt Disney World Resort Celebration, FL

**Experienced Remote Data Entry Clerk – Product Review and Data Management Specialist**

[Remote] Analyst, Claims Research (Remote)

**Experienced Data Entry Clerk – Remote Work From Home (Part-Time / Full-Time) Opportunity with arenaflex**

Remote Opportunity Product Tester Jobs

Experienced Full Stack Customer Service Representative – Remote Health and Wellness Support

Experienced Full Stack Customer Support Representative – E-commerce Chat Support in Canada

Experienced Remote Data Entry Clerk – Product Review and Data Management Specialist

Experienced Data Entry Clerk – Remote Work From Home (Part-Time / Full-Time) Opportunity with arenaflex