Member of Engineering – Pre-training, Data Engineering

Remote, USA Full-time

Job Description: • Build and maintain high-performance pipelines for trillions of tokens. • Deliver diverse and high quality datasets for pre-training foundation models. • Closely work with other teams such as Pretraining, Posttraining, Evals and Product to to ensure alignment on the quality of the models delivered. Requirements: • Strong background in building production-grade, distributed data systems for machine learning, with experience in: • Orchestration: Slurm, Airflow, or Dagster • Observability & Reliability: CI/CD, Grafana, Prometheus, etc. • Infra: Git, Docker, k8s, cloud managed services • Batched inference (ex: vLLM) • Performance obsession, especially with large-scale GPU clusters and distributed pipelines • Expert-level python knowledge and ability to write clean and maintainable code • Strong algorithmic foundations • Proficiency with libraries like Polars, Dask, or PySpark • Nice to have: • Experience in building trillion-scale SOTA pretraining datasets • Experience translating research to production at scale • Experience with OCR, web crawling, or evals • Prior experience pre-training LLMs Benefits: • Fully remote work & flexible hours • 37 days/year of vacation & holidays • Health insurance allowance for you and dependents • Company-provided equipment • Wellbeing, always-be-learning and home office allowances • Frequent team get togethers • Great diverse & inclusive people-first culture Apply tot his job

Apply Now

Experienced Full-Time Customer Experience Specialist – Remote Night Shifts with Comprehensive Benefits and Growth Opportunities at Blithequark

Remote, USA Full-time

Member of Engineering – Pre-training, Data Engineering

Similar Jobs

Data Engineer, AI Native

Staff ML Engineer, FinTech (Remote)

[Remote] Data Engineer (Pricing & Monetization)

Senior Machine Learning Engineer - Remote

[Remote] Senior Data Engineer – Observability & Security

[Remote] Python/AI Developer with Automobile Exp

Engineering Manager – Machine Learning | Runway | $310k-$370k | Remote (USA, Canada)

Software/Hardware engineer, Data Scientist, Neuroscientist (IoT, AI)

AI Engineering Manager

Engineering Manager, Applied AI

Experienced Full-Time Customer Experience Specialist – Remote Night Shifts with Comprehensive Benefits and Growth Opportunities at Blithequark

Senior Strategic Account Executive - West Region (Remote) - RF-SMART for NetSuite Sales and Customer Success

Experienced Remote Online Chat Specialist – Customer Support and Service Representative for Dynamic Team at blithequark

Work From Home Data Entry Jobs – Earn Weekly Pay, No Experience

Azure Administrator/Cloud Architect ( Only W2 Consultants)

Experienced Patient Service Representative (Call Center) – Remote Opportunity for a High-Volume Call Center Environment

Experienced Live Chat Representative – Part-Time Remote Opportunity at arenaflex

Executive Director - West Covina

Associate Customer Success Manager, APAC

Nursing Faculty-Irving Campus

Member of Engineering – Pre-training, Data Engineering

Similar Jobs

Data Engineer, AI Native

Staff ML Engineer, FinTech (Remote)

[Remote] Data Engineer (Pricing & Monetization)

Senior Machine Learning Engineer - Remote

[Remote] Senior Data Engineer – Observability & Security

[Remote] Python/AI Developer with Automobile Exp

Engineering Manager – Machine Learning | Runway | $310k-$370k | Remote (USA, Canada)

Software/Hardware engineer, Data Scientist, Neuroscientist (IoT, AI)

AI Engineering Manager

Engineering Manager, Applied AI

Experienced Full-Time Customer Experience Specialist – Remote Night Shifts with Comprehensive Benefits and Growth Opportunities at Blithequark

Senior Strategic Account Executive - West Region (Remote) - RF-SMART for NetSuite Sales and Customer Success

Experienced Remote Online Chat Specialist – Customer Support and Service Representative for Dynamic Team at blithequark

Work From Home Data Entry Jobs – Earn Weekly Pay, No Experience

Azure Administrator/Cloud Architect ( Only W2 Consultants)

**Experienced Patient Service Representative (Call Center) – Remote Opportunity for a High-Volume Call Center Environment**

**Experienced Live Chat Representative – Part-Time Remote Opportunity at arenaflex**

Executive Director - West Covina

Associate Customer Success Manager, APAC

Nursing Faculty-Irving Campus

Experienced Patient Service Representative (Call Center) – Remote Opportunity for a High-Volume Call Center Environment

Experienced Live Chat Representative – Part-Time Remote Opportunity at arenaflex