Data Engineer (AI Enablement)

Remote, USA Full-time
THE JOB / Data Engineer (AI Enablement) STRATEGY / Responsible for building and operating the data foundations that power Octagon’s AI solutions and enterprise search. • **Our headquarters are in Stamford, CT, but the location of this position can be 100% remote for qualified candidates. You’re a systems-minded builder who turns messy, multi-source data into reliable, searchable, and governed knowledge. Your mission is to stand up the pipelines, vector search, and metadata standards that make AI tools accurate, fast, and safe. You’ll partner closely with the Solutions Engineer (peer role) to take prototypes and ship durable infrastructure—ingestion, embeddings, indexing, and APIs—so teams can find and use what they need. You’ll report to the Director, Data Strategy and work across departments to reduce manual effort, improve data quality, and enable AI-powered workflows at scale. THE WORK YOU’LL DO • Data foundations: Design and operate the vector database/search layer (e.g., FAISS/pgvector/Milvus) and document-chunking/embedding pipelines that make Octagon’s content discoverable and auditable. • Scalable pipelines for AI/ML/LLM: Implement and maintain ELT/ETL to support downstream workflows such as data labeling, classification, and document parsing; build robust validations, lineage, and observability. • Retrieval APIs: Expose governed retrieval endpoints that respect permissions (ACLs), support metadata filters, and return source snippets/IDs for grounding and citations. • Data structuring & manipulation: Normalize, transform, and move JSON and other structured payloads cleanly through workflows to ensure reliable handoffs and automation outputs. • Align & collaborate: Align product peers, design, data science, engineering, and commercial teams around a unified roadmap and shared data contracts. • Operationalize prototypes: Take MVPs from the Solutions Engineer and productionize with CI/CD, telemetry, cost/usage guardrails, and pilot → rollout gating. • Reliability & security: Build monitoring (freshness, re-index SLAs, retrieval quality), secrets management, access controls, and audit logging aligned with enterprise governance. • Flexibility and willingness to travel and work weekends or holidays as needed. Anticipated travel level: Low (0–15%). THE BIGGER TEAM YOU’LL JOIN Recognized as one of the “Best Places to Work in Sports”, Octagon is the global sports, entertainment, and experiential marketing arm of the Interpublic Group. We take pride in being Playmakers – finding insightful, bold ways to create play in our work, our lives, and in the world. We believe in the power of play to create big ideas and unlock potential for our clients and talent. We can put ourselves in the shoes of fans because we ARE fans – of sports, entertainment, and culture at large. This expertise allows us to continually evolve the fan experience across sports and entertainment alongside some of the biggest brands and talent in the world. The world needs play more than ever. Are you a Playmaker? WHO WE’RE LOOKING FOR • 3+ years (or equivalent portfolio) building data systems: data modeling, ELT/ETL, Python + SQL; experience with cloud object storage and relational databases. • Hands-on with embeddings and vector databases (e.g., FAISS/pgvector/Milvus) and document processing pipelines for RAG-style retrieval. • Scalable pipeline experience supporting AI/ML/LLM use cases (labeling, classification, doc parsing) and partnering closely with Data Science and Data Labeling teams. • Data structuring & manipulation expertise: cleanly normalizing and transforming JSON/Parquet/CSV payloads; designing resilient data contracts and schemas. • Orchestration/ops: Airflow/Prefect (or similar), CI/CD, structured logging/monitoring, cost/usage guardrails; secure secrets management. • Strong collaboration and communication skills; proven ability to align product/design/engineering/commercial stakeholders around a unified roadmap. Nice-To-Haves • Enterprise connectors and productivity stacks (e.g., Microsoft 365/SharePoint/Teams/Graph, Copilot or Copilot Studio/Power Automate; Google Workspace; Salesforce; DAMs). • Experience implementing LLM inference patterns, similarity search, guardrails, and memory; familiarity with agent frameworks or custom orchestration. • Additional languages for systems work (e.g., C++, C#, Java, or Go). • Containers (Docker), GitHub Actions, IaC; lightweight internal UIs (Streamlit or R Shiny) to expose services. • Familiarity with marketing/media-measurement datasets and associated normalization/quality checks. The base range for this position is $90,000 – $100,000. Where an employee or prospective employee is paid within this range will depend on, among other factors, actual ranges for current/former employees in the subject position; market considerations; budgetary considerations; tenure and standing with the company (applicable to current employees); as well as the employee’s/applicant’s background pertinent experience, and qualifications We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, sexual orientation, age, disability, gender identity, marital or veteran status, or any other protected class. Apply tot his job
Apply Now

Similar Jobs

AI Automation Developer for Workflow 1 (SEO Blogs, Images & AI Videos)

Remote, USA Full-time

[Remote] Data Strategist & Automation Specialist

Remote, USA Full-time

AI Agent Developer Needed – Social Media Automation & Lead Management

Remote, USA Full-time

IT Application Developer (AI-Driven IT Ops) Intern

Remote, USA Full-time

AI Automation & Agent Developer (OpenAI, Zapier/n8n) - Project-Based - Contract to Hire

Remote, USA Full-time

[Remote] Automation Engineers

Remote, USA Full-time

[Remote] Devin.ai Consultant (Enterprise AI Implementation Specialist)

Remote, USA Full-time

[Remote] Sr. Advanced ETL Data Engineer / AWS Data Engineer-Dimensional Modeling for Analytics & reporting.(W2)

Remote, USA Full-time

Senior Data/AI Engineer ($55/hr )Lead AI and Data Solutions Engineer ($60/hr)

Remote, USA Full-time

[Remote] Cloud Data Engineer (Strictly W2 ONLY)

Remote, USA Full-time

Sub-Regional EHS Manager II

Remote, USA Full-time

Remote Data Entry Clerk at blithequark - Part-Time Flexible Schedule with Competitive Pay

Remote, USA Full-time

Account Coordinator - Media and Technology Team in San Francisco

Remote, USA Full-time

Surety Technology & Automation Representative

Remote, USA Full-time

Sr. Dot Net Engineer

Remote, USA Full-time

AI Ethicist / AI Ethics Officer

Remote, USA Full-time

Remote Part Time Paralegal Contract work

Remote, USA Full-time

Sr. Software Engineer II - DevSecOps, Reliability, Security (Remote Eligible)

Remote, USA Full-time

Experienced Online Customer Support Specialist – Remote Work Opportunity for Delivering Exceptional Healthcare Solutions at arenaflex

Remote, USA Full-time

**Experienced Remote Data Entry Specialist – Join blithequark's Global Team and Shape the Future of Financial Services**

Remote, USA Full-time
Back to Home