Lead Data Engineer - Scalable Data Pipelines - Contract to Hire
Lead Data Engineer (PySpark, Airflow, Azure) – Scalable Data Pipelines

We're looking for an experienced Lead Data Engineer to design, build, and optimize large-scale data pipelines powering analytics and machine learning workloads. This role is ideal for someone who is hands-on, performance-oriented, and comfortable leading other engineers while owning data workflows end to end. You'll work on both batch and real-time processing, own Spark performance tuning, and help enforce best practices around data quality, governance, and reliability.

⸻

Responsibilities
• Design, develop, and optimize scalable data pipelines using Python, PySpark, Apache Spark, and Airflow
• Build and maintain batch and streaming data processing systems on Spark
• Design and manage Airflow DAGs to orchestrate complex, dependency-heavy workflows
• Implement data partitioning, caching, and Spark performance tuning to handle large datasets efficiently
• Ensure data quality, governance, security, and reliability across the data lifecycle
• Monitor, troubleshoot, and optimize data jobs, SLAs, and pipeline dependencies
• Manage cloud infrastructure (Azure) for data workloads, including cost optimization
• Implement CI/CD pipelines for data workflows using Git, Docker, and Infrastructure-as-Code tools
• Support analytics and ML use cases by working with structured and unstructured data
• Lead and mentor other data engineers, providing architectural guidance and code reviews
• Promote best practices in coding standards, documentation, and version control
• Collaborate effectively with distributed, remote teams in an Agile environment

⸻

✅ Requirements
• 8+ years of hands-on experience in Data Engineering
• Strong expertise with Apache Spark / PySpark, including internals such as RDDs, DataFrames, DAG execution, partitioning, shuffles, and caching
• Proven experience building and operating Airflow DAGs (scheduling, dependencies, retries, SLAs)
• Advanced Python and SQL skills with a focus on performance and maintainability
• Solid experience with Azure data and compute infrastructure
• Working knowledge of Docker, Kubernetes, Terraform, and CI/CD best practices
• Strong problem-solving skills and the ability to optimize large-scale data processing systems
• Prior experience leading or mentoring engineers
• Comfortable working in Agile/Scrum environments
• Excellent communication skills and the ability to collaborate with remote teams

⸻

⭐ Nice to Have
• Experience with streaming frameworks (Spark Structured Streaming, Kafka, Event Hubs)
• Familiarity with data governance, lineage, and observability tools
• Experience supporting ML or advanced analytics pipelines
• Background in cost-efficient Spark optimization at scale

Apply to this job