Senior NLP Data Engineer

Remote, USA Full-time
About the position

We offer a flexible working policy that supports a healthy balance between personal and professional well-being. This role requires in-office presence on Tuesdays and Thursdays to collaborate, connect, and learn from peers, while also maintaining the flexibility for meaningful work-life balance.

Being a Senior NLP Data Engineer at iManage Means…

You're passionate about transforming unstructured text into meaningful insights that power AI and machine learning solutions. You thrive at the intersection of data engineering, AI, and natural language processing, building the pipelines and datasets that fuel generative AI applications, agentic systems, advanced model fine-tuning, and other NLP-driven capabilities across iManage.

As an NLP Data Engineer on the Applied AI team, you will design, build, and optimize large-scale text data pipelines that power AI/ML and Generative AI solutions for our customers. You'll work with knowledge engineering, applied AI, and product teams to prepare, enrich, and integrate document data. Your work will be essential to enabling intelligent, AI-powered features across the iManage platform.

Responsibilities
• Designing, developing, and maintaining scalable pipelines in Microsoft Azure to ingest and transform large volumes of text data from multiple sources
• Designing automated workflows for text normalization, deduplication, language identification, PII redaction, and metadata enrichment
• Building automated data validation processes to ensure accuracy and consistency
• Supporting model fine-tuning, semantic search, and Gen AI evaluation tuning through dataset curation, prompt dataset preparation, labeling coordination, and text quality validation
• Partnering with the Applied AI team to gather data requirements and build data interfaces for developing and maintaining machine learning systems
• Maintaining data lineage and following data privacy, security, and governance best practices
• Implementing data versioning and lineage tracking for machine learning experiments

Requirements
• A Bachelor's degree or higher in Computer Science, Data Engineering, Applied Mathematics, Computational Linguistics, or a related quantitative field
• 4+ years of data engineering experience, with at least 2 years working with unstructured data in a business setting
• Strong proficiency in Python, PySpark, and data manipulation for large unstructured text datasets
• Strong understanding of NLP concepts such as tokenization, embeddings, and semantic search, and experience with standard text libraries such as spaCy, Hugging Face Datasets, and NLTK
• Solid DataOps knowledge and experience orchestrating advanced NLP data pipelines using cloud-based data infrastructure
• Proficiency with Git and collaborative development frameworks
• A passion for enabling AI capabilities through scalable, reliable data architecture
• Problem-solving, creativity, curiosity, and a collaborative mindset

Nice-to-haves
• Exposure to Microsoft Azure services such as Fabric, ADLS, AI Foundry, Azure ML, and MLflow
• Experience with knowledge graph implementation for NLP applications
• Experience working with data for the legal domain
• Experience designing architectures for large-scale text corpora

Benefits
• Join a supportive, experienced team with an inclusive, encouraging, and vibrant culture.
• Have flexible work hours that allow me to balance my 'me time' with my work commitments.
• Collaborate in a modern open-plan workspace, with a gaming area, free snacks, drinks, and regular social events.
• Focus on impactful work, solving complex, real challenges utilizing the latest technologies and protocols.
• Own my career path with our internal development framework. Ask us more about this!
• Learn new skills and earn certifications with access to unlimited courses in LinkedIn Learning.
• Join an innovative, industry-leading SaaS company that is continuing to grow and scale!
• Creating an inclusive environment where I can help shape the culture not just by fitting in, but by adding to it.
• Providing a market-competitive salary that is applied through a consistent process, equitable for all our employees, and regularly reviewed based on industry data.
• Rewarding me with an annual performance-based bonus.
• Offering comprehensive Health/Vision/Dental/Life Insurance, and a 401k Retirement Savings Plan with a company match up to 4%.
• Giving access to HealthJoy, a healthcare concierge service, to help me maximize my health benefits.
• Granting enhanced leave for expecting parents: 20 weeks 100% paid for primary leave, and 10 weeks 100% paid for secondary leave.
• Providing me with a flexible time off policy to take the time off that I need, be it for vacation, volunteering, celebrating holidays, spending time with family, or simply taking time to recharge and reset.
• Caring for my mental health and well-being with multiple company wellness days and free access to the Healthy Minds app for mindfulness, meditation, and more.
