Solutions Architect, AI Hyperscalers

Remote, USA Full-time
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by extraordinary technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing.

NVIDIA is searching for an AI/ML Solutions Architect focusing on Hyperscale customers and Cloud Service Providers. Your primary responsibility will be to lead technical engagement with customers on software for AI training, inference, and infrastructure deployed at vast scale. You will work across multiple organizations within NVIDIA, as well as at the customer, to ensure successful and trouble-free deployments. If you would like to partner with a large company to build the automation and management for a robust, large-scale artificial intelligence infrastructure, and you are interested in optimizing and characterizing customer-specific AI models and pipelines, you should apply!

What you’ll be doing:
• As a key technical member of a focused account team, you will serve as the main point of contact for NVIDIA products, enabling internet giants and cloud providers to have an innovative AI/ML software infrastructure.
• Work directly with best-in-class engineering teams to secure design wins, address challenges, bring solutions to production, and support them throughout their lifecycle.
• Become a trusted advisor to your customer by understanding their environment, constraints, and long-term strategy. Translate these insights into product requirements and innovative solutions.
• Help your customer enhance the value of NVIDIA technology, and provide feedback to NVIDIA for future product improvements.
• Facilitate the resolution of customer issues, offering timely and proactive communications to mitigate risks.
• Lead workshops, demos, and proof-of-concepts to showcase NVIDIA’s AI/ML capabilities.
• Guide customers on standard processes for scalable AI model deployment and inference optimization.

What we need to see:
• Minimum of a BS/MS in Computer Science, Electrical Engineering, or equivalent experience.
• 4+ years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions.
• Proven understanding of Linux, including troubleshooting, optimization, and customization for AI/ML workloads.
• Strong understanding of data science and machine learning infrastructure, both software and hardware.
• Professional-level communication skills, including the ability to tailor messages for varying technical audiences and maintain composure in high-pressure situations.
• Excellent follow-up and interpersonal skills, with a true passion for problem-solving.
• Proficient in Python, with the ability to develop scripts and build custom tools. Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful.
• Demonstrated eagerness to learn and apply new technologies.

Ways to stand out from the crowd:
• Experience with chatbots, RAG pipelines, vector databases, and distributed training or inference workloads.
• Experience or background in HPC (High Performance Computing) environments for AI or ML applications.
• Familiarity with multi-node GPU clusters and performance tuning for large-scale AI workloads.
• Experience developing in cloud and/or virtualized environments and with containerized solutions, including knowledge of Docker and Kubernetes.
• Background with common deep learning frameworks such as PyTorch or JAX.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until October 11, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.