Nvidia AI Solution Architect m/f/d - #2137557

Lenovo


Date: vor 9 Stunden
Stadt: Stuttgart
Gehalt: €84,500 - €118,000 / Jahr
Vertragstyp: Ganztags
Arbeitsplan: Volle Tag
Lenovo


Nvidia AI Solution Architect m/f/d



General Information


WD00082803

Career area:

Artificial Intelligence

Country/Region:

Germany

City:



Why Work at Lenovo

We are Lenovo. We do what we say. We own what we do. We WOW our customers. Lenovo is a US$57 billion revenue global technology powerhouse, ranked #248 in the Fortune Global 500, and serving millions of customers every day in 180 markets. Focused on a bold vision to deliver Smarter Technology for All, Lenovo has built on its success as the world’s largest PC company with a full-stack portfolio of AI-enabled, AI-ready, and AI-optimized devices (PCs, workstations, smartphones, tablets), infrastructure (server, storage, edge, high performance computing and software defined infrastructure), software, solutions, and services. Lenovo’s continued investment in world-changing innovation is building a more equitable, trustworthy, and smarter future for everyone, everywhere. Lenovo is listed on the Hong Kong stock exchange under Lenovo Group Limited (HKSE: 992) (ADR: LNVGY). This transformation together with Lenovo’s world-changing innovation is building a more inclusive, trustworthy, and smarter future for everyone, everywhere. To find out more visit www.lenovo.com, and read about the latest news via our StoryHub.



Description and Requirements

We are looking for an AI Delivery Architect / Technical lead who can understand the AI solution requirements to design, deploy, tested and validate - Proof of Concepts (PoCs) and full production environments for enterprise customers. This role focuses on the end-to-end deployment design, working closely with customer c-suite team, understand use case and create AI solution. Create detailed deployment solution covering infra, software, tools, process, and procedures that include Nvidia AI Enterprise solutions on Lenovo hardware and ensuring seamless, scalable, and robust production implementations. Your work will help customers unlock the full potential of AI technologies by managing deployments that ensure AI adoption and optimization of AI models in real-world use cases.Key responsibilities:
  • Lead end-to-end transitions of AI PoCs into production environments, managing the entire process from testing to final deployment.

  • Configure, install, and validate AI systems using key platforms, including:
  • VMware ESXi and vSphere for server virtualization, Linux (Ubuntu/RHEL) and Windows Server for operating system integration, Docker and Kubernetes for containerization and orchestration of AI workloads.
  • Conduct comprehensive performance benchmarking and AI inferencing tests to validate system performance in production.
  • Optimize deployed AI models for accuracy, performance, and scalability to ensure they meet production-level requirements and customer expectations.
  • Serve as the primary technical lead/SME for the AI POC deployment in enterprise environments, focusing on AI solutions powered by Nvidia GPUs.
  • Work hands-on with Nvidia AI Enterprise and GPU-accelerated workloads, ensuring efficient deployment and model performance using frameworks such as PyTorch and TensorFlow.
  • Lead technical optimizations aimed at resource efficiency, ensuring that models are deployed effectively within the customer’s infrastructure.
  • Implement risk management strategies and develop contingency plans to mitigate potential issues such as hardware failures, network bottlenecks, and software incompatibilities.
  • Maintain ongoing, transparent communication with all relevant stakeholders, providing updates on project status and addressing any issues or changes in scope.
  • Conduct post-deployment knowledge transfer sessions to educate client/ Lenovo Managed services teams on managing AI infrastructure, troubleshooting common issues, and optimizing AI models.
  • Provide comprehensive training sessions on the operation, management, and scaling of AI systems, ensuring that customers are fully prepared for ongoing operations post-handoff.

Experience:

  • Overall experience 7-10 years

  • Relevant experience of 2-4 years in deploying AI/ML models/ AI solutions using Nvidia GPUs in enterprise production environments.
  • Demonstrated success in leading and managing complex AI infrastructure projects, including PoC transitions to production at scale.

Technical Expertise:

  • Experience in the area of Retrieval Augmented Generation (RAG), NVIDIA AI Enterprise, NVIDIA Inference Microservices (NIMs), NVIDIA NeMo framework, Model Management, Kubernetes

  • Extensive experience with Nvidia AI Enterprise, GPU-accelerated workloads, and AI/ML frameworks such as PyTorch and TensorFlow.
  • Proficient in deploying AI solutions across enterprise platforms, including VMware ESXi, Docker, Kubernetes, and Linux (Ubuntu/RHEL) and Windows Server environments.
  • MLOps proficiency with hands-on experience using tools such as Kubeflow, MLflow, or AWS SageMaker for managing the AI model lifecycle in production.
  • Strong understanding of virtualization and containerization technologies to ensure robust and scalable deployments.

What Lenovo can offer you:

  • Employee Share Purchase Plan
  • Employee Assistance Program, e.g., for health, legal & financial consultancy
  • Pension Plan
  • Meal Allowance / Lunch Vouchers
  • Internal E-learning Development Platform Available for Employees
  • Specialized Development Trainings (based on nomination process)
  • Employees Groups (LGBT+, WILL, etc.)
  • Opportunity to Join/Create Employees Groups (inclusivity, well-being, sports, volunteering, charity, etc.)
  • Job Rad (Bike Leasing)
  • Mobile phone + 3 Sim Cards for Mobile Working

Check out the video, our DACH team created, to give you an insight into the Lenovo culture! We are looking forward to talking to you!

Wie bewerbe ich mich?

Um sich für diesen Job zu bewerben, müssen Sie auf unserer Website autorisieren. Wenn Sie noch kein Konto haben, registrieren Sie sich bitte.

Veröffentlichen Sie einen Lebenslauf

Ähnliche Jobs

Werkstudent Real Estate Management (m/w/d)

DEKRA Germany,
vor 2 Stunden
Stuttgart Teilzeit | DEKRA e.V. Stuttgart | DE51013841-04 Beginn: nach Absprache | Umfang: ca. 20 Std. / Woche Aufgaben Du bist Teil des Immobilienteams des DEKRA Konzerns Du arbeitest mit beim Aufbau eines Energiemanagementsystems mit Bei der Erstellung von Mietverträgen...
DEKRA Germany

Consultant (m/w/d) Operational Restructuring in Stuttgart

Deloitte,
vor 3 Stunden
Du machst den Unterschied. Ob im Business Audit & Assurance, Risk Advisory, Tax & Legal, Financial Advisory oder Consulting: Wir bei Deloitte unterstützen unsere weltweiten Mandanten dabei, sich kontinuierlich weiterzuentwickeln. Entdecke ein vielfältiges Arbeitsumfeld, das ständig in Bewegung ist und...
Deloitte

Bell Labs Internship - Goal-Oriented Communication

Nokia,
vor 4 Stunden
Job Description As an intern on Bell Labs Internship - Goal-oriented Communication, you would join the Network Automation team, focusing on optimization, low-latency communication and AI. Position: Bell Labs Internship - Goal-oriented Communication Duration: 6 months Location: Stuttgart, Germany Education...
Nokia