Senior Solutions Architect, Infiniband and Networking Ethernet

NVIDIA CorporationMünchen
Gehalt: Von 292.500,00 € bis 650.000,00 €

Overview

NVIDIA is seeking a Senior Networking (ETH/IB) Solutions Architect to join its NVIDIA Infrastructure Specialist Team. The role involves designing, implementing, and maintaining large‐scale AI/HPC networking infrastructure for customers worldwide.

Responsibilities

  • Build AI/HPC infrastructure for new and existing customers.
  • Support operational and reliability aspects of large‐scale AI clusters, focusing on performance at scale, real‐time monitoring, logging, and alerting.
  • Engage in and improve the full lifecycle of services—from inception and design through deployment, operation, and refinement.
  • Maintain live services by measuring and monitoring availability, latency, and overall system health.
  • Provide feedback to internal teams, including opening bugs, documenting workarounds, and suggesting improvements.

Qualifications

  • BS/MS/PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields.
  • At least 8 years of professional experience in networking fundamentals, TCP/IP stack, and data center architecture.
  • Proficiency in configuring, testing, validating, and resolving issues in LAN and InfiniBand networks, especially in medium to large‐scale HPC/AI environments.
  • Advanced knowledge of EVPN, BGP, OSPF, VXLAN protocols.
  • Hands‐on experience with network switch/router platforms such as Cumulus Linux, SONiC, IOS, JunosOS, and EOS.
  • Extensive experience delivering automated network provisioning solutions using tools like Ansible, Salt, and Python.
  • Ability to develop CI/CD pipelines for network operations.
  • Strong focus on customer needs and satisfaction.
  • Self‐motivated with leadership skills to work collaboratively with customers and internal teams.
  • Strong written, verbal, and listening skills in English.

Desired Skills

  • Familiarity with cloud networks (AWS, GCP, Azure).
  • Relevant Linux or networking certifications.
  • Experience with high‐performance computing architectures.
  • Understanding of job schedulers (Slurm, PBS).
  • Knowledge of Luster management technologies, including BCM (Base Command Manager).
  • Experience with GPU‐focused hardware/software.

Salary

Base salary is determined by location, experience, and comparable positions. For Poland: Level 4 – 292,500 PLN to 507,000 PLN; Level 5 – 375,000 PLN to 650,000 PLN.

#J-18808-Ljbffr