Austin, Texas
HPC Solution Architect
Hopkinton, Massachusetts, United States
Senior Principal Software Engineer
The Software Engineering team delivers next-generation software application enhancements and new products for a changing world. Working at the cutting edge, we design and develop software for platforms, peripherals, applications and diagnostics — all with the most advanced technologies, tools, software engineering methodologies and the collaboration of internal and external partners.
Join us to do the best work of your career and make a profound social impact as a Senior Principal Software Engineer on our Software Engineering Team in Austin, Texas or Hopkinton, Massachusetts.
What you’ll achieve
As a Senior Software Principal Engineer, you will be responsible for developing sophisticated systems and software basis the customer’s business goals, needs and general business environment creating software solutions.
We are hiring aSenior HPC Solution Architect to design, deploy, and support large‑scale HPC and AI clusters for enterprise, research, and hyperscale customers. This is a hands‑on, customer‑facing Individual Contributor role that blendsLinux systems engineering, cluster lifecycle automation, provisioning frameworks (Omnia/OpenCHAMI), Slurm/Kubernetes, and deep troubleshooting of production environments. Ideal for strong technical engineers who enjoy solving complex customer problems, contributing to open‑source, and shaping modern HPC deployment practices.
Lead customer architecture & design, translating HPC/AI workload requirements into scalable cluster architectures (compute, schedulers, storage, interconnects)
Deploy and operationalize clusters using Omnia or similar automation, including provisioning, scheduler bring‑up, telemetry, authentication, and repo management
Build and maintain provisioning workflows (OpenCHAMI‑based or equivalent) covering PXE/iPXE boot, cloud‑init, security, and identity/cert operations
Serve as Tier‑3 engineering escalation, troubleshooting complex provisioning, scheduling, GPU, networking, and performance issues; perform RCAs and drive permanent fixes
Contribute to open source and customer enablement through code contributions, documentation, workshops, runbooks, templates, and field readiness materials
Essential Requirements:
HPC & Distributed Systems: 8+ years engineering large‑scale HPC and distributed infrastructure, with strong knowledge of cluster architecture, schedulers, and provisioning workflows
Linux & Automation: Deep experience with RHEL/Rocky/Ubuntu; hands‑on cluster deployments using open‑source toolchains, Omnia, and OpenCHAMI (composable provisioning, cloud‑init, microservices)
Schedulers, Containers & Observability: Production experience with Slurm and/or Kubernetes; proficient with Docker/Podman, OpenTelemetry pipelines, and telemetry instrumentation
Networking, Fabrics & Streaming: Solid L2/L3 fundamentals, PXE/iPXE, DHCP/TFTP; experience with InfiniBand/RoCE/Omni‑Path fabrics and event streaming with Kafka
Scripting, Monitoring & Customer Engagement: Strong skills in Ansible, Python, Bash; expertise with Prometheus and Grafana dashboards; proven communication skills for escalations and simplifying complex HPC concepts
Compensation
Dell is committed to fair and equitable compensation practices. The salary range for this position is $210,000 - $265,000.
Benefits and Perks of working at Dell Technologies
Your life. Your health. Supported by your benefits. You can explore the overall benefits experience that awaits you as a Dell Technologies team member — right now at MyWellatDell.com
Who we are
We believe that each of us has the power to make an impact. That’s why we put our team members at the center of everything we do. If you’re looking for an opportunity to grow your career with some of the best minds and most advanced tech in the industry, we’re looking for you.
Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play. Join us to build a future that works for everyone because Progress Takes All of Us.
Dell Technologies is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. Read the full Equal Employment Opportunity Policy here.
-
Dicas para uma entrevista bem-sucedida
Temos muito o que conversar, e não se esqueça que esta é a sua chance de entrevistar a Dell também. Saiba Mais -
Perguntas Frequentes
Para te ajudar a entender melhor sobre nosso processo seletivo, nós fornecemos uma lista com as perguntas mais comuns. Saiba Mais -
Receba alertas de vagas
Inscreva-se e receba oportunidades que combinem com suas habilidades, diretamente no seu e-mail. Registre-se
Benefícios Globais
Programas de Saúde
Ferramentas e Recursos Premiados de Bem-Estar Financeiro
Licença-maternidade e paternidade generosa, para novas mães, pais, cuidadoras e cuidadores
Plataforma de bem-estar líder no setor
Programa de Assistência para as Pessoas da nossa Equipe
Nenhuma vaga vista recentemente. Veja todas oportunidades
Nenhuma vaga salva. Veja todas oportunidades
Seja a primeira pessoa a receber novas vagas
Receba alertas de vagas no seu e-mail
Inscreva-se na nossa Rede de Talentos e receba oportunidades que combinem com suas habilidades, diretamente no seu e-mail.