Nvidia

Principal Firmware Engineer – Data Center Server Management (Santa Clara, CA, USA)

Job Overview

Principal Firmware Engineer – Data Center Server Management

Introduction

NVIDIA, a leader in AI computing, is at the forefront of technological innovation. With our revolutionary GPU technology, we’ve transformed industries ranging from gaming to deep learning and autonomous vehicles. Now, with the NVIDIA GH200 superchip, we’re setting new standards in performance and scalability for HPC and generative AI workloads. We’re seeking a Principal Firmware Engineer to lead end-to-end manageability architecture for next-generation AI supercomputing platforms. If you’re a visionary engineer eager to shape the future of data center technologies, this is your opportunity to shine.

Job Description

As a Principal Firmware Engineer, you will drive server management for large GPU and Grace-based clusters, collaborating with internal teams and cloud customers to deliver cutting-edge solutions for data centers. Your role will involve defining architectures, ensuring reliability, and optimizing firmware to meet the demanding requirements of high-performance computing environments. This position is ideal for a technical leader who thrives in dynamic, fast-paced settings.

Key Responsibilities

  • Lead the design and implementation of server management solutions for large-scale GPU-based data centers.
  • Collaborate with data center architects and cloud customers to define and refine technical requirements.
  • Ensure alignment between customer needs and internal firmware/software designs.
  • Develop data center health management workflows in collaboration with component leads.
  • Drive reliability, optimization, and telemetry performance in firmware architecture.
  • Support cluster bring-up and troubleshoot issues rapidly.
  • Own the delivery of high-quality, reliable firmware to data centers.

Candidate Requirements

  • Experience: 15+ years in server firmware (BMC) and platform software development.
  • Education: BS, MS, or PhD in Electrical Engineering, Computer Science, or related fields (or equivalent experience).
  • Technical Skills: Proficiency in C/C++, Python, and debugging for server platforms. Hands-on experience with data center health management workflows and server architecture.
  • Tools: Familiarity with SCM tools (e.g., Git, Perforce) and project management software (e.g., Jira).
  • Soft Skills: Strong written and oral communication, excellent teamwork, and a commitment to delivering top-notch results. Self-starter with a creative approach to solving complex problems.

Preferred Qualifications

  • In-depth knowledge of x86 or ARM system architecture.
  • Proven leadership in managing large-scale technical projects involving 50+ engineers.
  • Hands-on experience with data center health management.

Compensation and Benefits

  • Base Salary Range: $272,000 – $471,500 (dependent on location and experience).
  • Additional Benefits: Equity options, comprehensive healthcare, and other industry-leading perks.

Why Join NVIDIA?

NVIDIA is renowned as one of the most desirable employers in the tech industry. Our culture fosters innovation, creativity, and collaboration. By joining our team, you’ll work alongside some of the brightest minds in technology and have the opportunity to shape the future of computing. If you’re passionate about solving complex problems and making an impact, we want to hear from you.

Nvidia

The way it's meant to be played
Company Information
  • Total Jobs: 16
  • Category: IT
  • Location: California
  • Full Address: Santa Clara, California, United States
  • Company Size: > 2000 employees
  • CEO: Jensen Huang
