Principal GPU Network Software Engineer - Communications Library (Santa Clara) Job at AMD, Santa Clara, CA

  • AMD
  • Santa Clara, CA

Job Description

Principal GPU Network Software Engineer - Communications Library

This range is provided by AMD. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences: the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

The Role

As a GPU Network Software Engineer with our Network Technology Solutions Group (NTSG), you will design, implement, and test features in communication libraries, middleware, and frameworks to provide best-in-class support for GPU applications running high-performance computing and machine learning workloads at scale. You will work with technical experts within AMD, our partners, and the open-source community to implement these features as part of AMD's open-source ROCm stack for GPU computation.

The Person

You are accustomed to working in a dynamic, geographically distributed agile team, where partnership and collaboration are paramount. You possess excellent written and verbal communication skills, and strong attention to detail. You are results-oriented and accustomed to tight deadlines and changing priorities. Most importantly, you are constantly thinking of ways to improve performance of multi-node GPU applications.

Key Responsibilities

  • Design, implement, and test features to enhance GPU support in communication libraries, middleware, and frameworks
  • Benchmark, profile, and optimize code to maximize performance of multi-node GPU applications
  • Deliver high-quality code and documentation following best practices for open-source software development
  • Work with key technical experts at our customers, across AMD, and with our industry partners in the Ultra Ethernet Consortium and Ultra Accelerator Link Consortium to advance scale-out and scale-up software and hardware solutions

Preferred Experience

  • Strong background developing system software in C/C++
  • Experience with at least one of the following:
      • Implementing communication middleware like MPI/SHMEM
      • Implementing lower-level communication frameworks like UCX and libfabric, or development using RDMA APIs
      • Development and optimization of communication collective algorithms (e.g. AllReduce)
  • Familiarity with GPU programming in HIP or CUDA
  • In-depth knowledge of best practices in software development, including testing, profiling, debugging, documentation, version control, issue tracking, and planning
  • Proven track record contributing to open-source projects

Academic Credentials

  • B.Sc. or B.Eng. degree in Computer Science, Electrical Engineering, or equivalent

Advanced degrees, such as M.Sc., M.Eng., or Ph.D., are preferred.

LOCATION:

Santa Clara, CA

Benefits offered are described in AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: Semiconductor Manufacturing

Job Tags

Full time
