Software Engineer - Math Libraries for AI and HPC Kernel Generation
Remote
Relocation
AI Summary
Join NVIDIA's math libraries teams to develop and optimize algorithms for AI and HPC kernel generation. Work on high-quality, high-performance numerical dense linear algebra software on GPUs.
Key Highlights
Scoping, designing, and implementing high-quality, high-performance numerical dense linear algebra software on GPUs
Providing technical leadership and feedback to library engineers
Working closely with product management and other internal and external customers
Benefits & Perks
Competitive salaries
Generous benefits package
Remote work
Relocation package
Job Description
We are looking for software engineers to join our math libraries teams for AI and HPC kernel generation, specifically targeting emulation of math operations across different precisions. Around the world, leading commercial and academic organizations are revolutionizing AI, scientific and engineering simulations, and data analytics, using data centers powered by GPUs. These technologies are applied in healthcare, NLP, VR, deep learning, autonomous vehicles, and countless other fields. Did you know our team develops the GPU-accelerated math libraries that make all of this possible? If the idea of tinkering with bits and precision formats in math operations, and applying your knowledge to develop and optimize algorithms that make an impact around the world, excites you, come and join our team!
What You Will Be Doing
- Scoping, designing, and implementing high-quality, high-performance numerical dense linear algebra software on GPUs.
- Providing technical leadership and feedback to library engineers working with you on projects, and sometimes mentoring interns.
- Working closely with product management and other internal and external customers to understand feature and performance requirements and help define the technical roadmaps of libraries.
- Finding opportunities to improve library performance and reduce code maintenance overhead through re-architecting.
Requirements
- PhD or Master’s degree in Computer Science, Applied Math, or related science or engineering field of study (or equivalent experience).
- 5+ years of experience in the design, development, testing, maintenance, and performance optimization of production software using CUDA and C++.
- Good knowledge of GPU (preferred) or CPU hardware architecture.
- Strong fundamentals in finite-precision arithmetic and numerical methods for linear algebra.
- Great teamwork, communication, and documentation habits.
- Experience with CUTLASS or with low-level programming, such as assembly, for performance optimization is a huge plus.
- Experience with a scripting language, preferably Python.
- Experience working in a globally distributed team.
With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. For Poland: The base salary range is 281,250 PLN - 487,500 PLN for Level 4, and 360,000 PLN - 624,000 PLN for Level 5.
JR2004481