Director of AI Infrastructure

vinsmart future • United State

Relocation

Apply

AI Summary

We are looking for a Director of AI Infrastructure to architect, build, and scale our end-to-end AI infrastructure stack. This is a founding leadership role with full ownership of architecture, technology decisions, and ecosystem partnerships. The ideal candidate will have experience building systems at scale in AI infrastructure, data center, or hardware infrastructure.

Key Highlights

End-to-End AI Infrastructure Architecture

Hyperscale Compute & Distributed Systems

AI Platform & Inference Systems

Data Center Strategy & Execution

Build the Team & Ecosystem

Key Responsibilities

Define and design the full stack of AI infrastructure

Build and operate large-scale GPU clusters

Lead development of training and inference platforms

Lead design and scaling of AI-ready data centers

Build and lead a world-class AI infrastructure organization

Technical Skills Required

GPU clusters HPC AI compute infrastructure NVIDIA OEMs networking infrastructure partners distributed systems training pipelines model optimization power cooling high-density compute systems

Benefits & Perks

Flexible executive compensation

Relocation & long-term incentives

Access to Vingroup ecosystem

Nice to Have

Experience working with hundreds → thousands → tens of thousands of GPUs

Built systems for frontier AI / hyperscale cloud / HPC environments

Job Description

Work Location: Hanoi: TechnoPark Tower, Vinhomes Ocean Park, Hanoi, Vietnam

Job overview:

We are investing in hyperscale AI infrastructure, with a long-term vision to develop multi-GW AI data center capacity and large-scale GPU compute clusters, enabling frontier model training and AI deployment at national scale.

We are looking for a Director of AI Infrastructure to architect, build, and scale our end-to-end AI infrastructure stack - from data center design to distributed training and inference systems.

This is a founding leadership role with full ownership of architecture, technology decisions, and ecosystem partnerships to build one of the largest AI infrastructure platforms in Southeast Asia.

What you will own:
1. End-to-End AI Infrastructure Architecture

Define and design the full stack of AI infrastructure:

→ Hyperscale data centers (multi-MW → GW-scale roadmap)

→ GPU clusters (hundreds → thousands GPUs, scaling toward 10,000+)

→ High-performance networking and distributed storage

Architect systems for frontier-scale model training and large-scale inference

2. Hyperscale Compute & Distributed Systems

Build and operate large-scale GPU clusters (Phase 1: 500–1000 GPUs, scaling beyond)
Design distributed systems for training, scheduling, and orchestration
Optimize throughput, latency, utilization, and cost efficiency

3. AI Platform & Inference Systems

Looking to advance your Development & Programming career with relocation support? Explore Development & Programming Jobs with Relocation Packages that include comprehensive packages to help you move and settle in your new role.

Lead development of training and inference platforms
Design systems for serving AI models at scale (millions of users / requests)
Optimize model deployment, performance, and reliability

4. Data Center Strategy & Execution

Lead design and scaling of AI-ready data centers (tens of MW → multi-GW roadmap)
Drive decisions on power density, cooling (liquid/air), and rack design
Own vendor ecosystem across NVIDIA, OEMs, networking, and infrastructure partners

5. Build the Team & Ecosystem

Build and lead a world-class AI infrastructure organization
Define engineering culture, hiring standards, and execution model
Establish partnerships with global leaders in AI, cloud, and hardware ecosystems

What we are looking for:

We are open to candidates from different backgrounds, as long as you have built systems at scale in at least one of the following:

Discover our full range of relocation jobs with comprehensive support packages to help you relocate and settle in your new location.

Option A - Distributed AI / Compute

Built or operated large-scale distributed training systems
Deep expertise in GPU clusters, HPC, or AI compute infrastructure

Option B - AI Platform / Inference

Built AI platforms or large-scale inference systems
Strong understanding of LLMs, training pipelines, and model optimization

Option C - Data Center / Hardware Infrastructure

Designed or operated hyperscale data centers (MW-scale or beyond)
Strong expertise in power, cooling, and high-density compute systems.

You might be a fit if you have:

Experience working with hundreds → thousands → tens of thousands of GPUs
Built systems for frontier AI / hyperscale cloud / HPC environments
Strong hands-on technical depth combined with leadership and builder mindset

Interested in relocating to United State? Check out our comprehensive Relocation Jobs in United State page with detailed relocation packages and benefits.

Why this role is unique
Greenfield build - no legacy con
straints Real
scale commitment - multi-MW to 3–4GW AI infrastructure roadmap Full
ownership - architecture, vendor selection, and execution Oppor
tunity to shape national-scale AI capabilities

Compensation & Benefits
Flexible executive compensation (structured based on candidate profile)
Relocation & long-term incentives
Access to Vingroup ecosystem (housing, healthcare, education…)

📩 Apply via:
Email: v.hanhtt41@vinsmartfuture.tech
For more information, please direct message me.

Job Overview

Posted Date May 04, 2026

Employment Type Full-time

Experience Level Director

Location United State

Category Programming

Company vinsmart future

Mentioned Skills

Similar Jobs

Explore other opportunities that match your interests

Senior Quality Assurance Manager

Programming

•

3h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Not Applicable

Jobot

United State

Senior Software Engineer - Data Assembly

Programming

•

8h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Mid-Senior level

blissway inc.

United State

Principal/Sr Principal Software Engineer - Payloads and Tasking Applications

Programming

•

14h ago

Premium Job

•••••• •••••• ••••••

Job Type ••••••

Experience Level ••••••

Northrop Grumman

United State

Director of AI Infrastructure

Key Highlights

Key Responsibilities

Technical Skills Required

Benefits & Perks

Nice to Have

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

Senior Quality Assurance Manager

Jobot

Senior Software Engineer - Data Assembly

blissway inc.

Principal/Sr Principal Software Engineer - Payloads and Tasking Applications

Premium Job

Northrop Grumman

Subscribe our newsletter