HotSkills.Tech - Explore Job Opportunities

Find the latest job openings from top companies around the world. Use the filters to refine your search and discover the perfect role for you.

Featured Jobs

Senior Python Developer

TechCorp

San Francisco, CA

Data Scientist

DataCo

New York, NY

Frontend Developer

WebSolutions

Remote

DevOps Engineer

CloudTech

Seattle, WA

Open Jobs

Engineering Manager - ML Platform and Infrastructure

Applied Intuition - Sunnyvale, CA

Senior Software Engineer - Platform Development

Tanium - Emeryville, CA

Senior Staff Software Engineer

Carbon, Inc. - Redwood City, CA

Principal Software Engineer | DevSecOps | Product Security

ServiceNow - Santa Clara, CA

PowerBuilder Developer-Denver, CO(Onsite)

Jobs via Dice - Denver, CO

Principal Software Engineer - Credit Card

Nubank - Durham, NC

Cloud Engineer - CONTRACT

National Guardian Life Insurance Company - Madison, WI

Applied Intuition - Sunnyvale, CA

Job Type: Not specified

Experience: Not specified

About Job

About the role

As an Engineering Manager on the ML Platform team, you'll lead a world-class group of engineers focused on building the infrastructure that powers Physical AI at scale. Your team will own three critical areas: Training & Inference Orchestration, where we build frameworks to efficiently schedule and run massive jobs across thousands of GPUs; GPU Cluster Architecture, where we design and scale what will be the largest GPU cluster for Physical AI in the industry; and Performance Optimization, where we push the limits of hardware utilization, throughput, and cost efficiency for large-scale training and inference workloads. You'll work at the intersection of systems engineering and ML, partnering directly with stack development and research teams to remove bottlenecks and accelerate the path from experimentation to production.

At Applied Intuition, you will:

  • Grow and manage a team of world-class infrastructure and systems engineers with the goal of delivering a best-in-class ML platform for Physical AI
  • Own the design and evolution of frameworks for orchestrating distributed training and inference jobs across thousands of GPUs
  • Drive the buildout and scaling of our GPU cluster infrastructure, making critical decisions on architecture, scheduling, networking, and resource management
  • Lead efforts to optimize training and inference performance, including throughput, fault tolerance, GPU utilization, and cost efficiency at scale
  • Set team goals and roadmap in alignment with research milestones, model development timelines, and production deployment requirements
  • Partner closely with research, stack development, and infrastructure teams to understand their workflows and accelerate their iteration speed
  • Drive hiring, mentoring, and growth for a high-performing, mission-driven team

We’re looking for someone who has:

  • 3+ years of engineering management experience, ideally leading infrastructure or platform teams
  • Passion for building and leading high-performing teams that operate at the frontier of scale
  • Deep experience with distributed systems, GPU computing, or large-scale ML infrastructure
  • Direct experience building or operating large GPU clusters (1,000+ GPUs)
  • Strong understanding of distributed training frameworks (e.g., PyTorch Distributed, Megatron-LM, DeepSpeed, FSDP) and job orchestration at scale
  • Familiarity with GPU cluster management, high-performance networking (InfiniBand, RDMA), and resource scheduling (Slurm, Kubernetes)
  • Track record of building and operating systems that run reliably at massive scale

Nice to have:

  • Background in training optimization techniques such as mixed-precision training, pipeline/tensor/data parallelism, or checkpointing strategies
  • Experience with inference optimization (batching, model serving, quantization, compiler-level optimizations)
  • Familiarity with Physical AI domains such as autonomous driving, robotics, or simulation
  • Contributions to open-source ML infrastructure projects

Compensation at Applied Intuition for eligible roles includes base salary, equity, and benefits. Base salary is a single component of the total compensation package, which may also include equity in the form of options and/or restricted stock units, comprehensive health, dental, vision, life and disability insurance coverage, 401k retirement benefits with employer match, learning and wellness stipends, and paid time off. Note that benefits are subject to change and may vary based on jurisdiction of employment.

Applied Intuition pay ranges reflect the minimum and maximum intended target base salary for new hire salaries for the position. The actual base salary offered to a successful candidate will additionally be influenced by a variety of factors including experience, credentials & certifications, educational attainment, skill level requirements, interview performance, and the level and scope of the position. Please reference the job posting’s subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the location listed is: $204,000 - $343,000 USD annually.

  • Posted At: 2 days ago
  • Salary: 204K–343K a year
  • Schedule Type: Full-time
  • Qualifications: No degree mentioned

Enhance Your Job Prospects

  • Access exclusive job listings in specialized industries.
  • Optimize your CV with our AI-powered tools.
  • Define salary and equity expectations from the start.
  • Find the perfect fit with personalized job filters.
  • Get top founders and recruiters reaching out to you.

Showing 127 to 133 of 2696 results