Engineering Manager, AI Performance

San Francisco, CA - US


Crusoe

Crusoe is on a mission to align the future of computing with the future of the climate.



Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.

Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role:
Crusoe Energy is on a mission to align the future of computing with the future of the climate. As an Engineering Manager, AI Performance, you will lead a team of high-performance computing experts focused on optimizing Crusoe’s inference engines and AI infrastructure. Your team’s work will directly impact Crusoe’s core business by accelerating model throughput and maximizing hardware efficiency.

In this hybrid leadership role, you will combine technical expertise with people management to deliver performance gains across our scalable AI systems. You’ll partner closely with AI researchers, infrastructure engineers, and product leaders to push the boundaries of what’s possible in real-time inference.

This is a full-time hybrid position based in San Francisco, CA or Sunnyvale, CA, with in-office presence required three days per week.

What You’ll Be Working On:

Team Leadership & Strategic Execution:

  • Manage and grow a team of performance engineers optimizing Crusoe’s inference stack

  • Define team goals, set technical direction, and ensure timely delivery of high-impact projects

  • Provide coaching, mentorship, and support for individual team member development

Technical Oversight:

  • Guide performance improvements in inference engines such as vLLM and related systems

  • Lead efforts to implement optimizations that increase token throughput and system efficiency

  • Oversee the development of CUDA kernels and other low-level performance enhancements for deep learning workloads

Cross-Functional Collaboration:

  • Partner with AI researchers, product managers, and infrastructure teams to prioritize and deliver performance improvements

  • Drive initiatives to improve observability, performance profiling, and benchmarking infrastructure

  • Contribute to internal standards for onboarding, documentation, and knowledge sharing across the org

Industry Engagement & Innovation:

  • Stay informed on the latest trends in high-performance AI infrastructure and inference acceleration

  • Support contributions to open-source projects and participate in relevant research communities and forums

What You’ll Bring to the Team:

Leadership & Team Management:

  • 2+ years managing or technically leading high-performing engineering teams

  • Proven track record of delivering performance-critical software in AI, HPC, or similar environments

  • Strong ability to prioritize work, communicate trade-offs, and lead teams through ambiguity

Technical Depth:

  • Strong understanding of CUDA or OpenCL with experience in kernel development or GPU optimization

  • Proficiency in Python and familiarity with deep learning frameworks such as PyTorch or TensorFlow

  • Expertise in CPU/GPU performance profiling and system-level optimization

Bonus Points:

  • Experience bringing early-stage projects from concept to deployment (“zero-to-hero” mindset)

  • Knowledge of SIMD, AVX, or vector processing techniques

  • Background in graphics shaders, high-performance rendering pipelines, or similar parallel compute workloads

Benefits:

  • Industry-competitive pay

  • Restricted Stock Units in a fast-growing, well-funded technology company

  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

  • Employer contributions to HSAs

  • Paid Parental Leave

  • Paid life insurance, short-term and long-term disability

  • Teladoc

  • 401(k) with a 100% match up to 4% of salary

  • Generous paid time off and holiday schedule

  • Cell phone reimbursement

  • Tuition reimbursement

  • Subscription to the Calm app

  • MetLife Legal

  • Company-paid Commuter FSA benefit of $200 per month

Compensation:
Compensation is up to $259,000 a year plus bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.


Tags: CUDA Deep Learning Engineering GPU HPC ML infrastructure Open Source Pipelines Python PyTorch Research SIMD TensorFlow vLLM

Perks/benefits: 401(k) matching Career development Competitive pay Equity / stock options Health care Insurance Parental leave Salary bonus Startup environment

Region: North America
Country: United States
