Engineering Manager, AI Performance
San Francisco, CA - US
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Full Time Mid-level / Intermediate USD 259K+
Crusoe
Crusoe is on a mission to align the future of computing with the future of the climate.Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.
Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.
About This Role:
Crusoe Energy is on a mission to align the future of computing with the future of the climate. As an Engineering Manager, AI Performance, you will lead a team of high-performance computing experts focused on optimizing Crusoe’s inference engines and AI infrastructure. Your team’s work will directly impact Crusoe’s core business by accelerating model throughput and maximizing hardware efficiency.
In this hybrid leadership role, you will combine technical expertise with people management to deliver performance gains across our scalable AI systems. You’ll partner closely with AI researchers, infrastructure engineers, and product leaders to push the boundaries of what’s possible in real-time inference.
This is a full-time hybrid position based in San Francisco, CA or Sunnyvale, CA, with an in-office presence required three times per week.
What You’ll Be Working On:
Team Leadership & Strategic Execution:
Manage and grow a team of performance engineers optimizing Crusoe’s inference stack
Define team goals, set technical direction, and ensure timely delivery of high-impact projects
Provide coaching, mentorship, and support for individual team member development
Technical Oversight:
Guide performance improvements in inference engines such as VLLM and related systems
Lead efforts to implement optimizations that increase token throughput and system efficiency
Oversee the development of CUDA kernels and other low-level performance enhancements for deep learning workloads
Cross-Functional Collaboration:
Partner with AI researchers, product managers, and infrastructure teams to prioritize and deliver performance improvements
Drive initiatives to improve observability, performance profiling, and benchmarking infrastructure
Contribute to internal standards for onboarding, documentation, and knowledge sharing across the org
Industry Engagement & Innovation:
Stay informed on the latest trends in high-performance AI infrastructure and inference acceleration
Support contributions to open-source projects and participate in relevant research communities and forums
What You’ll Bring to the Team:
Leadership & Team Management:
2+ years managing or technically leading high-performing engineering teams
Proven track record of delivering performance-critical software in AI, HPC, or similar environments
Strong ability to prioritize work, communicate trade-offs, and lead teams through ambiguity
Technical Depth:
Strong understanding of CUDA or OpenCL with experience in kernel development or GPU optimization
Proficiency in Python and familiarity with deep learning frameworks such as PyTorch or TensorFlow
Expertise in CPU/GPU performance profiling and system-level optimization
Bonus Points:
Experience bringing early-stage projects from concept to deployment (“zero-to-hero” mindset)
Knowledge of SIMD, AVX, or vector processing techniques
Background in graphics shaders, high-performance rendering pipelines, or similar parallel compute workloads
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid Commuter FSA benefit of $200 per month
Compensation:
Compensation will be paid up to $259,000 a year + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
Tags: CUDA Deep Learning Engineering GPU HPC ML infrastructure Open Source Pipelines Python PyTorch Research SIMD TensorFlow vLLM
Perks/benefits: 401(k) matching Career development Competitive pay Equity / stock options Health care Insurance Parental leave Salary bonus Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.