AI Platform Engineer - Private AI
USA-TX-Austin - River Place B6, United States
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Full Time Senior-level / Expert USD 127K - 226K
Broadcom
Broadcom Inc. is a global technology leader that designs, develops and supplies a broad range of semiconductor, enterprise software and security solutions.Please Note:
1. If you are a first time user, please create your candidate login account before you apply for a job. (Click Sign In > Create Account)
2. If you already have a Candidate Account, please Sign-In before you apply.
Job Description:
Broadcom is looking for a Principal AI Platform Engineer to join VMware Cloud Foundation’s (VCF) AI and Advanced Services team. This position is key to building a best in class private cloud AI platform. You will have a high impact by playing a critical role designing and implementing scalable solutions along with a team of talented and enthusiastic engineers.
This role will be a member of the Product Engineering group in the AI & Advanced services R&D team in VCF Division. You will work closely with our tech leads to drive internal and external technical enablement of our platform, building, scaling, testing, demonstrating and delivering our software to various stakeholders such as marketing, IT, field engineering, system test, partners, customers, and our own engineering team.
The AI & Advanced Services team is responsible for building AI platform capabilities into the VMware Cloud Foundation product to enable our enterprise customers to have all of the AI platform features they need to build, deploy, test, manage, and scale their AI infrastructure and workloads.
Responsibilities
Collaborate with cross-functional teams to test, scale, and find opportunities to improve the performance and features of the AI platform
Decompose vague problems into detailed requirements, and develop solutions that meet the needs of our customers in partnership with the technical leads of the team
Work with the CI/CD infrastructure team to create automation to improve consistency and reduce time to deliver new environments
Participate in code reviews and ensure that the code is aligned with VMware's coding standards and best practices
Troubleshoot and resolve complex issues related to Private AI services and how those services interface with other components of the stack such as storage, networking, etc.
Requirements
Understanding of AI workloads and hardware such as scaling infrastructure to meet AI model performance requirements; scaling models across GPUs using constructs such as NVLink, or other similar use cases
Ability to stand up VCF and Private AI Foundation from a bare metal environment to a fully deployed platform with workloads deployed
Ability to configure GPUs according to different workload parameters and needs
Experience writing technical documentation and creating user-friendly blogs and videos on technical products and how to use them
2+ years hands on experience with Container technologies (Docker and Kubernetes)
Hands on experience deploying and maintaining Kubernetes Operators is a big plus
Proven knowledge of systems deployment in real world environments
Strong analytical and diagnostic skills with ability to work independently
Excellent communication and collaboration skills, with the ability to work with cross-functional teams
Experience with agile development methodologies and version control systems, such as Git
BS in Computer Science or related technical fields and 12+ years of related experience in the software industry or MS in Computer Science or related technical fields and 10+ years of related experience in the software industry
Candidate should not require sponsorship
What We Offer:
Competitive salary and benefits package
Opportunities for career growth and professional development
Collaborative and dynamic work environment
Access to cutting-edge technologies and tools
Additional Job Description:
Compensation and Benefits
The annual base salary range for this position is $127,100 - $226,000.
This position is also eligible for a discretionary annual bonus in accordance with relevant plan documents, and equity in accordance with equity plan documents and equity award agreements.
Broadcom offers a competitive and comprehensive benefits package: Medical, dental and vision plans, 401(K) participation including company matching, Employee Stock Purchase Program (ESPP), Employee Assistance Program (EAP), company paid holidays, paid sick leave and vacation time. The company follows all applicable laws for Paid Family Leave and other leaves of absence.
Broadcom is proud to be an equal opportunity employer. We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law. We will also consider qualified applicants with arrest and conviction records consistent with local law.
If you are located outside USA, please be sure to fill out a home address as this will be used for future correspondence.
Tags: Agile CI/CD Computer Science Docker Engineering Git Kubernetes ML infrastructure NVLink R R&D Testing
Perks/benefits: Career development Competitive pay Equity / stock options Health care Medical leave Salary bonus Signing bonus Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.