Worldwide Specialist Solutions Architect, GenAI - Training & Inference, Data & AI GTM

Austin, Texas, USA

Full Time Senior-level / Expert USD 138K - 239K

Amazon.com

Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...

View all jobs at Amazon.com

Apply now Apply later

Posted 3 hours ago

Do you want to help define the future of Go to Market (GTM) at AWS using generative AI (GenAI)?

AWS Sales, Marketing, and Global Services (SMGS) is responsible for driving revenue, adoption, and growth from the largest and fastest growing small- and mid-market accounts to enterprise-level customers including public sector.

Within SMGS, you will be part of the core worldwide GenAI, Training & Inference team, responsible for defining, building, and deploying targeted strategies to accelerate customer adoption of our services and solutions across industry verticals.

You will be working directly with the most important customers (across segments) in the GenAI model training and inference space helping them adopt and scale large-scale workloads (e.g., foundation models) on AWS. You will conduct model performance evaluations, optimizations and work directly with engineering and product to optimize the ML stack for efficiency and scale. You will conduct external/internal evangelism, and developing demos and proof-of-concepts

Key job responsibilities
You will help develop the industry’s best cloud-based solutions to grow the GenAI business. Working closely with our engineering teams, you will help enable new capabilities for our customers to develop and deploy GenAI workloads on AWS. You will facilitate the enablement of AWS technical community, solution architects and, sales with specific customer centric value proposition and demos about end-to-end GenAI on AWS cloud.

You will possess a technical and business background that enables you to drive an engagement and interact at the highest levels with startups, Enterprises, and AWS partners. You will have the technical depth and business experience to easily articulate the potential and challenges of GenAI models and applications to engineering teams and C-Level executives. This requires deep familiarity across the stack – compute infrastructure (Amazon EC2, Lustre), ML frameworks PyTorch, JAX, orchestration layers Kubernetes and Slurm, parallel computing (NCCL, MPI), MLOPs, as well as target use cases in the cloud.

You will drive the development of the GTM plan for building and scaling GenAI on AWS, interact with customers directly to understand their business problems, and help them with defining and implementing scalable GenAI solutions to solve them (often via proof-of-concepts). You will also work closely with account teams, research scientists, and product teams to drive model implementations and new solutions.

You should be passionate about helping companies/partners understand best practices for operating on AWS. An ideal candidate will be adept at interacting, communicating and partnering with other teams within AWS such as product teams, solutions architecture, sales, marketing, business development, and professional services, as well as representing your team to executive management. You will have a natural appetite to learn, optimize and build new technologies and techniques. You will also look for patterns and trends that can be broadly applied across an industry segment or a set of customers that can help accelerate innovation.

This is an opportunity to be at the forefront of technological transformations, as a key technical leader. Additionally, you will work with the AWS ML and EC2 product teams to shape product vision and prioritize features for AI/ML Frameworks and applications. A keen sense of ownership, drive, and being scrappy is a must.

Basic Qualifications

- Bachelor's degree in computer science, engineering, mathematics or equivalent
- 8+ years of specific technology domain areas (e.g. software development, cloud computing, systems engineering, infrastructure, security, networking, data & analytics) experience
- 3+ years of design, implementation, or consulting in applications and infrastructures experience
- 5+ years building or optimizing computational applications for large scale HPC systems (e.g. physics based simulations) to take advantage of high performance networking (e.g. Amazon EFA, Infiniband, RoCE), distributed parallel filesystems (e.g. Lustre, BeeGFS, GPFS) and accelerators (e.g. GPUs, custom-silicon)
- Understanding of deep learning training and inference workloads and requirements for high performance compute, network and storage

Preferred Qualifications

- 5+ years of infrastructure architecture, database architecture and networking experience
- Experience working with end user or developer communities
- 8+ years building or optimizing computational applications for large scale HPC systems (e.g. physics based simulations) to take advantage of high performance networking (e.g. Amazon EFA, Infiniband, RoCE), distributed parallel filesystems (e.g. Lustre, BeeGFS, GPFS) and accelerators (e.g. GPUs, custom-silicon)
- Professional experience in deep learning training and inference workloads and requirements for high performance compute, network and storage

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $138,200/year in our lowest geographic market up to $239,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.

Apply now Apply later

Job stats: 0 0 0

Categories: Architecture Jobs Deep Learning Jobs Generative AI Jobs

Tags: Architecture AWS Computer Science Consulting Deep Learning EC2 Engineering Generative AI HPC InfiniBand JAX Kubernetes Machine Learning Mathematics MLOps Model training Physics PyTorch Research Security