Generative AI Inference Solutions Architect

Remote Office

Cerebras Systems

Cerebras is the go-to platform for fast and effortless AI training and inference.

View all jobs at Cerebras Systems

Apply now Apply later

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.  

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

The Role

As a solutions architect for Cerebras Inference platform, you will provide technical guidance in our sales initiatives, showcase the capabilities of our hardware and software solutions, and drive customer engagements. You will be working with the fastest inference engine in the world and will help our customers to understand and realize its potential for existing and completely new business applications.   

We are looking for talented AI Solutions Architects with a blend of deep technical expertise, customer-facing soft skills and sales acumen. The ideal candidate will also bring a broad knowledge of various industries. 

Responsibilities
  • Lead the technical aspects of the sales process  
    • Join sales calls to present technical aspects of Cerebras Inference solution, addressing customer questions and demonstrating our value proposition. Provide in-depth explanations of our product features, focusing on performance benefits, scalability, and optimizations that our specialized hardware enables. 
    • Understand and gather customer requirements.  
  • Design, scope and drive demos, trials and PoCs  
    • Design demos to showcase key advantages of our unique product 
    • Scope and drive customer trials and proof-of-concept projects, define success metrics, oversee execution and ensure a smooth experience customer satisfaction. 
  • Own end-to-end delivery of the solution, provide technical guidance during deployment and post-sales support 
    • Work closely with customers to design deployment solutions tailored to their needs 
    • Drive end-to-end delivery of the solution from the technical side 
    • Build and maintain strong customer relationships to become their go-to technical expert 
  • Provide feedback to the internal product and engineering teams
    • Collaborate with internal teams, including R&D and product management, to communicate customer feedback and drive future product improvements.  
Requirements 
  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field. 
  • 5+ years in customer-facing engineering roles. 
  • Strong understanding of Generative AI model architecture, inference optimization, enterprise infrastructure and deployment challenges. 
  • Experience with specialized AI accelerators. 
  • Solid programming skills in Python and familiarity with distributed computing. 
  • Exceptional communication skills with the ability to explain complex technical concepts to both technical and non-technical audiences. 
  • Ability to work collaboratively in a fast-paced environment and adapt to changing customer needs. 
  • Ability to manage complex technical projects and deliver solutions tailored to customer needs.  
  • Strong interpersonal and communication skills, effective in collaborative and fast-paced team settings.  
Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection  point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

  1. Build a breakthrough AI platform beyond the constraints of the GPU
  2. Publish and open source their cutting-edge AI research
  3. Work on one of the fastest AI supercomputers in the world
  4. Enjoy job stability with startup vitality
  5. Our simple, non-corporate work culture that respects individual beliefs

Read our blog: Five Reasons to Join Cerebras in 2024.

Apply today and become part of the forefront of groundbreaking advancements in AI.

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  4  2  0

Tags: Architecture Computer Science Engineering Generative AI GPU Machine Learning Open Source Python R R&D Research

Perks/benefits: Career development Startup environment

Regions: Remote/Anywhere North America
Country: United States

More jobs like this