Software Engineer, AI Inference

San Jose, CA

Apply now Apply later

Job Description: Software Engineer. AI Inference

Department: Engineering

Reporting to: VP, Software Engineering

Who we are:

At Persimmons, Inc., we are pioneering the future of generative AI with our ground-breaking full-stack innovations, including hardware and software integration capable of supporting trillion-parameter models. We envision the future of AI hardware with our cutting-edge generative AI solutions designed for edge computing and hyperscale cloud environments. Our mission is to empower organizations to deploy AI capabilities at unprecedented scales and performance levels, bridging the gap between groundbreaking technology and real-world applications. We partner with some of the world's leading edge and hyperscale cloud companies to deliver high-performance, secure, and compliant AI hardware solutions.

What you’ll do:

As Software Engineer, AI Inference, you will be responsible for designing, developing, and maintaining software including SDK tools, inference server, customer facing API, cloud platform and UI interface work. Your experience in python based applications, hosting cloud services, REST API design and having a good aesthetic sense of design and user experience will help define success in this role. Your primary duties and responsibilities include:

  • Develop software in python and web frameworks for SDK tools for observability, telemetry and also device interfacing.

  • Develop software in python for our inference server and build our customer facing REST API and UI interfaces

  • Collaborate with the compiler and device software teams to develop software that interacts with Persimmons hardware

  • Containerize applications in Docker and deploy them as cloud services

Qualifications:

  • Experience with Python backend development and REST API design

  • Experience with web applications and cloud services

  • Proficiency in Python programming.

What We Offer:

  • The opportunity to work at the forefront of innovation with some of the most advanced technologies in AI.

  • A collaborative environment where innovation is at the heart of everything we do.

  • Competitive salary and benefits package.

  • Flexible PTO.

  • 401k.

Persimmons, Inc. is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Please submit your resume and a brief cover letter through our official application portal at https://jobs.ashbyhq.com/persimmons-ai. We kindly request that applicants refrain from contacting our employees directly regarding this listing. Additionally, we do not work with external recruiters for these roles. If a recruiter refers a candidate, no referral fees will be paid, and the referral will be considered voluntary.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: APIs Docker Engineering Generative AI Python REST API

Perks/benefits: Competitive pay Flex vacation

Region: North America
Country: United States

More jobs like this