Software Engineer, AI Inference
San Jose, CA
Persimmons, Inc.
Job Description: Software Engineer. AI Inference
Department: Engineering
Reporting to: VP, Software Engineering
Who we are:
At Persimmons, Inc., we are pioneering the future of generative AI with our ground-breaking full-stack innovations, including hardware and software integration capable of supporting trillion-parameter models. We envision the future of AI hardware with our cutting-edge generative AI solutions designed for edge computing and hyperscale cloud environments. Our mission is to empower organizations to deploy AI capabilities at unprecedented scales and performance levels, bridging the gap between groundbreaking technology and real-world applications. We partner with some of the world's leading edge and hyperscale cloud companies to deliver high-performance, secure, and compliant AI hardware solutions.
What you’ll do:
As Software Engineer, AI Inference, you will be responsible for designing, developing, and maintaining software including SDK tools, inference server, customer facing API, cloud platform and UI interface work. Your experience in python based applications, hosting cloud services, REST API design and having a good aesthetic sense of design and user experience will help define success in this role. Your primary duties and responsibilities include:
Develop software in python and web frameworks for SDK tools for observability, telemetry and also device interfacing.
Develop software in python for our inference server and build our customer facing REST API and UI interfaces
Collaborate with the compiler and device software teams to develop software that interacts with Persimmons hardware
Containerize applications in Docker and deploy them as cloud services
Qualifications:
Experience with Python backend development and REST API design
Experience with web applications and cloud services
Proficiency in Python programming.
What We Offer:
The opportunity to work at the forefront of innovation with some of the most advanced technologies in AI.
A collaborative environment where innovation is at the heart of everything we do.
Competitive salary and benefits package.
Flexible PTO.
401k.
Persimmons, Inc. is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Please submit your resume and a brief cover letter through our official application portal at https://jobs.ashbyhq.com/persimmons-ai. We kindly request that applicants refrain from contacting our employees directly regarding this listing. Additionally, we do not work with external recruiters for these roles. If a recruiter refers a candidate, no referral fees will be paid, and the referral will be considered voluntary.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Docker Engineering Generative AI Python REST API
Perks/benefits: Competitive pay Flex vacation
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.