Software Development Engineer - Generative AI, AGIF | Inference Engine
Boston, Massachusetts, USA
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Full Time Mid-level / Intermediate USD 129K - 223K
Amazon.com
Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...
Are you interested in advancing Amazon's Generative AI capabilities? Come work with a talented team of engineers and scientists in a highly collaborative and friendly team. We are building state-of-the-art Generative AI technology that will benefit all Amazon businesses and customers.
Key job responsibilities
As a Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high performance model inference capabilities, including but not limited to multi-modality, SOTA model architectures, latency, throughput, and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy, and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.
A day in the life
You will consult with scientists to get inspiration of emerging techniques, and blend those into our roadmap; You will design and experiment with new algorithms from public and internal papers, benchmark the latency and accuracy of your implementations; Most importantly you will implement production grade solutions, and see them through the deployments swiftly; You may need to collaborate with other science and engineering teams to get things done properly; You will hold highest bar in operational excellence and support production systems, and constantly create solutions to minimize the ops load.
About the team
Our mission is to build best-in-class, fast, accurate, and cost-efficient frontier model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.
- 3+ years of non-internship professional software development experience
- Must have one of the following two: 1) Prior experience with software performance optimization Or 2) Knowledge of Deep Learning and Transformer architectures
- Bachelor's degree in computer science or equivalent
- Experience with Large Language Model Inference
- Experience with GPU programming (TensorRT-LLM)
- Experience with Python, PyTorch, and C++ programming and performance optimization
- Experience with Trainium and Inferentia Development
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.
Key job responsibilities
As a Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high performance model inference capabilities, including but not limited to multi-modality, SOTA model architectures, latency, throughput, and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy, and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.
A day in the life
You will consult with scientists to get inspiration of emerging techniques, and blend those into our roadmap; You will design and experiment with new algorithms from public and internal papers, benchmark the latency and accuracy of your implementations; Most importantly you will implement production grade solutions, and see them through the deployments swiftly; You may need to collaborate with other science and engineering teams to get things done properly; You will hold highest bar in operational excellence and support production systems, and constantly create solutions to minimize the ops load.
About the team
Our mission is to build best-in-class, fast, accurate, and cost-efficient frontier model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.
Basic Qualifications
- 3+ years of non-internship professional software development experience
- Must have one of the following two: 1) Prior experience with software performance optimization Or 2) Knowledge of Deep Learning and Transformer architectures
Preferred Qualifications
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience- Bachelor's degree in computer science or equivalent
- Experience with Large Language Model Inference
- Experience with GPU programming (TensorRT-LLM)
- Experience with Python, PyTorch, and C++ programming and performance optimization
- Experience with Trainium and Inferentia Development
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.
Job stats:
5
1
0
Categories:
Deep Learning Jobs
Engineering Jobs
Generative AI Jobs
Tags: Architecture Computer Science Deep Learning Engineering Generative AI GPU LLMs Model inference Python PyTorch SDLC TensorRT Testing
Perks/benefits: Career development Equity / stock options
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Sr. Data Engineer jobsPower BI Developer jobsPrincipal Data Engineer jobsData Scientist II jobsBI Developer jobsStaff Data Scientist jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsDevOps Engineer jobsData Science Intern jobsJunior Data Analyst jobsSoftware Engineer II jobsAI/ML Engineer jobsStaff Software Engineer jobsData Science Manager jobsData Manager jobsLead Data Analyst jobsData Analyst Intern jobsData Specialist jobsSr. Data Scientist jobsBusiness Data Analyst jobsBusiness Intelligence Analyst jobsData Governance Analyst jobsData Engineer III jobsSenior Backend Engineer jobs
Consulting jobsMLOps jobsAirflow jobsOpen Source jobsKafka jobsEconomics jobsKPIs jobsGitHub jobsLinux jobsJavaScript jobsTerraform jobsPostgreSQL jobsRAG jobsPrompt engineering jobsBanking jobsStreaming jobsData Warehousing jobsScikit-learn jobsNoSQL jobsClassification jobsRDBMS jobsComputer Vision jobsPhysics jobsdbt jobsHadoop jobs
Pandas jobsScala jobsGoogle Cloud jobsGPT jobsData warehouse jobsR&D jobsLangChain jobsMicroservices jobsBigQuery jobsCX jobsELT jobsOracle jobsDistributed Systems jobsScrum jobsLooker jobsReact jobsIndustrial jobsPySpark jobsRedshift jobsJira jobsOpenAI jobsRobotics jobsSAS jobsUnstructured data jobsSalesforce jobs