Software Engineer, Inference - Performance Optimization
Tasks
- Analyze inference workloads end to end across application, model, and fleet infrastructure
- Build performance models that turn microbenchmark results into cost-to-serve estimates
- Collaborate with engineering and research teams to improve production inference systems and project future impact
- Enhance tooling to identify latency and throughput bottlenecks across layers
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | Capacity Planning | Cost Modeling | Distributed Systems | Latency Analysis | Machine Learning | Machine Learning Inference | Microbenchmarking | Performance Analysis | Performance Profiling | Performance Optimization | Systems Modeling | Throughput Optimization
Education
N/A