Software Engineer, Inference - Performance Optimization
Tasks
- Analyze inference workloads end to end across application, model, and fleet infrastructure
- Build performance models that turn microbenchmarks into cost-to-serve estimates
- Collaborate with engineering and research teams to improve production inference systems and project future impact
- Enhance tooling to identify latency and throughput bottlenecks across layers
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | Capacity Planning | Cost Modeling | Distributed Systems | Latency Analysis | Machine Learning | Machine Learning Inference | Microbenchmarking | Performance Analysis | Performance Profiling | Performance Optimization | Systems Modeling | Throughput Optimization
Education
N/A