Senior Software Engineer II, Inference
Tasks
- Apply graceful degradation
- Decompose multi-service work into milestones
- Define SLIs/SLOs
- Drive architecture
- Elevate coding and testing standards
- Implement autoscaling policy
- Implement inference optimizations
- Lead design reviews
- Mentor engineers
- Own post incident actions
- Perform capacity planning
- Plan rollback and traffic shift
- Quantify optimization impact
- Review cross-team designs
Perks/Benefits
- 401k match
- Employee stock purchase program ESPP
- Flexible PTO
- Flexible spending account
- Health savings account
- Life insurance
- Medical/Dental/Vision insurance
- Paid parental leave
- Tuition reimbursement
Skills/Tech-stack
Autoscaling | BF16 | Batching | C++ | CI/CD | CUDA | Caching | Distributed Systems | FP8 | GPU Resource Isolation | Go | Grafana | Kubernetes | Latency optimization | Mixed Precision | NCCL | NUMA | Networked systems | OpenTelemetry | Performance Engineering | Prometheus | Python | RDMA | Ray Serve | Reliability Engineering | Resource Isolation | SLIs | SLOs | Streaming | TensorRT-LLM | Throughput Optimization | Torchserve | Triton | VLLM
Education
N/A
Regions
Countries
States
Related jobs
-
Automation Testing | CI/CD | CSS | Cypress | Feature DevelopmentMedical, dental & vision coverage | Paid time off | Parental leave | Reimbursement programs | Retirement planMid-levelRaleigh, United States R16d ago
-
Senior Developer USD 145K-150KAPI | AWS ECS | AWS EKS | AWS Fargate | Amazon S3Agile team collaboration | Remote workSenior-level Full TimeFairfax, VA, United States3h ago
-
Ansible | Azure | Azure Foundry | Azure Monitor | CI/CDMid-level Full TimeHuntsville, AL4h ago
-
Technical Architect – AI, ML & Generative AI USD 150K-202KAWS Bedrock | AWS SageMaker | Apache Spark | Artificial Intelligence | CI/CD401k | Critical Illness Accident Hospital Indemnity Identity Theft Protection | Dental plans | Life and Accidental Death and Dismemberment | Long-term disabilitySenior-level Full TimeFrisco, United States5h ago
-
Senior Full-Stack Data Engineer (Remote) USD 110K-130KAI code assistance | Automated testing | Azure DevOps | CI/CD | Code assistanceFull-time | Fully remote | Long-term | No timezone shiftingSenior-level Full TimeFlorida, Aventura, United States of America R6h ago
-
Senior Data Engineer on Site USD 140K-200KAWS Lambda | Amazon Kinesis | Amazon S3 | Data Modeling | Data ProcessingDay 1 impact | Fast execution | Low corporate overhead | Own major product parts | Shape technical visionSenior-level Full TimeSan Francisco, United States6h ago
-
Bash | C# | CI/CD | JUnit | JavaFinancial benefits | Health and wellness benefitsEntry-level Full TimePennsylvania, Exton7h ago
-
Mid-level Full TimeBurlingame, CA8h ago
-
Machine Learning Engineer USD 214K-240KBigtable | Code review | Computer Vision | Data Mining | Data PipelinesMid-level Full TimeBellevue, WA8h ago
-
Data Engineer, Analytics USD 209K-235KData Warehousing | Dimensional Modeling | ETL | MPP | MapReduceEntry-level Full TimeBurlingame, CA8h ago
-
Senior Software Engineer, AI/ML GenAI USD 174K-252KBenchmarking | C++ | Cloud platform | Data Processing | DebuggingSenior-level Full TimeSunnyvale, CA, USA9h ago
-
Staff Software Engineer, ML Infrastructure, Applied AI USD 207K-300KC++ | Code review | Data Processing | Debugging | Design reviewSenior-level Full TimeSunnyvale, CA, USA9h ago
-
Software Engineer, GenAI, Platforms and Devices USD 147K-211KAudio generation | C++ | Data Processing | Debugging | Image GenerationMid-level Full TimeMountain View, CA, USA9h ago
-
Practice Customer Engineer IV, Cloud AI, Google Cloud USD 192K-267KAgent systems | Autogen | Convolutional Neural Networks | CrewAI | Deep learningSenior-level Full TimeSunnyvale, CA, USA; Seattle, WA, USA9h ago
-
Software Engineer, Machine Learning USD 207K-300KAlgorithms | C++ | Data Processing | Data Structures | DebuggingSenior-level Full TimeNew York, NY, USA; Mountain View, …9h ago
-
Senior Blackbelt Engineer, Gemini Cloud Assist (English) USD 234K-325KAPI | Artificial Intelligence | CNTK | Caffe | Computer VisionSenior-level Full TimeSunnyvale, CA, USA9h ago
-
Cloud Architecture | Diagnostics | EKS | Enterprise Infrastructure | FirewallMid-level Contract TemporaryNew York, USA9h ago
-
Associate Principal, Trust and Safety, GenAI USD 142K-205KCybersecurity | Dashboarding | Data Transformation | Data Visualization | Data collectionMid-level Full TimeWashington D.C., DC, USA; Atlanta, GA, …9h ago
-
Software Engineer III, Rankings, Lens Quality USD 147K-211KAlgorithms | C plus plus | Data Processing | Data Structures | DebuggingSenior-level Full TimeMountain View, CA, USA9h ago
-
Software Engineer Manager, Network Capacity Scaling USD 207K-300KAlerting | Code review | Compute Technologies | Data Processing | Distributed SystemsSenior-level Full TimeSunnyvale, CA, USA9h ago
-
Staff Software Engineer, Cooling Optimization USD 207K-300KC++ | Control Theory | Cooling systems | Data Center Technology | Data StructuresSenior-level Full TimeSunnyvale, CA, USA9h ago
-
C++ | Diagnostics | Distributed Systems | Firmware | GPUSenior-level Full TimeKirkland, WA, USA; New York, NY, …9h ago
-
Senior Data Engineer - Remote USD 150K-160KAPIs | Data Governance | Data Lineage | Data Management | Data Modeling401k | Dental insurance | Health and wellness programs | Health insurance | Life insuranceSenior-level Full TimeAtlanta, GA, US, 30328 R13h ago
-
Data Engineer I USD 68K-100KAPI Keys | Azure | Azure Data | Azure Data Factory | Azure Synapse401k match | Employee onboarding program | Flexible PTO | Health and wellness benefits | Onsite optionsEntry-level Full TimeMorrisville, NC, US, 2756013h ago
-
Senior Analytics Engineer, Marketing USD 159K-201KAttribution | DBT | Data Modeling | Dimension tables | Dimensional ModelingAnnual equity refresh grants | New hire equity grant | Remote work flexibilitySenior-level Full TimeUnited States - Remote R16h ago