Staff / Senior Software Engineer, Inference
San Francisco, CA | New York City, NY | Seattle, WA
USD 300K-485K Senior-level Full Time
Tasks
- Build and maintain AI inference infrastructure
- Build deployment pipelines for model releases
- Design intelligent request routing
- Develop autoscaling for production workloads
- Implement distributed systems for inference
- Implement load balancing and traffic management
- Integrate AI accelerator platforms
- Manage fleet wide orchestration
- Manage multi-region deployments
- Optimize compute efficiency
- Tune performance using observability data
Perks/Benefits
Skills/Tech-stack
AWS | Azure | Batching | Caching | Distributed Systems | GCP | Inference Optimization | Kubernetes | LLM Inference | LLM Inference Optimization | Load Balancing | Machine Learning | Python | Request Routing | Rust | Traffic Management
Education
Regions
Countries
States
Related jobs
-
Data Engineer, Analytics USD 205K-235KData Governance | Data Modeling | Data Quality | Data Security | Data VisualizationEntry-level Full TimeSeattle, WA1h ago
-
Data Engineer (Analytics) USD 191K-235KBig Data | Data Modeling | Data Warehousing | Data integration | Dimensional ModelingDomestic and international travel | TelecommutingMid-level Full TimeMenlo Park, CA | Remote, US R1h ago
-
Ad Ranking | Algorithms | C++ | Data Processing | Data StructuresSenior-level Full TimeMountain View, CA, USA1h ago
-
AI Engineer, Professional Services, Google Cloud USD 183K-265KApache Beam | Apache Spark | C++ | Data Validation | Data WarehousingTechnical workshops | Travel opportunitiesSenior-level Full TimeAustin, TX, USA; Atlanta, GA, USA1h ago
-
Senior Software Engineer, AI/ML GenAI, Core USD 174K-252KAlgorithms | C++ | Computer Vision | Data Processing | Data StructuresSenior-level Full TimeKirkland, WA, USA; Sunnyvale, CA, USA1h ago
-
Software Engineer III, AI/ML, Core USD 147K-211KAlgorithms | Data Processing | Data Storage | Data Structures | DebuggingSenior-level Full TimeSunnyvale, CA, USA1h ago
-
Software Engineer III, AI/ML, Google Workspace USD 147K-211KC++ | Data Processing | Debugging | Language Processing | ML InfrastructureSenior-level Full TimeBoulder, CO, USA1h ago
-
Software Engineering - DataStage Developer USD 112K-129KAxway | Axway Secure SFTP | Azure | Azure DevOps | CA7 SchedulerHybrid work schedule | Remote workMid-level Full TimeSyracuse, New York, United States4h ago
-
Senior Data Engineer - Knowledge Platform USD 160K-260KApache Airflow | Apache NiFi | Batch Processing | BigQuery | Cloud platformEquity compensation | Fully stocked kitchen | Open office space | Team building eventsSenior-level Full TimeUS - San Francisco8h ago
-
Robotics Platform Security Engineer USD 90K-300KAppArmor | Auditd | C# | C++ | CIS BenchmarksHybrid work option | On-site collaboration | Remote work optionSenior-level Full TimeIrvine, CA9h ago
-
Software Engineer II - Abnormal Data Platform USD 149K-214KAerospike | Amazon DynamoDB | Apache Spark | Data Storage | DatabricksDistributed team collaboration | Remote work | Technical mentorshipMid-level Full TimeRemote - USA R9h ago
-
Applied AI ML Engineer-Senior Associate USD 175K-210KAWS | Amazon Bedrock | Amazon EKS | Amazon SageMaker | Data ShardingBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeNew York, NY, United States11h ago
-
Machine Learning Engineer, Growth USD 130K-500KElasticsearch | Embeddings | Fine Tuning | Go | KafkaEquity grant | Free gym membership | Health insurance | Housing bonus | Meals stipendMid-level Full TimeSan Francisco11h ago
-
Senior Analytics Engineer USD 87K-161KData Lakehouse | Data mesh | Databricks | Delta Lake | ETL401k | Health insurance | Hybrid work | Paid time off | Remote workSenior-level Full TimeRemote-MO, United States R12h ago
-
Applied AI ML-Senior Associate USD 177K-240KAWS | Algorithms | Azure | Data Quality | Data StructuresBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeNew York, NY, United States13h ago
-
Abuse Test Engineer, Energy Storage USD 140K-224KActuation | Data Analysis | Data acquisition | Data logging | Electrochemical systemsSenior-level Full TimeMcCarran, NV13h ago
-
Data Warehouse Engineer USD 46K-60KCorepoint | Data Modeling | Data Quality | Data Validation | Data Warehousing401k match | Dental insurance | Discount programs | Employee counseling | FSAEntry-level Full TimeRemote, United States R13h ago
-
Lead Machine Learning Engineer USD 157K-237KA/B | A/B Testing | Airflow | B testing | Data PipelinesSenior-level Full TimeUS TX Austin14h ago
-
Senior Machine Learning Engineer USD 170K-237KA/B | A/B Testing | Apache Airflow | B testing | Deep learningSenior-level Full TimeUS TX Austin14h ago
-
AWS Batch | AWS EC2 | AWS IAM | AWS Lambda | AWS S3Annual bonus | Company paid benefits | Equity | Paid time offSenior-level Full TimeLos Angeles, California14h ago
-
Staff Applied AI Engineer, Enterprise GenAI USD 216K-270KAWS | Cloud platform | Data Analysis | Generative AI | Google CloudCommuter stipend | Equity compensation | Health, dental, vision insurance | Learning and development stipend | Paid time offSenior-level Full TimeSan Francisco, CA; Seattle, WA; New …14h ago
-
Entry-level InternshipHouston, TX14h ago
-
AI/ML Engineer USD 130K-223KAgentic AI | Deep learning | Distributed Training | Docker | EmbeddingsMid-level Full TimeScottsdale, AZ15h ago
-
Principal Engineer, Data & ML Platform USD 119K-180KAPIs | Automated testing | Cloud Native | Cloud platform | Continuous DeploymentSenior-level Full TimeScottsdale, AZ15h ago
-
Principal Machine Learning Engineer USD 245K-393KCloud infrastructure | Data Science | Distributed Systems | Infrastructure as Code | ML pipelinesSenior-level Full TimeChicago, Illinois, USA R15h ago