Application Software Engineer, Inference
Tasks
- Architect model serving infrastructure
- Benchmark and accelerate inference engines
- Build reliable high concurrency serving systems
- Collaborate to integrate inference into systems
- Create CI CD infrastructure
- Develop inference systems
- Develop tools for tracing and replay
- Optimize inference latency and throughput
- Own request routing and rate limiting
Perks/Benefits
- 401k plan
- Employee stock purchase plan
- Long-term incentives
- Medical, dental & vision coverage
- Onsite Palo Alto
- Paid Holidays
- Paid parental leave
- Paid vacation
Skills/Tech-stack
Agent Orchestration | Agent SDK | Auto Scaling | Batch scheduling | C++ | CI/CD | ClickHouse | Continuous batching | Continuous integration | Distributed Systems | Docker | GPU Kernel | GRPC | Global KV Cache | Go | Inference | KV cache | Kubernetes | Language Models | Large Language Models | Load Balancing | MongoDB | Observability | Performance optimization | PostgreSQL | Profiling | Python | Quantization | REST | Rust | Speculative decoding
Education
Regions
Countries
States
Cities
Related jobs
-
Software Engineer III, AI/ML, Proxybidder ML USD 147K-211KC++ | Data Processing | Debugging | JAX | KerasBenefits | Bonus target | EquitySenior-level Full TimeNew York, NY, USA1h ago
-
Staff Software Engineer, Agent-Centric Data and APIs USD 207K-301KC++ | CSS | Data Engineering | Data Structures | Data Structures and AlgorithmsSenior-level Full TimeSan Francisco, CA, USA1h ago
-
Senior Software Engineer, AI/ML GenAI, Core USD 174K-253KC++ | Computer Vision | Data Processing | Data Storage | Data StructuresHealth insurance | Paid time off | Parental leave | Retirement plansSenior-level Full TimeSan Jose, CA, USA1h ago
-
Software Engineer, Google Cloud Storage USD 147K-211KAccess Control | C++ | Chaos Engineering | Cloud Functions | Cloud StorageMid-level Full TimeRaleigh, NC, USA; Durham, NC, USA1h ago
-
Software Engineer, ML Fleet Intelligence USD 207K-301KAlgorithms | Anomaly Detection | Cloud Computing | Data Analysis | Data ProcessingSenior-level Full TimeSunnyvale, CA, USA1h ago
-
Artificial Intelligence | Cloud Security | Cloud Security Incident Response | Cyber Security | Cyber ThreatBenefits | Full scope polygraph clearanceSenior-level Full TimeMaryland, USA1h ago
-
Staff Software Engineer, Cloud AI USD 207K-301KAgent Development | C++ | Cloud Computing | Cloud platform | Data StorageSenior-level Full TimeSunnyvale, CA, USA; New York, NY, …1h ago
-
Software Engineer III, Perception, XR USD 147K-211KAudio Processing | Data Processing | Java | Kotlin | Language ProcessingSenior-level Full TimeSan Jose, CA, USA1h ago
-
Machine Learning Engineer USD 130K-170KAirflow | CI/CD | DBT | Docker | ELT401k match | Dental insurance | Disability coverage | Life insurance | Medical insuranceSenior-level Full TimeNew York, NY, US5h ago
-
Senior Machine Learning Engineer, Applied AI Modeling USD 139K-218KClassification | Embeddings | Evaluation | Fine Tuning | Hugging FaceHome office stipend | Medical, dental & vision coverage | Paid Holidays | Paid parental leave | Professional development budgetSenior-level Full TimeRemote US R8h ago
-
Senior Principal Engineer- MLOps & AI Machinery, ADAS/AV USD 240K-320KASIL | Automation | Automotive Safety | Automotive hardware | Automotive safety standards401k match | Dental insurance | Flexible spending account | Health insurance | Health savings accountExecutive-level Full TimeSunnyvale, CA, United States8h ago
-
AI Software Engineer USD 100K-180KAgile | CI/CD | Code Analysis | Confluence | Data EngineeringCareer progression | World-class benefitsMid-level Full TimeAnnapolis Junction, Maryland, United States8h ago
-
AI Software Engineer-Senior USD 124K-195KAgile | Code Analysis | Confluence | Continuous Delivery | Continuous DeploymentCareer progression | World-class benefitsSenior-level Full TimeAnnapolis Junction, Maryland, United States8h ago
-
Senior-level Full TimeAnnapolis Junction, Maryland, United States8h ago
-
AI Analytic Software Engineer-Senior USD 130K-195KAWS Amplify | AWS Bedrock | Confluence | Docker | ElasticsearchCareer progression | World-class benefitsSenior-level Full TimeAnnapolis Junction, Maryland, United States8h ago
-
Sr. AI Engineer USD 150K-175KAccess Control | Agentic Frameworks | Auditability | CI/CD | Cloud Native401-k match | Dental insurance | Expense Reimbursement for Home Office | Life insurance | Medical insuranceSenior-level Full TimeRemote, USA, United States R9h ago
-
AI Solutions Engineer (Fixed-Term) USD 70K-300KAgentic Systems | Application development | Customer discovery | LLM Application Development | LLM applicationMid-level TemporaryIrvine, CA11h ago
-
Senior Analytics Engineer USD 159K-200KAWS | Airflow | DBT | Dagster | Data ObservabilityAutonomy | Fully remote | High-impact work | Use of AI toolsSenior-level Full TimeRemote US R11h ago
-
Data Platform Engineer USD 125K-188KAPI Gateway | AWS | AWS Glue | AWS Kinesis | AWS LambdaEmployee discounts | Matching 401-K | Medical/Dental/Vision | Paid time off | Wellness programMid-level Full TimeUnited States, San Mateo, CA13h ago
-
Senior Machine Learning Engineer USD 185K-255KApache Spark | CI/CD | Docker | Drift Detection | Feature EngineeringEvents and activities | Healthcare coverage | Hybrid work flexibility | Self-managed PTO | SnacksSenior-level Full TimeSeattle, WA13h ago
-
C++ | Dataset design | Deep learning | Generative AI | JavaSenior-level Full TimeSunnyvale, California, USA13h ago
-
Systems Software Test Lead - Robotics USD 150K-190KAWS | Agile | Autonomy | Cloud Computing | Control SystemsCareer growth opportunities | Comprehensive benefits | MentorshipSenior-level Full TimeNorth Reading, Massachusetts13h ago
-
Data Platform Administrator USD 59K-72KAccess Management | Apache Spark | Automation | CI/CD | Data Governance401k employer matching | Birthday leave | Commuter benefits program | Educational assistance | Employer-paid health insuranceSenior-level Full TimeRockville, MD, US13h ago
-
Senior Machine Learning Operations Engineer USD 166K-208KAlerting | CI/CD | Canary Deployment | Champion Challenger | Drift DetectionSenior-level Full TimeSan Francisco, CA, New York, NY, … R14h ago
-
Adjunct Instructor, Applied AI Engineering - Howard University - Fall 2026 (In-Person, Master’s Degree Required) USD 150K-150KAI Assisted Development | Codebase Management | Generative AI | Language Models | Large Language Models401k | Employee assistance program | Exclusive marketplace savingsMid-level Full TimeWashington DC, United States14h ago