Member of Technical Staff, Inference & RL Systems
Tasks
- Automate fault detection and recovery
- Build and maintain distributed RL and post-training infrastructure
- Collaborate with research teams on execution systems
- Design and scale inference serving systems
- Improve reliability of rollout, evaluation, and reward pipelines
- Improve throughput and latency for long-context workloads
- Optimize KV-cache management and batching
- Profile and eliminate performance bottlenecks
Perks/Benefits
- 401k with matching
- Equity
- Health insurance
- Relocation stipend
- Unlimited paid time off
- Visa sponsorship
Skills/Tech-stack
Distributed Systems | GPU | Inference Serving | Memory Management | Model Execution | Performance Profiling | Performance Debugging | RL Infrastructure | Scaling Systems | System Optimization
Education
N/A