Software Engineer, Search AI Infra Performance
Tasks
- Collaborate with machine learning research and software reliability teams
- Design tools and processes for large language model developer experience
- Ensure large language model and AI launch under latency capacity reliability constraints
- Optimize latency and throughput across search stack for large language models
- Optimize machine learning workloads using TPU and ML accelerators
Perks/Benefits
- N/A
Skills/Tech-stack
Data Processing | Debugging | Distributed Systems | Generative AI | Language Models | Language Processing | Large Language Models | Latency optimization | Machine Learning | Model Deployment | Model Evaluation | Model Optimization | Natural Language | Natural Language Processing | Performance Engineering | Search infrastructure | System Performance | System performance engineering | Throughput Optimization
Education
Roles
Regions
Countries
States
Related jobs
-
AI Software Engineer USD 100K-200KDeep learning | Language Models | Large Language Models | Machine Learning | PyTorchMid-level Full TimeSan Francisco, CA, US / Palo …1h ago
-
Causal Inference | Cross-modal fusion | Data Modeling | Direct Preference Optimization | Graph Neural NetworksEntry-level Full TimeSeattle, Washington, United States3h ago
-
Machine Learning Engineer Intern (E-commerce Governance Algorithms) - 2026 Summer (BS/MS) USD 122K-246KAlgorithm Design | Fraud Detection | Machine Learning | Python | Risk AssessmentDevelopment workshops | Hands-on experience | Industry exposure | Social eventsEntry-level InternshipSeattle, Washington, United States3h ago
-
Artificial Intelligence | Big Data | Data Processing | Distributed Systems | High PerformanceEntry-level InternshipSan Jose, California, United States3h ago
-
Senior Data Engineer, YouTube Data Science USD 156K-226KApache Flume | Apache Spark | Automation | Business Intelligence | ComplianceSenior-level Full TimeSan Bruno, CA, USA4h ago
-
Staff Software Engineer, YouTube Data Science USD 207K-300KBig Data | Data Structures | Data Structures and Algorithms | Data analytics | Distributed ComputingSenior-level Full TimeSan Bruno, CA, USA4h ago
-
Software Engineer III, BigLake OSS USD 147K-211KApache Arrow | Apache Iceberg | Apache Spark | C++ | Data StorageSenior-level Full TimeSeattle, WA, USA4h ago
-
Senior Data Engineer USD 113K-188KApache Spark | Azure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake Storage401k retirement plan | Adoption Assistance | Employee referral program | Health savings account | Parental leaveSenior-level Full TimeGH Office: San Antonio, TX (9903 …15h ago
-
Senior Staff AI Data Infrastructure Engineer USD 203K-344KApache Iceberg | Apache Spark | C++ | Concurrent programming | Data LakehouseSenior-level Full TimeSanta Clara, CA17h ago
-
Software Engineer - GPU Inference USD 165K-330KAPI | Async Scheduling | CLI | CUDA | Distributed Systems401k | Fertility and family building stipend | Flexible PTO | Medical/Dental/Vision insurance | Paid parental leaveSenior-level Full TimeSan Francisco19h ago
-
Communication Protocols | Computer Vision | Control Systems | Debugging | GRPC401k plan | Dental insurance | Medical insurance | Relocation benefits | Unlimited PTOMid-level Full TimeSan Francisco, CA1d ago
-
Palantir Senior Data Engineer USD 135K-200KData Management | Data Processing | Data integration | Feature Engineering | Generative AISenior-level Full TimeAtlanta, Georgia, United States1d ago
-
Applied Research - Evals & Data USD 150K-300KAccelerate | Data Pipelines | Data Versioning | Distributed Systems | Distributed tracingConference attendance | Professional development budget | Relocation support | Remote work | Team offsitesSenior-level Full TimeSan Francisco1d ago
-
Staff Data Engineer USD 187K-245KAPI Gateway | Alerting | Amazon Redshift | Apache Airflow | BigQueryEquity | Flexible paid time off | Health insurance 100% paid premium | Lifestyle stipend | Parental leaveSenior-level Full TimeRemote, US R1d ago
-
Training: ML Framework Engineer USD 205K-445KDistributed Systems | Machine Learning | Performance optimization | Profiling | PythonHybrid work model | Relocation assistanceMid-level Full TimeSan Francisco1d ago
-
Staff AI engineer USD 125K-170KAI Evaluations | AWS | Agent Orchestration | Caching | Data PipelinesFlexible working hours | Hybrid work culture | Unlimited time offSenior-level Full TimeSan Francisco1d ago
-
Machine Learning Engineer: Perception and Planning USD 184K-275KAutomated testing | Behavior Prediction | C++ | Classification | Code reviewSenior-level Full TimeOakland, CA1d ago
-
ML Infrastructure Engineer USD 160K-230KAmazon SageMaker | Apache Airflow | Apache Spark | Argo Workflows | Cloud platformEntry-level Full TimeOakland, CA1d ago
-
Embedded Software Engineer II USD 129K-193KBootloader | C# | C++ | CI/CD | DebuggingDental insurance | Disability insurance | FSA | HSA | Health insuranceSenior-level Full TimeWestminster, CO1d ago
-
Sr. Back-End Software Engineer - Machine Learning USD 190K-250KC++ | Computer Vision | Distributed Systems | Language Processing | Linux401k matching | Commuter benefits | Dependent Family Medical Premium Coverage | Employee Medical Premium Coverage | Employee referral programSenior-level Full TimeSanta Clara, CA1d ago
-
Space Operations Engineer (Embedded Software) USD 100K-160KAPI Integration | ARM | C plus plus | C# | Command and controlMid-level Full TimeSan Francisco, CA1d ago
-
Director, Perception USD 253K-318K3D Modeling | CUDA | Computer Vision | Deep learning | GPU ComputingExecutive-level Full TimeFoster City, CA1d ago
-
Senior Quantum Applications Engineer - QEC USD 120K-258KCUDA-Q | Decoder algorithms | Docker | End to End | End-to-End TestingSenior-level Full TimeNew Haven, CT1d ago
-
Platform Engineer - Generative AI USD 120K-160KAPI Development | Caching | Database Design | Fine Tuning | Flask401k employer contribution | Dental insurance | Health insuranceMid-level Full TimeNew York, San Francisco, Munich or …1d ago
-
Software Development Engineer - ML Ops USD 123K-222KAWS | ArgoCD | CI/CD | Distributed Systems | DockerFlex work | On-call rotationSenior-level Full TimeUSA, GA, Atlanta, United States1d ago