Senior/Staff AI Engineer
San Francisco - Remote, CA, United States
USD 150K-250K | Senior-level | Full Time
Tasks
- Build LLM serving systems
- Design scalable RAG infrastructure
- Engineer distributed AI systems for model serving
- Improve GPU and CPU performance pathways
- Optimize KV cache performance
- Optimize LLM inference performance
- Reduce inference latency and bottlenecks
- Tune memory and storage throughput
Perks/Benefits
- N/A
Skills/Tech-stack
CPU | Distributed Systems | GPU | Inference | KV cache | Language Models | Large Language Models | Latency optimization | Memory Optimization | Model Serving | Performance Engineering | RAG | Retrieval | Storage Architecture | Throughput Optimization