Senior/Staff AI Engineer
San Francisco - Remote, CA, United States
R
USD 150K-250K Senior-level Full Time
Tasks
- Build LLM serving systems
- Design scalable RAG infrastructure
- Engineer distributed AI systems for model serving
- Improve GPU CPU performance pathways
- Optimize KV cache performance
- Optimize LLM inference performance
- Reduce inference latency and bottlenecks
- Tune memory and storage throughput
Perks/Benefits
- N/A
Skills/Tech-stack
CPU | Distributed Systems | GPU | Inference | KV cache | Language Models | Large Language Models | Latency optimization | Memory Optimization | Model Serving | Performance Engineering | RAG | Retrieval | Storage Architecture | Throughput Optimization
Education
Roles
Regions
Countries
States
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R3d ago
-
AWS Bedrock | Agent systems | Anthropic API | Autogen | Azure401k matching program | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Contract Full TimeRemote, OR, United States R11h ago
-
Edge AI Engineer USD 100K-150KC++ | Core ML | Cross Platform Inference | Cross-platform | DSPCareer growth potential | Full-time remote work | H1B transfer supportSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Computer Vision | Data QualityMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Engineer USD 100K-150KActive Learning | Apache Beam | CI/CD | Caching | Code review100 percent remote | Career growth | Full-time employment | H1B transfer support | W2 employmentMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Engineer USD 100K-150KApache Beam | CI/CD | Code review | Data Lineage | Data Modeling100 percent remote | Career growthMid-level Full TimeUnited States - Remote R1d ago
-
LLM Engineer USD 100K-150KAdapter methods | DPO | Deep reinforcement learning | Distributed Training | Efficient AttentionBenefits | Career growth | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
LLM Engineer USD 100K-150KDPO | Deep learning | Distributed Training | Efficient Attention | Efficient Fine TuningRemote workMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineer USD 100K-150KAgent architecture | Agent architectures | Agentic Workflows | Chunking | Deterministic systemsLong-term engagement | Mentorship | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering USD 100K-150KAgent systems | Agentic Workflows | Embeddings | Evaluation Pipelines | Fine TuningCareer growth potential | H1B transfer support | Long-term engagement | Remote work | Technical coding assessment requiredMid-level Full TimeUnited States - Remote R1d ago
-
Principal Applied AI Engineer, Finance USD 193K-340KAPI Development | AWS | Bias Mitigation | CI/CD | Churn modeling401k matching | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Full TimeVirtual Office (Massachusetts), United States R1d ago
-
Senior-level Full TimeRemote US, United States R1d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | C++100% remote | Full-time W2 employment | H1B transfer supportSenior-level Full TimeUnited States - Remote R1d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | C++Senior-level Full TimeWest Windsor / Princeton Jct., NJ R1d ago
-
Lead AI Engineer (AI Systems & Automation) USD 130K-260KAlerting | Anthropic API | Automation | Distributed Systems | DockerFully remote | Global Engineering Organization | High ownership culture | Learning and development budget | Modern engineering practicesSenior-level Full TimeUnited States R1d ago
-
Senior AI Engineer USD 153K-259KAgent Frameworks | Embeddings | Evaluation | Graph Databases | Human-in-the-loop401k plan | Flexible vacation policy | Flexible work policy | Health and wellness benefits | Paid HolidaysSenior-level Full TimeRemote - US R2d ago
-
Machine Learning Engineer V USD 231K-382KAWS | Agent Orchestration | Automated testing | Azure | CI/CDBonus eligibility | Disability insurance | Life insurance | Paid parental leave | Paid time offSenior-level Full TimeRemote, United States R2d ago
-
Senior AI Engineer USD 145K-181KAWS | Alerting | Azure | Docker | Embeddings401k match | Commuter benefits | Dental | Healthcare | Remote friendly workplaceSenior-level Full Time3750 Market Street, Philadelphia, PA, United … R3d ago
-
Senior Machine Learning Engineer USD 180K-250KComputer Vision | Data Pipelines | Data labeling | Deep learning | Embedding Models100 percent remote | 13 paid holidays | 401k plan | Dental insurance | Medical insuranceSenior-level Full TimeRemote USA R3d ago
-
Senior AI Engineer USD 250K-300KAPI Development | Artificial Intelligence | Cost Optimization | GitHub | Inference Optimization401k match | Co working sessions | Flexible PTO | Health and wellness allowance | Health insuranceSenior-level Full TimeSan Francisco (Hybrid) R3d ago
-
AWS | AWS CDK | Access Control | Airflow | Athena401k plan | Health insurance | Paid Holidays | Paid time off | Phone stipendSenior-level Full TimeSan Carlos - Hybrid R3d ago
-
Sr AI Engineer - Agentic Systems USD 166K-205KAI Safety | API Integration | Agent Orchestration | Artificial Intelligence | Distributed SystemsSenior-level Full TimeAnywhere, US R3d ago
-
Applied AI Specialist, Commercial Customer Success USD 105K-142KAPI Integration | Accuracy Monitoring | Automated testing | CRM | Evaluation FrameworksRemote workSenior-level Full TimeRemote - US R3d ago
-
Principal Software Engineer, Data Infrastructure USD 295K-345KAWS | Airflow | Chaos Engineering | Data Catalog | Distributed SystemsEquity compensation | Health benefits | Onsite work flexibilitySenior-level Full TimeSan Mateo, CA, United States R3d ago
-
Principal Machine Learning Engineer USD 285K-457KArtificial Intelligence | Classification | Deep learning | Embeddings | ExperimentationIn-person onboarding | Remote work optionsSenior-level Full TimeRemote - USA R3d ago