Senior/Staff AI Engineer
San Francisco - Remote, CA, United States
R
USD 150K-250K Senior-level Full Time
Tasks
- Build LLM serving systems
- Design scalable RAG infrastructure
- Engineer distributed AI systems for model serving
- Improve GPU CPU performance pathways
- Optimize KV cache performance
- Optimize LLM inference performance
- Reduce inference latency and bottlenecks
- Tune memory and storage throughput
Perks/Benefits
- N/A
Skills/Tech-stack
CPU | Distributed Systems | GPU | Inference | KV cache | Language Models | Large Language Models | Latency optimization | Memory Optimization | Model Serving | Performance Engineering | RAG | Retrieval | Storage Architecture | Throughput Optimization
Education
Roles
Regions
Countries
States
Related jobs
-
Sr AI Engineer USD 124K-171KAPIs | Cause analysis | Code review | JavaScript | JiraCompany year end break | Flexible time off | Learning and development stipend | Medical/Dental/Vision insurance | Mental wellbeing resourcesSenior-level Full TimeRemote - United States R2d ago
-
AI Research Engineer (Applied AI) USD 150K-222KAccelerator hardware | Agentic Systems | Computer Vision | Data labeling | Deep learningRemote workMid-level Full TimeUnited States - Remote R2d ago
-
AI Performance Optimization Engineer USD 136K-258KAccess Optimization | Attention Optimization | Benchmarking | C++ | Compiler optimizationMid-level Full TimeUnited States - Remote R2d ago
-
Prompt Engineering Architect USD 119K-228KAgent Frameworks | Chunking | Embeddings | Evaluation | Fine TuningSenior-level Full TimeUnited States - Remote R2d ago
-
Senior AI Engineer - Contract USD 136K-172KBehavior Trees | C# | C++ | CPU Optimization | Game AICareer improvement plan | Company events | Flexible work arrangements | Generous time-off policy | Medical, dental & vision coverageSenior-level Full TimeIrvine, CA R2d ago
-
Principal Engineer - GenAI Applications & MLOps USD 175K-242KAWS | Bigtable | Data integration | Distributed Systems | Event ProcessingRemote US basedSenior-level Full TimeUS Remote R2d ago
-
Staff AI Engineer USD 200K-300KAccuracy Monitoring | Agent systems | Artificial Intelligence | Authentication | Authorization401k eligibility | Hybrid work | Paid time off | Parental leave | Remote workSenior-level Full TimeUnited States (Remote) R2d ago
-
Senior AI Developer USD 106K-133KAI SDK | AVA | Agentic AI | Agile | Cloud Foundry401k matching | Bereavement | Employee assistance program | Health, dental, and vision care | HolidaysSenior-level Full TimeRemote - Nationwide, United States R2d ago
-
Sr. Agentic AI Software Engineer USD 139K-258KAgent Orchestration | Architecture | Claude Code | Context engineering | DebuggingSenior-level Full TimeFarmington Hills or Remote (US only) R2d ago
-
AI Product Builder USD 141K-203KAI Agents | AI coding | AI coding tools | Agent Frameworks | Artificial IntelligenceMid-level Full TimeRemote - USA R2d ago
-
Adversarial prompting | Computer Architecture | Computer Engineering | Computer networks | Data labelingFlexible schedule | Fully remote | No visa sponsorshipEntry-level ContractRemote (USA) R3d ago
-
Adversarial prompting | Engineering Mechanics | Engineering design | Engineering principles | Error detectionFlexible hours | Fully remoteMid-level ContractRemote (USA) R3d ago
-
Lead Forward Deployed Engineer, Databricks 2026- US, UK USD 180K-247KAgents | Apache Spark | Data Pipelines | Data product | DatabricksRemote workSenior-level Full TimeAtlanta, GA / London, GB - … R3d ago
-
Associate Director, Biostatistics & AI USD 173K-217K21 CFR | 21 CFR Part 11 | ADaM | Adaptive Design | Annex 11401k employer match | Company provided life and disability | Comprehensive health care | Employee stock purchase program | Flex Spending AccountsMid-level Full TimeRemote - USA R3d ago
-
Data Analysis | Deep learning | GenAI | Langchain | Language ModelsFreelance project-based work | Part-time hours | Project-based compensationMid-level FreelanceUnited States - Remote R3d ago
-
Freelance Machine Learning Engineer USD 180KLLMs | Langchain | MLOps | Machine Learning | NumPyFreelance engagement | Part-time project-based workMid-level FreelanceUnited States - Remote R3d ago
-
Database operations | LLMs | Langchain | MLOps | Machine LearningPaid per project | Part-time flexible schedule | Project based workMid-level FreelanceNew York, United States - Remote R3d ago
-
Langchain | Language Models | Large Language Models | MLOps | NumPyEnglish proficiency support | Flexible workload during active phases | Part-time schedule | Project based workMid-level FreelanceTexas, United States - Remote R3d ago
-
Freelance Machine Learning Engineer USD 180KLangchain | Language Models | Large Language Models | MLOps | Machine LearningFlexible weekly hours during active phases | Part-time availability | Project based workMid-level FreelanceNew York, United States - Remote R3d ago
-
Freelance Machine Learning Engineer USD 180KLangchain | MLOps | Machine Learning | NumPy | PandasProject based workMid-level FreelanceTexas, United States - Remote R3d ago
-
Principal Machine Learning Engineer USD 194K-326KAWS | Agentic AI | Airflow | Context engineering | Cost OptimizationCompetitive compensation | Equity awards | Remote work flexibilitySenior-level Full TimeRemote-USA, United States R3d ago
-
Senior AI Architect USD 134K-237KAI orchestration | Agentic AI | Auditability | Caching | Chunking401k matching program | Adoption Assistance | Development and career growth | Fertility treatments | Flexible work schedulesSenior-level Full TimeVirtual Office (Colorado), United States R3d ago
-
Staff Software Engineer USD 178K-223KC++ | Cloud Object Storage | Database Internals | Debugging | Distributed SystemsContinued Career Development | Employee resource groups | Flexible work from home | Paid time off | Paid volunteer timeSenior-level Full TimeUS-Texas-Remote, United States R3d ago
-
Lead Data Engineer USD 110K-204KArtificial Intelligence | DBT | Data Governance | Data Warehousing | ETLCareer development | Commuter benefits | Employee assistance program | Fitness reimbursement | Flex My Way work life balanceSenior-level Full TimeUnited States of America, Eagan, Minnesota R3d ago
-
Staff AI Engineer, Internal Automation USD 62K-70KAlerting | CI/CD | Docker | FastAPI | HubSpot401k match | Dental insurance | Flexible PTO | Health insurance | Paid HolidaysEntry-level Full TimeRemote (United States) R3d ago