Lead AI Engineer (FM Hosting, LLM Inference)
Tasks
- Apply AI governance practices
- Build large language model inference services
- Design AI software components for foundation model training
- Develop model guardrails
- Evaluate AI models
- Implement similarity search systems
- Improve scalability, cost, latency, and throughput
- Optimize LLM training and inference
- Run AI experimentation
- Set up AI observability
Perks/Benefits
- N/A
Skills/Tech-stack
AI Governance | AWS | C# | C++ | Cloud Computing | Experimentation | Go | Hugging Face | Java | JavaScript | LLM Inference | Model Evaluation | NVIDIA NeMo | NVIDIA NeMo Guardrails | Observability | PyTorch | Python | Scala | Similarity Search | VectorDB
Roles
AI | AI Engineer | Engineer | Lead | Lead AI Engineer