Member of technical staff (Inference) - London
Tasks
- Collaborate with research teams on model architectures
- Develop GPU kernels for attention and matrix multiplication
- Develop inference pipelines
- Implement distributed computing techniques
- Implement state of the art inference techniques
- Optimize memory usage throughput latency
- Review research papers for inference optimization
Perks/Benefits
Skills/Tech-stack
C++ | CUDA | CUDA kernel | CUDA kernel programming | Caching | Continuous batching | Deep learning | Deep learning inference | Distributed Computing | Flash Attention | GPU Programming | Ggml | Kernel programming | Llama.cpp | Model Compression | NCCL | ONNX Runtime | Paged Attention | PyTorch | Python | Quantization | Rust | SGLang | TensorRT-LLM | Triton | VLLM
Education
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer | Software Engineer
Related jobs
-
Senior Platform and AI Engineer GBP 80K-110KAzure OpenAI | Azure Pipelines | Branch protection | Budget Management | CI/CDFlexible working | Work-life balanceSenior-level Full TimeLondon, United Kingdom11h ago
-
Senior AI/ML Engineer GBP 62K-80KAKS | AWS | Airflow | Azure | Azure Machine LearningAnnual holiday allowance | Colleague discount | Cycle to work scheme | Employee assistance programme | Employee discountsSenior-level Full TimeLondon, London, United Kingdom12h ago
-
AI Governance | Agile | Artificial Intelligence | Cloud | GoExecutive-level Full TimeLondon, Greater London, England, United Kingdom13h ago
-
Data Engineer GBP 40K-45KAzure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake Storage | Azure DatabricksFlexible working hours | Paid parental leave | Remote work options | Training and development programmesMid-level Full TimeLondon - Shell Centre, United Kingdom22h ago
-
Software Engineer (Embedded) GBP 45K-59KADC | ARM microcontrollers | Agile | Attestation | Bare MetalMid-level Full TimeCambridge22h ago
-
Senior-level Full TimeLondon, England, United Kingdom22h ago
-
Power BI Developer GBP 52K-62KALM Toolkit | Active Directory | Audit Logging | Azure Active Directory | DAXCharitable donations | Digital GP service | Employee assistance membership | Enhanced parental leave pay | Free Single Medical CoverSenior-level Full TimeGBR-Bridgwater-EDF NNB Hinkley Point C (040GB), …22h ago
-
Data Scientist – Analytics & ML GBP 50K-51KAmazon Redshift | Artificial Intelligence | Dashboarding | Data Governance | Data ModelingMid-level Full TimeUK-Dungannon, United Kingdom22h ago
-
Data Engineer - Kids Planet Central Support GBP 34K-45KAPI | AWS | Azure | Cloud platform | Data GovernanceAdoption leave enhancement | Anniversary Awards | Birthday leave | Career progression | Discounted childcareMid-level Full TimeEngland, WA13 0RN, GB1d ago
-
Data Engineer GBP 50K-68KAgile | Apache Spark | Azure Cosmos | Azure Cosmos DB | Azure DataDE&I initiatives involvement | Employee discount | Flex benefits allowance | Industry events participation | Paid annual leaveSenior-level Full TimeLondon, England, United Kingdom1d ago
-
Senior-level Full TimeLondon, United Kingdom1d ago
-
AI Solutions Developer GBP 55K-70KAWS | Autogen | CI/CD | CrewAI | FastAPIAdditional days off | Coffee and snacks | Extra parental leave | Health insurance | Hybrid workMid-level Full TimeLondon, England, United Kingdom1d ago
-
Applied AI & ML Lead – Markets Operations GBP 81K-109KAWS | AutoPrompt | Azure | Bedrock | Cloud platformSenior-level Full TimeLONDON, United Kingdom1d ago
-
Buildpacks | Data Structures | Data Structures and Algorithms | Envoy | GoSenior-level Full TimeLondon1d ago
-
Senior-level Full TimeLondon, United Kingdom1d ago
-
Software Specialist (TL) - AI/ML - Monetisation GBP 35K-40KC++ | Causal Inference | Data Analysis | Data Modeling | Data PipelinesSenior-level Full TimeRemote, UK | London, UK R1d ago
-
Coach - Data Engineer Level 5 GBP 55K-70KAWS | Apache Airflow | Apache Spark | Azure | Batch dataCertification support | Flexible working | Fully remote | Professional development | Travel as requiredSenior-level Full TimeLondon, United Kingdom R1d ago
-
AI Research Assistant (PhD) GBP 40K-46KC# | C++ | Deep learning | Java | LuaOpen source contributions | Open-science cultureEntry-level Full TimeLondon, UK1d ago
-
AI Research Scientist, Vision Language Models GBP 41K-65KComputer Vision | Deep learning | Generative AI | Language Models | Language ProcessingEntry-level Full TimeZurich, Switzerland | London, UK1d ago
-
Computer Vision | Diffusion Models | Generative AI | Language Models | Large Language ModelsMid-level Full TimeLondon, UK1d ago
-
Game Designer, Games, DeepMind GBP 150K-200KAgentic Workflows | Artificial Intelligence | C# | Game Design | Generative AISenior-level Full TimeLondon, UK1d ago
-
Senior AI Engineer GBP 77K-100KAWS | Agent architectures | Anthropic API | Cloud platform | DevOpsAnnual leave | Community involvement | Course and certification support | Cycle to work | Give as you earnSenior-level Full TimeBristol, England, United Kingdom1d ago
-
Senior AI Engineer GBP 78K-100KAWS | Anthropic API | Azure | DevOps | DevSecOpsCourse and training budget | Hybrid working | Learning budget | Mentoring and coaching | Paid time offSenior-level Full TimeManchester, England, United Kingdom1d ago
-
Senior-level Full TimeLondon, United Kingdom1d ago
-
Lead Data Engineer GBP 90K-106KApache Spark | Azure | Data Migration | Data Modeling | Data cleaningSenior-level Full TimeUK - London1d ago