Member of technical staff (Inference) - London
Tasks
- Collaborate with research teams on model architectures
- Develop GPU kernels for attention and matrix multiplication
- Develop inference pipelines
- Implement distributed computing techniques
- Implement state of the art inference techniques
- Optimize memory usage throughput latency
- Review research papers for inference optimization
Perks/Benefits
Skills/Tech-stack
C++ | CUDA | CUDA kernel | CUDA kernel programming | Caching | Continuous batching | Deep learning | Deep learning inference | Distributed Computing | Flash Attention | GPU Programming | Ggml | Kernel programming | Llama.cpp | Model Compression | NCCL | ONNX Runtime | Paged Attention | PyTorch | Python | Quantization | Rust | SGLang | TensorRT-LLM | Triton | VLLM
Education
Roles
AI | AI Engineer | Engineer | Learning Engineer | Machine Learning Engineer | Software Engineer
Related jobs
-
Bash | Cloud platform | Data Ingestion | Data Processing | DockerAsynchronous culture | Career growth opportunity | Friendly work environment | Impact on consumer and enterprise products | Remote friendly 100% distributed settingMid-level Full TimeLondon, United Kingdom20h ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeAsynchronous culture | Entrepreneurial environment | Flexible management structure | Friendly laid-back atmosphereMid-level Full TimeBrighton, United Kingdom20h ago
-
Internal Audit AVP - Data Analytics GBP 47K-58KAWS | Alteryx | Audit Testing | Cloud Computing | DashboardsEntry-level Full TimeCanary Wharf, 1 Churchill Place, United …1d ago
-
AI/ML/Data Science Engineer GBP 50K-60KAnomaly Detection | Cloud Platforms | Computer Vision | Computer Vision Video Analytics | Data PreprocessingAnnual leave | Bank Holiday Leave | Dental care | Discounts | Enhanced maternity/paternity leaveMid-level Full TimeBiggin Hill, United Kingdom1d ago
-
LLM Specialist / AI Developer GBP 59K-65KAPI Security | Azure OpenAI | Copilot Studio | Evaluation | GroundingFixed term position | Hybrid working | Working permit sponsorshipMid-level Full TimeLondon, Greater London, United Kingdom1d ago
-
Senior GenAI Software Engineer GBP 76K-100KA/B | A/B Testing | B testing | Debugging | Diffusion ModelsDental insurance | Equity | Health insurance | Vision insurance | Wellness stipendsSenior-level Full TimeLondon, UK R1d ago
-
Senior-level Full TimeUnited Kingdom (Remote) R1d ago
-
Data Science Lead - Logistics GBP 62K-80KArtificial Intelligence | Big Data | Cloud | IoT | JavaAnnual leave | Buy As You Earn Scheme | Cycle to work scheme | Employee assistance programme | Employee discountsSenior-level Full TimeLondon, United Kingdom R1d ago
-
Senior Data Engineer BI Hub GBP 62K-77KAWS | Agile | Alerting | Azure | CI/CDAnnual holiday allowance | Buy additional holiday | Colleague discount | Cycle to work scheme | DiscountsSenior-level Full TimeLondon, London, United Kingdom1d ago
-
AWS | Amazon Bedrock | Azure | Azure OpenAI | DockerMid-level Full TimeLONDON, LONDON, United Kingdom1d ago
-
Applied AI ML Lead - LLM Suite Engineering GBP 72K-95KAPIs | AWS | AWS Bedrock | Agent communication | Agent to AgentSenior-level Full TimeLONDON, LONDON, United Kingdom1d ago
-
Senior AI Engineer GBP 75K-75KAWS | Agent systems | Artificial Intelligence | Azure | CI/CDAnnual bonus | Discounted gym membership | Electric vehicle leasing | Experience days | Hybrid workSenior-level Full TimeLondon, United Kingdom R1d ago
-
Full Stack AI Engineer GBP 78K-80KAlembic | Application Insights | Asynchronous programming | Auditability | AuthenticationMid-level Full TimeBelfast1d ago
-
Principal Computer Vision Engineer GBP 67K-109K3D Object Tracking | ARM | C# | C++ | Camera ModelsHybrid work | Mentoring | Professional developmentSenior-level Full TimeLondon, UK1d ago
-
Senior Data Engineer GBP 77K-80KAWS | AWS Glue | AWS Lambda | AWS S3 | Access ControlAnnual bonus programme | Annual learning stipend | Birthday day off | Collaborative Agile culture | Flexible working hoursSenior-level Full TimeLondon, United Kingdom1d ago
-
System Engineer - PostgreSQL GBP 150KDebian | Linux | MySQL | PostgreSQL | PythonBonus pay | Hybrid workMid-level Full TimeLondon, England, United Kingdom1d ago
-
Senior HPC Storage Engineer GBP 65K-80KCephFS | GPFS | Linux | Lustre | NASHybrid working | Quarterly bonuses | Signing bonusSenior-level Full TimeLondon, England, United Kingdom1d ago
-
Senior HPC Software Engineer GBP 65K-80KC++ | Cloud Storage | Databases | Deployment Monitoring | Distributed SystemsFlexible budgetSenior-level Full TimeLondon, England, United Kingdom1d ago
-
Data Mart | Data Modeling | Data Warehousing | HDFS | Hadoop100 percent remote | Outside IR35Mid-level Full TimeCoventry, England, United Kingdom R1d ago
-
Data Engineer GBP 60K-70KAPI | AWS | ETL | Python | SnowflakeCareer development | Health insurance | Life insurance | Paid time off | Paid volunteering leaveMid-level Full TimeManchester, England, United Kingdom1d ago
-
AI Engineer / Machine Learning Engineer GBP 85K-100KAlgorithms | Cloud Computing | Data Science | Data Structures | DockerSenior-level Full TimeLondon, England, United Kingdom1d ago
-
Data Science Lead - Fulfilment GBP 70K-90KArtificial Intelligence | Big Data | Cloud Computing | Data Science | Deep learningAnnual leave | Cycle to work scheme | Employee assistance programme | Employee discounts | Free deliverySenior-level Full TimeKrakow, Poland; London, United Kingdom R1d ago
-
Senior Data Engineer GBP 72K-80KAzure Data | Azure Data Factory | CI/CD | Change Data Capture | DAXAgile team collaboration | Coaching and mentoring | International environment | Workshops and design sessionsSenior-level Full TimeLondon, United Kingdom1d ago
-
Agent-based | Agent-based systems | Generative AI | Language Models | Large Language ModelsIn-office FlexibilityMid-level Full TimeLondon, United Kingdom2d ago
-
Senior Data Science Engineer GBP 72K-80KAutomated trading | Data Analysis | Data Visualization | Databricks | ExperimentationGaming license assistanceSenior-level Full TimeLondon, UK, United Kingdom2d ago