Staff Software Engineer, Inference
Tasks
- Benchmark inference performance
- Build low latency high throughput inference systems
- Conduct post-incident analysis
- Define inference platform architecture
- Design request routing and scheduling
- Develop speculative decoding
- Drive traffic management and failure mitigation
- Enable KV cache reuse and memory optimization
- Implement micro batching and dynamic scheduling
- Improve P95 P99 latency and throughput
- Improve testing and observability
- Lead capacity planning and autoscaling
- Lead cross-team design reviews
- Lead multi-service cross team design
- Manage GPU resources
- Mentor engineers and elevate engineering standards
- Optimize cost per token
- Own SLIs SLOs reliability
Perks/Benefits
- Critical illness cover
- Employee assistance programme
- Family dental insurance
- Family medical insurance
- Generous pension contribution
- Life assurance
- Living wage accredited employer
- Tuition reimbursement
Skills/Tech-stack
Autoscaling | BF16 | Benchmarking | C++ | CUDA | Capacity Planning | Cloud platform | Distributed Systems | Dynamic Scheduling | FP8 | Failure mitigation | Go | KV cache | Kubernetes | Latency optimization | Memory Optimization | Micro Batching | Mixed Precision | NCCL | NUMA | Observability | Performance optimization | Python | RDMA | SLI | SLO | Speculative decoding | Streaming token delivery | System design | Throughput Optimization | Traffic Management
Education
Roles
Related jobs
-
Senior Data Engineer (ADB, Python) CAD 120K-160KASP.Net Core | Angular | Apache Spark | Azure Cloud | Azure FabricHybrid/Remote flexibility | International projects | Medical healthcare | Ongoing learning reimbursement | Recognition programSenior-level Full TimeBulgaria, Georgia, Poland, Romania, Uzbekistan21h ago
-
Staff Software Engineer, Vertex AI, Workbench PLN 480K-492KCloud Computing | Data Structures | Data Structures and Algorithms | Distributed Systems | GolangSenior-level Full TimeWarsaw, Poland1d ago
-
C++ | Data Structures | Data Structures and Algorithms | Machine Learning | Multithreaded programmingSenior-level Full TimeKraków, Poland1d ago
-
Data Engineer PLN 211K-358KAirflow | Analytical Databases | Azure | CI/CD | Data ContractsInternational work environment | Professional development opportunities | Remote work option | Work-life balanceSenior-level Full TimeKatowice, Silesian Voivodeship, Poland1d ago
-
Senior/Lead Machine Learning Engineer PLN 179K-280KArgoCD | Azure | C# | CI/CD | Deep learningHybrid work | Inclusive workplaceSenior-level Full TimeWarsaw, Mazovia, Poland1d ago
-
Senior/Lead Machine Learning Engineer PLN 179K-280KArgoCD | Azure | CI/CD | Data Preprocessing | Deep learningHybrid work opportunitiesSenior-level Full TimeKrakow, Lesser Poland, Poland1d ago
-
[VKS] Data Engineer PLN 258K-396KBigQuery | Cloud platform | DBT | Data Modeling | Data PipelinesFlexible employment | Internal and external training | International business trips | International projects | Language classesSenior-level Full TimeWarsaw, Masovian Voivodeship, Poland1d ago
-
AI Engineer - Memory Retrieval PLN 145K-218KAgent Orchestration | Embedding | Information Extraction | Information Retrieval | Language ModelsCareer development | Collaborative team | Health insurance | Hybrid remote workMid-level ContractPoland R1d ago
-
Apache Kafka | Apache Spark | Experimentation | Feature Engineering | GoGlobal collaborationSenior-level Full TimeWarszawa, Masovian Voivodeship, Poland1d ago
-
Senior/Lead Machine Learning Engineer PLN 179K-280KArgoCD | Azure | CI/CD | Data Preprocessing | Deep learningHybrid work flexibility | Inclusive workplaceSenior-level Full TimeKrakow, Lesser Poland, Poland1d ago
-
Senior AI Engineer (All Genders) PLN 257K-400KCI/CD | Context engineering | Docker | Embeddings | JavaAnnual vacation days | Bike leasing | Discount on products | Hybrid working | Private medical health insuranceSenior-level Full TimeKrakow, Poland1d ago
-
Senior AI Engineer (All Genders) PLN 246K-400KAI Safety | API Design | API Testing | Amazon Bedrock | Automated testingBike leasing | Discount on products | Hybrid working | Option to work abroad | Subsidised transportSenior-level Full TimeKrakow, Poland1d ago
-
Junior Data Engineer PLN 84K-119KAgile | Apache Spark | Azure | Data Cleansing | Data GovernanceAdditional paid leave | Ergonomic office | Group insurance | Health insurance | Hybrid workEntry-level Full TimeKatowice, PL, 40-2022d ago
-
Junior Data Engineer PLN 87K-109KAgile | Azure | Azure Data | Azure Data Services | Data CleansingAdditional paid time off | Commuting support | Company events | Discount on company products | Ergonomic officeEntry-level Full TimeKatowice, PL, 40-2022d ago
-
Lead AI Engineer PLN 309K-309KA/B | A/B Testing | AWS | Agentic Systems | AgileCompany car | Employer supported pension plan | Extra paid holidays | Flexible benefits | Home-office allowanceSenior-level Full TimeWarszawa, Mazowieckie, PL R2d ago
-
Senior-level Full TimePL-Warsaw2d ago
-
AWS | Authentication | Authorization | Azure | CI/CDEmployee benefits | Flexible work schedule | Health benefits | Remote work | Well-being benefitsMid-level Full TimePoland R2d ago
-
Data Engineer/Power BI PLN 258K-396KApache Airflow | Azure Data | Azure Data Factory | Data Factory | Data ModelingSenior-level Full TimeWarsaw, Poland2d ago
-
Data Engineer PLN 276K-276KAWS | Apache Airflow | Apache Spark | Athena | CI/CDCareer growth | Coworking space access | English classes | MacBook provided | MentorshipMid-level Full TimePoland2d ago
-
Data Engineer PLN 312K-312KAgile | Azure | Azure Databricks | DBT | Data ModelingCo Financing Studies and Certification | Dental care | Employee referral programme | Home office | Language coursesMid-level Full TimeKrakow, Poland2d ago
-
Data Engineer - Data Quality & Testing (Mid / Regular) PLN 120K-190KAWS | Amazon Redshift | Azure | CI/CD | Cloud platformAccess to e-learning platforms | Additional parent privileges | Cafeteria benefits | Diversity charter | Employee shares planEntry-level Full TimePoland - Warsaw - ASEC2d ago
-
Experienced Python Consultant with AI/ML skills PLN 190K-258KAPI Design | Agentic architecture | Azure | Behavior-Driven Development | CI/CDMid-level Full TimeKraków, PL, 30-3022d ago
-
Senior Data Engineer USD 110K-125KAPI Development | Airflow | Alerting | Amazon Athena | Backend DevelopmentHealthcare coverage | Home office setup budget | Learning and development budget | Paid time off | Parental leaveSenior-level Full TimePoland - Remote R2d ago
-
Senior-level Full TimeKraków, Lesser Poland Voivodeship, Poland2d ago
-
[VKS] Data Engineer PLN 258K-396KBigQuery | Cloud platform | DBT | Data Modeling | Data pipelineExternal training | Flexible employment | Insurance | Internal training | International business tripsSenior-level Full TimeWarsaw, Masovian Voivodeship, Poland2d ago