Senior Software Engineer, Deep Learning Inference
Tasks
- Build production software in open source libraries
- Implement and optimize LLM inference algorithms
- Optimize fused MoE and quantized GEMM operators
- Own optimization features end to end
- Profile inference pipelines
- Solve distributed inference problems
- Write and tune GPU kernels
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | CUDA | CUDA profiling | Collective communication | Communication Compute Overlap | Deep learning | Distributed Computing | FP8 | GPU kernel optimization | Kernel optimization | Mamba | Memory hierarchy | Mixed Precision | Mixture of Experts | Multi-node | Multi-node deployment | Networking | Node deployment | Nsight | Pipeline parallelism | PyTorch | Python | Quantization | Roofline model | State Space Models | State-Space | Tensor Parallelism | Transformers | Triton
Education
Related jobs
-
AWS | Agentic AI | Docker | Evaluation Frameworks | Fine TuningFlexible schedule | Hybrid work model | Remote work flexibilityMid-level Full TimeIsrael - Raanana R4d ago
-
Senior AI&ML Researcher ILS 285K-366KAWS | Agentic AI | Docker | Evaluation Frameworks | Fine TuningFlexible schedule | Hybrid work | Remote workSenior-level Full TimeIsrael - Raanana R4d ago
-
AWS | Analytical Databases | CI/CD | ClickHouse | DBTEmployee stock option plan | Flexible working options | Health insurance | Home-office allowance | Parental leaveSenior-level Full TimeRemote Israel R6d ago
-
Staff Data Science Researcher ILS 285K-366KA/B | A/B Testing | AI Agents | AWS Bedrock | Agent systemsFlexible schedule | Hybrid work model | Mentorship culture | Remote work daysSenior-level Full TimeIsrael - Raanana R8d ago
-
Agile | C++ | Debugging Tools | Endpoint Security | Endpoint Security SystemEmployee networks | Office culture | Paid adoption leave | Paid parental leave | Professional developmentSenior-level Full TimeRamat Gan, Israel R10d ago
-
Airflow | Apache Spark | CI/CD | DBT | Data ObservabilityHybrid work | Mentorship | Work from office 3 days per weekSenior-level Full TimeTel Aviv-Yafo, Tel Aviv District, IL R13d ago
-
Sr. Data Engineer, Cloud Security (Hybrid, ISR) ILS 341K-443KAWS | Apache Iceberg | Apache Spark | Azure | CassandraCompetitive vacation and holidays | Comprehensive wellness programs | Employee volunteer opportunities | Paid adoption leave | Paid parental leaveSenior-level Full TimeTel Aviv (Sky Tower), Israel R15d ago
-
API Development | Argo CD | Argo Workflows | Cilium | GitOpsFull-time remote workMid-level Full TimeTel Aviv-Yafo, Tel Aviv District, IL R29d ago
-
Senior Designated Support Engineer GBP 67K-89KActive Directory | Erasure Coding | Ethernet | FTP | InfinibandFlexible hours | Remote work | Rotating on-call scheduleSenior-level Full TimeFrance, UK, Italy, Poland, Israel, Germany R1mo ago
-
API Integration | AWS | Access Control | Apache Airflow | AuthenticationFlexible work schedule | Hybrid work model | Remote work flexibilityMid-level Full TimeIsrael - Raanana R1mo ago
-
API Authentication | API pagination | API rate-limiting | AWS | Apache AirflowCareer growth opportunities | Collaborative work environment | Flexible schedule | Hybrid work model | Remote work optionMid-level Full TimeIsrael - Raanana R1mo ago
-
AWS | Azure | Distributed Systems | Django | ELKFlexible working | Hybrid work | Work from home optionSenior-level Full TimeIsrael R1mo ago
-
AWS | Azure | Cloud deployment | Containerization | CrewAIRemote workMid-level Full TimeTel Aviv; Haifa R1mo ago