Senior Performance Engineer - LLM Inference Frameworks
Tasks
- Build benchmarking and testing systems for latency, utilization, and efficiency
- Design high-performance inference pipelines for large language models
- Develop and optimize context caching
- Develop and optimize speculative decoding
- Implement FP8 and INT4 quantization
- Implement memory management strategies for improved bandwidth and cache efficiency
- Profile and tune model execution across the stack
Skills/Tech-stack
Debugging | Deep Learning | Distributed Systems | FP8 | GPU | Hugging Face | INT4 | Latency Optimization | Memory Management | Model Benchmarking | Performance Profiling | PyTorch | Python | Quantization | Throughput Optimization