Research Engineer / Scientist - Storage for LLM
San Jose, California, United States
USD 136K-359K Entry-level Full Time
Tasks
- Collaborate with inference teams for integration
- Design distributed KV cache system
- Develop cache consistency and synchronization protocols
- Evaluate and extend open-source KV stores or build custom GPU-aware caching layers
- Implement memory-aware sharding eviction and replication strategies
- Monitor system performance and iterate caching algorithms
- Optimize low-latency access and eviction policies
Perks/Benefits
- Competitive salary
- Conference engagement
- Innovative culture
- Open source contribution opportunities
- Research resources
Skills/Tech-stack
AI infrastructure | Batched decoding | CUDA | Caching systems | Distributed Storage | Eviction policies | GPU Programming | Memory Management | Model Parallelism | Open Source | Open source contribution | RDMA | Sharding | System Optimization | System monitoring | Token streaming | Triton
Education
Bachelor's | Master's | PhD
Related jobs
-
Data Engineering | Deep learning | Fine Tuning | LLM | Language ModelsSenior-level Full TimeNew York, United States21h ago
-
C++/CUDA Systems Engineer – Surgical Robotics Platform USD 140K-160KC++ | C++17 | C++20 | CPU GPU Scheduling | CUDAEquity | Health insurance | Paid time off | Performance bonusMid-level Full TimeLos Angeles, California23h ago
-
Android Development | C Sharp | C plus plus | C# | Command LineMid-level Full TimeMountain View, CA, US; Redmond, WA, … R1d ago
-
Entry-level Full TimeSunnyvale, CA, United States1d ago
-
Computer Scientist, Level 3 (Network Engineer) USD 110K-143KAcropolis Hypervisor | Acropolis OS | CPU Overcommitment | Capacity Planning | Cisco Catalyst401k | Dental insurance | Education assistance | Health insurance | HolidaysSenior-level Full TimeOklahoma City, OK, 73169, US1d ago
-
Senior Software Engineer, ML Performance Infrastructure USD 174K-252KC# | C++ | Compute technology | Data Storage | Data StructuresSenior-level Full TimeSunnyvale, CA, USA1d ago
-
Senior Software Engineer, Jules AI, Labs, AI/ML USD 174K-252KAI infrastructure | Agent systems | Context engineering | Distributed Computing | Generative AISenior-level Full TimeMountain View, CA, USA; New York, …1d ago
-
Staff Software Engineer, Performance and Optimization USD 189K-303KAutomated testing | C# | C++ | EBPF | FtraceBonus | Equity compensation | Health benefits | Hybrid work environmentSenior-level Full TimeSeattle, Washington1d ago
-
Staff Software Engineer, Performance and Optimization USD 171K-273KAutomated testing | C# | C++ | Co-design | EBPFAnnual bonus | Benefits | Equity compensation | Hybrid work environmentSenior-level Full TimePittsburgh, Pennsylvania1d ago
-
Staff Software Engineer, Performance and Optimization USD 189K-303KAutomated testing | C plus plus | C# | EBPF | FtraceAnnual bonus | Benefits | Equity compensation | Hybrid work environmentSenior-level Full TimeMountain View, California1d ago
-
Senior Software Engineer, AI Training & Infrastructure USD 200K-300KAWS | Alerting | Algorithms | Automated testing | AzureSenior-level Full TimeSan Mateo1d ago
-
Analytics | Concurrency | Containerization | Core Java | Data pipeline401k plan | Commuter benefits | Disability benefits | Life insurance | Paid time offSenior-level Full Time112265-NJ-MetroPark, Iselin, United States1d ago
-
Array computing | Asynchronous programming | C++ | CUDA | CupyEquity | Health benefitsSenior-level Full TimeUS, CA, Santa Clara, United States1d ago
-
Benchmarking | C plus plus | CUDA | Code optimization | Data ProcessingSenior-level Full TimeUS, CA, Santa Clara, United States1d ago
-
AWS SageMaker | Azure Machine Learning | CUDA | Cloud Computing | Data acquisition401k | Coffee service | Dental insurance | Education assistance | Fitness CenterMid-level Full TimeMT-BRK-Red Speed, United States1d ago
-
Senior Staff Software Engineer, Storage USD 240K-310KAWS | Azure | C# | C++ | CAP TheoremCommuter benefits | Dental insurance | Health insurance | Mental health support | Paid HolidaysSenior-level Full TimeSan Francisco, CA - US1d ago
-
C++ | Cloud processing | Concurrency | Docker | GRPC401k retirement plan | Dental coverage | Employee referral bonuses | Flexible PTO | Free lunchSenior-level Full TimeColumbus, Ohio2d ago
-
Mid-level Full TimeHanover, Maryland2d ago
-
Machine Learning (ML) Bioengineer USD 146K-222KActive Learning | Bioinformatics | Data Analysis | Deep learning | Distributed Training401k | Education reimbursement program | Flexible schedule | Hybrid work schedule | Relocation assistanceMid-level Full TimeLivermore, CA, United States R2d ago
-
Applied Scientist 4 USD 120K-251KAutomatic Speech Recognition | C++ | Cloud Computing | Computer Vision | Data AnnotationMid-level Full TimePleasanton, CA, United States2d ago
-
Machine Learning Systems Engineer USD 160K-253KAWQ | C# | C++ | CUDA | Distributed TrainingDental insurance | Free meals and snacks | Health insurance | Professional development | Unlimited PTOSenior-level Full TimeMenlo Park, CA2d ago
-
Algorithms | Android | C# | C++ | CompilersSenior-level Full TimeMountain View, CA, USA2d ago
-
Benchmarking | C# | C++ | CPU architecture | Compiler optimizationHealth insurance | Hybrid work model | Retirement plan | VacationEntry-level Full TimeUSA - AZ - Chandler, United …2d ago
-
Senior-level ContractSunnyvale, CA2d ago
-
Software Development Engineer, Amazon MSK USD 143K-194KAmazon Kinesis | Apache Flink | Apache Kafka | Apache Spark | Apache StormCareer growth | Flexible work schedule | Mentorship | Work-life balanceMid-level Full TimeSanta Monica, California, USA2d ago