Research Engineer / Scientist - Storage for LLM
San Jose, California, United States
USD 156K-387K | Entry-level | Full Time
Tasks
- Design a distributed KV cache system
- Develop cache consistency and synchronization protocols
- Evaluate open source KV stores and extend caching layers
- Implement memory-aware sharding and replication
- Integrate cache with batched decoding
- Integrate cache with token streaming pipelines
- Monitor performance and iterate caching algorithms
- Optimize cache latency and eviction
Perks/Benefits
- Competitive compensation
- Conference attendance
- Generous research resources
- Innovation-driven culture
- Open source contributions
Skills/Tech-stack
Attention Mechanisms | CUDA | Caching | Distributed Systems | Eviction Policies | GPU Computing | Key-Value Storage | Language Models | Large Language Models | Latency Optimization | Memory Management | NVIDIA Triton | RDMA | Replication | Sharding | Shared Memory | TTL | Throughput Optimization | Transformer | Windowed LRU
Education
N/A
Roles
Engineer | Research Engineer | Research Scientist | Scientist