Research Engineer - LLM/VLM Inference Optimization (Seed Infra)
Tasks
- Apply parallel computing and graph fusion
- Collaborate with research teams on model optimization
- Conduct performance analysis and identify bottlenecks
- Design high-performance inference systems for LLMs and VLMs
- Develop CUDA kernels
- Develop inference engines and serving frameworks
- Develop model toolchains
- Enable streaming inference
- Implement compiler-level optimizations
- Implement speculative decoding
- Optimize end-to-end deployment pipelines
- Optimize handling of high-concurrency requests
- Use low-precision computation
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | CUDA kernel | Compiler optimization | Deployment pipelines | Graph fusion | High concurrency | Inference optimization | Language Models | Large Language Models | Low-precision computing | Parallel computing | Performance profiling | Speculative decoding | Streaming inference | Vision Language Models
Education
N/A