Research Engineer - LLM/VLM Inference Optimization (Seed Infra)
Seattle, Washington, United States
USD 232K-427K | Mid-level | Full Time
Tasks
- Apply parallel computing and graph fusion
- Collaborate with research teams on model optimization
- Conduct performance analysis and identify bottlenecks
- Design high-performance inference systems for LLMs and VLMs
- Develop CUDA kernels
- Develop inference engines and serving frameworks
- Develop model toolchains
- Enable streaming inference
- Implement compiler-level optimizations
- Implement speculative decoding
- Optimize end-to-end deployment pipelines
- Optimize serving of high-concurrency requests
- Use low-precision computation
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | CUDA kernel | Compiler optimization | Deployment Pipelines | Graph Fusion | High concurrency | Inference Optimization | Language Models | Large Language Models | Low-precision computing | Parallel Computing | Performance Profiling | Speculative decoding | Streaming inference | Vision Language Models
Education
N/A