Find jobs in AI/ML, Data Science and Big Data
52 results for GPU Architecture (Skill/Tech stack)
-
AI Performance Optimization Engineer · USD 100K-150K · C++ | Continuous batching | Custom Kernel | Custom kernel development | Cutlass · Career growth | H1B transfer support | Remote work | W2 employment · Mid-level Full Time · United States - Remote R · 1d ago
-
AI Performance Optimization Engineer · USD 100K-150K · C++ | Continuous batching | DeepSpeed | Distributed Training | FSDP · Mid-level Full Time · United States - Remote R · 1d ago
-
Senior Machine Learning Engineer – LLMs · EUR 62K-90K · Accelerate | Axolotl | BF16 | DPO | Data Deduplication · Autonomy | Hybrid work model | Professional growth | Top-spec equipment · Senior-level Full Time · Netherlands - Amsterdam · 1d ago
-
IN_Manager_AI Engineer_Data and Analytics_Advisory_Noida · INR 1500K-2500K · AWS Bedrock | Amazon Aurora | Amazon DynamoDB | Amazon RDS | Amazon SageMaker · Mid-level Full Time · Noida, India · 2d ago
-
AI Performance Optimization Engineer · USD 100K-150K · Benchmarking | C++ | Communication Primitives | Continuous batching | Distributed Training · Career growth potential | Remote work · Mid-level Full Time · United States - Remote R · 3d ago
-
AI Performance Optimization Engineer · USD 100K-150K · Benchmarking | C++ | CUDA | Continuous batching | Debugging · Mid-level Full Time · United States - Remote R · 3d ago
-
AI Performance Optimization Engineer · USD 100K-150K · C++ | CUDA | Continuous batching | Cutlass | Deep learning · Career growth | Health benefits | Remote work · Mid-level Full Time · United States - Remote R · 3d ago
-
AI Performance Optimization Engineer · USD 100K-150K · C++ | Compiler optimization | Continuous batching | Distributed Training | FSDP · Mid-level Full Time · United States - Remote R · 3d ago
-
Sr AI/ Machine Learning Engineer - SAP BTP Fabric · USD 148K-306K · AMD GPU | AMD GPU Architecture | Application Performance Tuning | Application performance | CI/CD · Senior-level Full Time · Palo Alto, CA, US, 94304 · 3d ago
-
Product Manufacturing Test Lead, Machine Learning · USD 144K-209K · Board assembly | CPU GPU | CPU/GPU architecture | Circuit board assembly | Custom electronics · Senior-level Full Time · Sunnyvale, CA, USA · 4d ago
-
AWQ | AWS | Batching | CPU architecture | CUDA · Senior-level Full Time · Guangzhou, Guangdong, China · 5d ago
-
AI Performance Optimization Engineer · USD 100K-150K · Attention Mechanisms | Benchmarking | C++ | Continuous batching | Data pipeline · Career growth | Remote work · Mid-level Full Time · United States - Remote R · 6d ago
-
AI Performance Optimization Engineer · USD 100K-150K · Benchmarking | C++ | Compiler optimization | Continuous batching | Cutlass · Benefits | Full-time employment | Remote work · Mid-level Full Time · United States - Remote R · 6d ago
-
Software Engineer - PyTorch Training Framework Adaptation for Domestic Chips · CNY 240K-480K · CUDA | GPU Architecture | GPU Programming | PyTorch | Python · Mid-level Full Time · Beijing · 6d ago
-
AI Kernel Optimization Engineer · EUR 45K-75K · AI accelerator | Activation | Assembly | Attention | C# · Career development | Generous Paternity Leave | Health coverage | Meal vouchers | Remote work · Mid-level Full Time · Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France · 6d ago
-
AI Architect Lead (Hybrid-within BankUnited's footprint) · USD 140K-200K · AI Foundry | AWS | AWS Bedrock | Artificial Intelligence | Autogen · Hybrid work · Senior-level Full Time · Miami Lakes, FL, United States R · 7d ago
-
Senior-level Full Time · US, CA, Remote, United States R · 9d ago
-
Mid-level Full Time · Beijing R · 11d ago
-
Staff Data Center Design Lead · USD 171K-248K · AI system integration | Cooling systems | Cost Performance | Cost/performance modeling | Data Center Design · Senior-level Full Time · Sunnyvale, CA, USA · 11d ago
-
Senior LLM Engineer · EUR 60K-78K · AWS | Benchmarking | Cloud Computing | Data Quality | Deep learning · Career growth | Discounted lunch options | Educational budget | Flexible working hours | Hybrid work · Senior-level Full Time · Madrid, Spain · 12d ago
-
Bash | C# | C++ | CI/CD | CPU architecture · Bicycle subsidy | Company bonus | Friday cake | Health insurance | Wellness allowance · Senior-level Full Time · Sweden - Lund · 13d ago
-
AI/ML | AI/ML Integration | C# | C++ | Compiler design · Comprehensive benefits package · Senior-level Full Time · China, Shanghai · 14d ago
-
Workstation AI Architect · USD 147K-230K · AI | Agentic Systems | Benchmarking | CPU architecture | Compute architecture · Dental insurance | Disability insurance | Employee assistance program | Flexible paid vacation and sick leave | Flexible spending account · Senior-level Full Time · FTC03 - Ft. Collins, CO B-3 … · 16d ago
-
ASIC | C++ | CPU architecture | Debugging | Deep learning · Insurance | Relocation assistance | Visa sponsorship · Mid-level Full Time · Dubai · 17d ago
-
Senior-level Full Time · China, Shanghai · 20d ago
-
AI/ML ASIC Architect · USD 163K-249K · ARM | ASIC architecture | AXI interconnect | Area Optimization | Attention Mechanisms · Senior-level Full Time · Milpitas, CA, United States · 20d ago
-
Auto-tuning | C++ | CUDA | Cache behavior | Computer Architecture · Comprehensive benefits package · Senior-level Full Time · Israel, Yokneam · 22d ago
-
Solutions Architect - AI Technology Center, Foundation Model Building · KRW 65000K-90000K · AI model | AI model development | CUDA | Debugging | Fine Tuning · Senior-level Full Time · Korea, Seoul, Korea, Republic of · 22d ago
-
Senior Principal Machine Learning Engineer, vLLM · USD 206K-351K · CPU architecture | Code review | Computer Vision | Deep learning | GPU Architecture · 401k employer match | Employee stock purchase plan | Flexible spending account | Health savings account | Paid parental leave · Senior-level Full Time · Boston, United States R · 23d ago
-
Member of Technical Staff, AI Engineering · USD 162K-297K · Autogen | BF16 | C++ | CI/CD | CUDA · Income Protection for Illness or Injury | Medical, dental, vision plans | Paid Holidays | Paid family leave | Paid time off · Senior-level Full Time · Boise, ID - Main Site, United … · 28d ago
-
Staff Software Engineer, GPU Performance · USD 207K-300K · AMD | CUDA | Code generation | Compiler optimization | Cutlass · Senior-level Full Time · Sunnyvale, CA, USA; Kirkland, WA, USA · 28d ago
-
Solutions Architect, Pre-training and Post-training · KRW 65000K-90000K · Artificial Intelligence | Debugging | Deep learning | Fine Tuning | GPU Architecture · Senior-level Full Time · Korea, Seoul, Korea, Republic of · 29d ago
-
Build systems | CI/CD | Deep learning | Distributed Training | GPU Architecture · Employee stock options | Remote work · Senior-level Full Time · Palo Alto, California, United States · 29d ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill · 401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefits · Mid-level Full Time · San Francisco, CA · 1mo ago
-
Senior Deep Learning Systems Engineer, Datacenters · USD 184K-356K · C# | C++ | CPU architecture | CUDA | Compilers · Equity | Health insurance | Hybrid work | Paid time off · Senior-level Full Time · US, CA, Santa Clara, United States · 1mo ago
-
Senior-level Full Time · Taichung - AATT, Taiwan · 1mo ago
-
Senior DL Algorithms Engineer - Inference Performance · USD 152K-287K · Benchmarking | CUDA | Computer Architecture | Deep learning | Diffusion Models · Benefits | Equity · Senior-level Full Time · US, CA, Santa Clara, United States · 1mo ago
-
Senior Deep Learning Performance Architect · USD 184K-356K · ASIC architecture | C++ | Deep learning | GPU Architecture | Interconnect · Benefits | Equity · Senior-level Full Time · US, CA, Santa Clara, United States · 1mo ago
-
Lead Inference Platform Support Engineer - AI I · CAD 140K-175K · ASIC architecture | AWS | Azure | C++ | CI/CD · Flex My Way | Headspace app access | Hybrid work model | Mental health days | Paid volunteer days off · Senior-level Full Time · Canada, Toronto, Ontario R · 1mo ago
-
Benchmarking | CUDA | CUDNN | Cutlass | Deep learning · Mid-level Full Time · US-WA-Bellevue · 1mo ago
-
AI Software Engineer Intern · CNY 38K-50K · CUDA | Distributed Systems | FP8 | FasterTransformer | Flash Attention · On-site work · Entry-level Full Time Internship · CHN - Minhang, China · 1mo ago
-
AI Software Engineer Intern · CNY 38K-50K · CUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batching · On-site work · Entry-level Full Time Internship · CHN - Minhang, China · 1mo ago
-
Agent Orchestration | Automated testing | Benchmarking | CUDA | CUDA Compiler · Equity | Health benefits | Paid time off · Senior-level Full Time · US, CA, Santa Clara, United States · 1mo ago
-
Principal High-Performance LLM Training Engineer · USD 272K-431K · Activation checkpointing | Benchmarking | CUDA | Communication and Computation Overlap | Compilers · Benefits | Equity · Senior-level Full Time · US, CA, Santa Clara, United States · 1mo ago
-
Sr. HPC Systems Architect (Storage) · USD 129K-220K · Apache | BeeGFS | CI/CD | CPU architecture | Chef · 401k matching | Dental | Development and career growth | Employee assistance program | Employee stock purchase program · Senior-level Full Time · USA-MI-Ann Arbor-KLA, United States · 1mo ago
-
Asynchronous programming | Asyncio | Deep learning | DeepSpeed | Distributed Training · Senior-level Full Time · China, Shanghai · 1mo ago
-
AI Engineer (Fluent in Spanish OR Portuguese) · GBP 55K-80K · Artificial Intelligence | Batching | CI/CD | CUDA | Caching · Additional paid leave purchase option | Employee assistance program | Employee stock purchase plan | Hybrid work | Learning and development · Mid-level Full Time · London, United Kingdom · 1mo ago
-
AI Inference | Algorithms | C# | C++ | Computer Architecture · Hybrid work model | In-office collaboration | Remote work flexibility · Mid-level Full Time · KOR - Seoul, Korea, Republic of · 1mo ago
-
ARM Assembly | Adreno GPU | Annotation | Assembly | Binary Analysis · Senior-level Full Time · Dublin, Ireland · 1mo ago
-
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles · USD 184K-356K · C++ | CUDA | Cutlass | Efficient Attention | GPU Architecture · Senior-level Full Time · US, CA, Santa Clara, United States · 1mo ago