Find jobs in AI/ML, Data Science and Big Data
49 results
for Mixture of Experts
(Skill/Tech stack)
-
Computer Vision | Generative Models | Information Retrieval | Language Processing | Machine LearningSenior-level Full TimeSingapore, Singapore23h ago
-
Machine Learning Engineer - TikTok Search Ranking SGD 130K-180KComputer Vision | Generative Models | Information Retrieval | Language Processing | Machine LearningSenior-level Full TimeSingapore, Singapore23h ago
-
Continuous batching | Data parallelism | Deep learning | Distributed Training | Dynamic MemoryComputational resources access | Full sponsorship | Hired by Rakuten Asia after completion | Research exchangesMid-level Full TimeCrimson House Singapore4d ago
-
Software Engineer, Systems ML USD 141K-208KC plus plus | CUDA | Co-design | Compiler optimization | Deep learningSenior-level Full TimeBellevue, WA | Menlo Park, CA …4d ago
-
Mid-level Full Time上海5d ago
-
Attention Mechanisms | Batching | CUDA | DP | Distributed SystemsMid-level Full TimeSan Jose, California, United States5d ago
-
Attention Mechanisms | Batching | CUDA | Data parallelism | Distributed SystemsSenior-level Full TimeSan Jose, California, United States5d ago
-
Attention Mechanisms | Data parallelism | Deep learning | Distributed Training | Language ModelsSenior-level Full TimeSan Jose, California, United States5d ago
-
Attention Mechanisms | Deep learning | Distributed Training | Language Models | Large Language ModelsSenior-level Full TimeSan Jose, California, United States5d ago
-
LLM Inference Frameworks and Optimization Engineer USD 160K-230KC++ | CUDA | CUDA graph | Cluster scheduling | CompilerEquity | Health insuranceMid-level Full TimeSan Francisco, Singapore, Amsterdam7d ago
-
Senior-level Full TimeTel Aviv-Yafo, Tel Aviv, ISR8d ago
-
Inference Software Engineer USD 175K-275KC++ | Compilers | Consensus Algorithms | Consistency models | DebuggingDaily meals | Housing subsidy | Medical, dental & vision coverage | Relocation support | Unlimited compute budgetMid-level Full TimeSan Jose11d ago
-
Adversarial Training | Chain-of-Thought | Computer Vision | Deep learning | Few-Shot LearningSenior-level Full TimeSingapore, Singapore13d ago
-
AI accelerators | C++ | CPU | Diffusion Models | Edge ComputingSenior-level Full TimeMountain View, CA, USA14d ago
-
Domain Adaptation | Human Feedback | Knowledge graphs | Language Models | Language ProcessingConference travel | Professional developmentMid-level Full TimeAbu Dhabi15d ago
-
Inference Intern USD 60K-142KC++ | Collective communication | Compilers | Consensus Protocols | Consistency modelsDaily meals | Direct mentorship | Housing support | Paid internshipEntry-level InternshipSan Jose16d ago
-
Senior Research Engineer, Olmo + Molmo USD 146K-220KAgentic Systems | Amazon Web Services | Cloud Computing | Cloud platform | ContainerizationFamily leave | Paid sick leave | Paid vacationSenior-level Full TimeSeattle, WA17d ago
-
BM25 | Embedding Models | Hallucination detection | Hugging Face | Hugging Face TransformersFlexible savings account | Health savings account | Medical/Dental/Vision | Paid Holidays | Paid sick timeSenior-level Full TimeSan Francisco, California (Remote) R20d ago
-
AWS Trainium | BF16 | Deep learning | Distributed Training | FP8Entry-level Full TimeCupertino, California, USA21d ago
-
AI院--训练Infra工程师 CNY 180K-300KComputer Vision | Distributed Training | Language Models | Language Processing | Large Language ModelsMid-level Full Time北京21d ago
-
Agentic AI | Artificial Intelligence | Causal Inference | Causal Models | Deep learningCoaching services | Health insurance | Learning and training access | Lunch vouchers | Mental health supportSenior-level Full TimeParis, Paris, France22d ago
-
Artificial Intelligence | Bottleneck analysis | CUDA | Deep learning | Diffusion ModelsBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States22d ago
-
Apache Spark | Causal Debiasing | Causal Inference | Counterfactual Risk Minimization | Counterfactual reasoningAnnual refresh grants | Equity grants | Flexible work schedule | Remote workSenior-level Full TimeUnited States - Remote R25d ago
-
2026 Fall Intern, Computer Vision/AI USD 96K-126KComputer Vision | Diffusion Models | Generative AI | Human Feedback | Knowledge DistillationEntry-level Internship665 Clyde Avenue, Mountain View, CA, …26d ago
-
Data Curation | Deep learning | DeepSpeed | Direct Preference Optimization | EvaluationSenior-level Full TimeSingapore, Singapore28d ago
-
Architecture Design | Autoregression | Diffusion Models | Generative Models | Knowledge DistillationEntry-level InternshipSan Jose, California, United States28d ago
-
Senior-level Full TimeMilpitas, CA, United States1mo ago
-
AI/ML ASIC Architect USD 163K-249KARM | ASIC architecture | AXI interconnect | Area Optimization | Attention MechanismsSenior-level Full TimeMilpitas, CA, United States1mo ago
-
Autoregressive models | Deep learning | Diffusion Models | Flow matching | Gaussian SplattingCompetitive salary | Flexible start dates | Holiday pay | Relocation assistance | Sick payEntry-level InternshipAmsterdam, North Holland, Netherlands1mo ago
-
Applied Scientist, Wayve Labs USD 147K-213KAutoregressive models | Depth Estimation | Diffusion Models | Foundation Models | LanguageDaily yoga | Enhanced parental leave | Flexible working hours | Hybrid working | Large Social BudgetsMid-level Full TimeSunnyvale1mo ago
-
Applied Scientist, Wayve Labs CAD 100K-132KAutoregressive models | Computer Vision | Data sets | Depth Estimation | Diffusion ModelsDaily yoga | Enhanced parental leave | Flexible working hours | Large Social Budgets | Onsite barMid-level Full TimeVancouver1mo ago
-
Applied Scientist, Wayve Labs GBP 80K-96KAutoregressive models | Depth Estimation | Diffusion Models | Foundation Models | Human FeedbackDaily yoga | Enhanced parental leave | Flexible working hours | Onsite bar | Onsite chefMid-level Full TimeLondon1mo ago
-
Senior-level Full TimeTel Aviv-Yafo, Tel Aviv, ISR1mo ago
-
Causal Inference | Cross-modal fusion | Data Modeling | Delivery Prediction | Fraud DetectionCareer growth opportunities | Onboarding supportEntry-level Full TimeSeattle, Washington, United States1mo ago
-
A/B | A/B Testing | B testing | Data Ingestion | Deep learningAnnual health check | Overseas relocation support | Visa sponsorshipSenior-level Full TimeShibuya, Tokyo, Japan1mo ago
-
Senior AI Research Engineer USD 160K-180KAttention Mechanisms | Code Quality | Data Processing | Deep learning | Diffusion ModelsDiversity and inclusion | Reasonable accommodationsSenior-level Full TimeSan Jose, California, United States1mo ago
-
Staff AI Research Engineer USD 190K-240KAttention Mechanisms | Data representation | Deep learning | Diffusion Models | Fine TuningSenior-level Full TimeSan Jose, California, United States1mo ago
-
VP of Research, Machine Learning GBP 195K-280KAgent systems | Alignment | Deep learning | GPU Computing | GuardrailsExecutive-level Full TimeUnited Kingdom1mo ago
-
Adversarial Networks | Computer Vision | Cross-modal alignment | GRPO | Generative Adversarial NetworksEntry-level InternshipSeattle, Washington, United States1mo ago
-
Adversarial Robustness | Agent learning | Audio Processing | Computer Vision | Content ModerationCareer growth | Research mentorshipNone Full TimeSan Jose, California, United States1mo ago
-
AIGC Detection | Adversarial Learning | Agentic Systems | Cross-modal alignment | GRPONone Full TimeSeattle, Washington, United States1mo ago
-
Adversarial Networks | Adversarial Training | Cross-modal alignment | GRPO | Generative Adversarial NetworksEntry-level InternshipSan Jose, California, United States1mo ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Distributed Systems | FP8 | FasterTransformer | Flash AttentionOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batchingOn-site workEntry-level Full Time InternshipCHN - Minhang, China1mo ago
-
Intern Researcher – AI Foundation Model Training CAD 58K-104KAI Agent | AI agent systems | Agent systems | Architecture Search | Computational Graph OptimizationEntry-level InternshipMarkham, Ontario, Canada1mo ago
-
AI/ML Developer INR 1000K-1890KAWS SageMaker | Accelerate | Adapter Layers | Apache Spark | Azure Machine LearningMid-level Full TimeGurugram, India1mo ago
-
C++ | CUDA | CUDA profiling | Collective communication | Communication Compute OverlapSenior-level Full TimeIsrael, Tel Aviv R1mo ago
-
Causal Inference | Cross-modal fusion | Data Modeling | Direct Preference Optimization | Graph Neural NetworksEntry-level Full TimeSeattle, Washington, United States1mo ago
-
Causal Inference | Cross-modal fusion | DPO | Data Modeling | Deep learningEntry-level Full TimeSan Jose, California, United States1mo ago