Find jobs in AI/ML, Data Science and Big Data
47 results
for Inference Engineer
(Role)
-
Senior Inference Engineer - AI USD 100K-204KAPI Design | API Integration | AWS | Azure | C++Career development | Flexible work hours | Hybrid work model | Mental health days | Retirement savingsSenior-level Full TimeUnited States of America, Eagan, Minnesota R2d ago
-
C++ | Diffusion Models | Edge Computing | Ggml | JavaScriptCollaboration with top talent | Fully remote | Globally distributed work environment | High ownership | Innovation-focused environmentSenior-level Full TimeNigeria R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Fully remote | Global distributed team | High ownership | Innovation-focused environmentSenior-level Full TimeColombia R5d ago
-
C plus plus | Diffusion Models | Edge Computing | Ggml | Language ModelsFast-paced environment | Fully remote | Global collaboration | High ownership | Innovation focusSenior-level Full TimeArgentina R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Fully remote | Globally distributed team | High ownership | Innovation and experimentationSenior-level Full TimeKenya R5d ago
-
C++ | Diffusion Models | Edge Computing | Ggml | JavaScriptCollaboration with top talent | Fast-paced innovation | Fully remote | Global distributed team | High ownershipSenior-level Full TimeLuxembourg R5d ago
-
C++ | Diffusion Models | Edge Computing | Ggml | JavaScriptCollaborative environment | Competitive compensation | Fast-paced innovation | Fully remote | Global distributed work environmentSenior-level Full TimeNorway R5d ago
-
C++ | Deep learning | Edge Computing | Ggml | JavaScriptCollaboration with top talent | Fast-paced experimentation | Fully remote | Global distributed team | High ownershipSenior-level Full TimeNew Zealand R5d ago
-
C++ | Deep learning | Diffusion Models | Ggml | JavaScriptCollaboration with top talent | Exposure to advanced AI frameworks | Fully remote | High ownership | Innovation-focused environmentSenior-level Full TimeHungary R5d ago
-
C++ | Diffusion Models | Edge Computing | Ggml | JavaScriptCollaboration with top talent | Fully remote | Globally distributed work environment | High ownership and impact | Innovation-focused environmentSenior-level Full TimeCzechia R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Competitive compensation | Exposure to advanced AI frameworks | Fast-paced innovation environment | Fully remoteSenior-level Full TimeBulgaria R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Fast-paced innovation environment | Fully remote | Global distributed team | High ownershipSenior-level Full TimeFinland R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with experts | Exposure to cutting edge AI frameworks | Fully remote | Globally distributed team | High ownershipMid-level Full TimeSaudi Arabia R5d ago
-
C++ | Deep learning | Edge Computing | Ggml | JavaScriptCollaborate with top talent | Fully remote work | Global distributed team | High ownership | Innovation-driven environmentSenior-level Full TimeGreece R5d ago
-
C plus plus | Deep learning | Diffusion Models | Edge Computing | GgmlDynamic fast-paced environment | Fully remote | Global collaboration | High ownership | Opportunities to work on decentralized technologiesSenior-level Full TimeDenmark R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaborative engineering culture | Fully remote work | Globally distributed team | High ownership | Innovation-focused environmentSenior-level Full TimeBelgium R5d ago
-
C plus plus | Deep learning | Diffusion Models | Edge Computing | GgmlCollaborative team environment | Fully remote work | Global distributed team | High ownership and impact | Innovation-focused cultureSenior-level Full TimeAustralia R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlFast-paced environment | Fully remote | Globally distributed team | High ownership | Innovation-focused workMid-level Full TimeAustria R5d ago
-
C plus plus | Deep learning | Edge AI | Ggml | JavaScriptFully remote | Global distributed team | High ownership | Innovation-focused environment | Top talent collaborationSenior-level Full TimeSweden R5d ago
-
C++ | Deep learning | Edge Computing | Ggml | Inference OptimizationCollaboration with top talent | Fully remote work | Global distributed team | High ownership | Innovation-focused environmentSenior-level Full TimeUnited Arab Emirates R5d ago
-
C plus plus | Deep learning | Diffusion Models | Edge AI | GgmlCollaboration with top talent | Fully remote | Globally distributed team | High ownership | Innovation-focused environmentSenior-level Full TimePoland R5d ago
-
C plus plus | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Fully remote | Global distributed team | High ownership | Innovation-focused environmentSenior-level Full TimeChile R5d ago
-
C++ | Deep learning | Diffusion Models | Ggml | JavaScriptCollaboration with top talent | Fast-paced innovation | Fully remote | Globally distributed team | High impactSenior-level Full TimeTurkey R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlExposure to cutting-edge AI | Fully remote work | Global distributed team | High ownership | Innovation-focused environmentMid-level Full TimeMexico R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlAccess to advanced AI frameworks | Distributed work environment | Fully remote | High ownership | Innovation-focused environmentSenior-level Full TimeSouth Africa R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Fully remote | Global distributed team | High ownership | Innovation-focused environmentSenior-level Full TimeEstonia R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Exposure to advanced AI frameworks | Fast-paced innovation environment | Fully remote | Globally distributed work environmentSenior-level Full TimeSlovenia R5d ago
-
C++ | Deep learning | Diffusion Models | Edge AI | GgmlCollaboration with top talent | Fast-paced environment | Fully remote | Globally distributed team | High impactSenior-level Full TimeCroatia R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlFast-paced environment | Fully remote | Global collaboration | High ownershipSenior-level Full TimeIsrael R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCareer growth in AI systems | Collaboration with top talent | Distributed work environment | Fully remote | High ownershipMid-level Full TimeIreland R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Exposure to advanced AI frameworks | Fully remote | Global distributed work environment | High ownership and impactMid-level Full TimeRomania R5d ago
-
C++ | Deep learning | Ggml | JavaScript | Language ModelsCollaboration with top talent | Exposure to advanced AI technologies | Fully remote | Global distributed work environment | High ownershipSenior-level Full TimePortugal R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaborative team | Fully remote | Global distributed work environment | High ownership | Innovation-focused environmentSenior-level Full TimeNetherlands R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCareer growth opportunities | Fully remote | Global distributed team | High ownership | Innovation-focused environmentSenior-level Full TimeItaly R5d ago
-
C++ | Deep learning | Edge Computing | Ggml | Inference OptimizationFast-paced innovation | Fully remote work environment | Global collaboration | High ownership and impactSenior-level Full TimeSwitzerland R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Fully remote work | Global distributed team | High ownership | Innovation-focused environmentSenior-level Full TimeBrazil R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlCollaboration with top talent | Exposure to advanced AI frameworks | Fast-paced innovation | Fully remote | Global distributed work environmentSenior-level Full TimeFrance R5d ago
-
C++ | Diffusion Models | Edge Computing | Ggml | JavaScriptCollaboration with top talent | Exposure to advanced AI frameworks | Fully remote | Global distributed work environment | High ownershipSenior-level Full TimeSpain R5d ago
-
C++ | Deep learning | Diffusion Models | Edge Computing | GgmlFully remote | Global collaboration | High ownership | Opportunity for innovationSenior-level Full TimeGermany R5d ago
-
C++ | Deep learning | Ggml | JavaScript | Latency optimizationFully remote | Globally distributed team | High impact | High ownership | Innovation-focused environmentMid-level Full TimeCanada R5d ago
-
C++ | Deep learning | Edge Computing | Ggml | JavaScriptCollaborative team | Exposure to cutting-edge technology | Fast-paced environment | Fully remote | High ownershipSenior-level Full TimeIndia R5d ago
-
Inference Technical Lead, Sora USD 380KData Movement | GPU Computing | Kernel optimization | Low-level programming | Model InferenceHybrid work model | Relocation assistanceSenior-level Full TimeSan Francisco8d ago
-
Senior-level Full TimeDublin, Ireland10d ago
-
AI Inference Engineer - Model Optimization & Deployment USD 205K-303KAccuracy evaluation | BF16 | C++ | CUDA | CUDA kernelsSenior-level Full TimeFoster City, CA15d ago
-
AI Platform Engineer INR 1500K-2000KAlerting | CUDA | Capacity Planning | Continuous batching | Distributed tracingMid-level Full TimeBangalore, India17d ago
-
具身世界模型推理INFRA工程师 - XiaomiRobotics CNY 240K-480KCFG Parallelism | Diffusion Models | Expert parallelism | FP8 | Multi Token PredictionSenior-level Full Time北京21d ago
-
Generative AI Inference Engineer USD 152K-287KAWS | CUDA | Cloud platform | Diffusion Models | DockerSenior-level Full TimeUnited States1mo ago