Software Engineer, LLM Compilation
Tasks
- Debug performance issues with the hardware team
- Implement FP8 quantization for FP16 models
- Implement drop-in compatibility for transformer models
- Implement host CPU and accelerator synchronization
- Implement model parallelism and normalization components
- Integrate with vLLM and Hugging Face Transformers
- Optimize transformer attention kernels
Skills/Tech-stack
Attention | C++ | CUDA | FP16 | FP8 | Hugging Face | Hugging Face Transformers | Machine Learning | Model Parallelism | Normalization | Quantization | vLLM
Education
N/A