Software Engineer, LLM Compilation
Tasks
- Debug performance issues with the hardware team
- Implement FP8 quantization for FP16 models
- Implement drop-in compatibility for transformer models
- Implement host CPU and accelerator synchronization
- Implement model parallelism and normalization components
- Integrate with vLLM and Hugging Face Transformers
- Optimize transformer attention kernels
Skills/Tech-stack
Attention | C++ | CUDA | FP16 | FP8 | Hugging Face | Hugging Face Transformers | Machine Learning | Model Parallelism | Normalization | Quantization | vLLM
Education
N/A