LLM Fine-Tuning Engineer
Tasks
- Build scalable training pipelines using distributed training frameworks
- Collaborate with cross functional teams to align fine tuning roadmaps
- Design and execute fine tuning experiments for large language models
- Design evaluation suites including automated benchmarks and human evaluation
- Document training methodology results and decisions
- Implement parameter-efficient fine-tuning methods
- Implement safety refusal and policy evaluations
- Lead dataset construction curation and quality assurance
- Manage model artifacts lineage tracking and reproducibility
- Mentor engineers on fine tuning best practices and responsible deployment
- Operate large scale training jobs on GPU clusters
- Optimize training throughput using mixed precision and efficient attention
- Stay current with LLM research and translate advances into production
- Tune hyperparameters and optimizer configurations for training stability
Perks/Benefits
Skills/Tech-stack
Adapters | Attention Optimization | Cluster operations | Data Generation | DeepSpeed ZeRO | Direct Preference Optimization | Distributed Training | Evaluation methodology | FSDP | GPU Cluster | GPU Cluster Operations | Human Feedback | Human evaluation | Hyperparameter Tuning | Language Models | Large Language Models | Learning from Human Feedback | LoRA | Mixed Precision | Model Reproducibility | Pipeline parallelism | Preference optimization | PyTorch | Python | QLoRA | Reinforcement Learning | Reinforcement Learning from Human Feedback | Safety evaluation | Sequence Packing | Supervised Learning | Synthetic Data Generation | Synthetic data
Education
Roles
Related jobs
-
Senior Systems Engineer, Storage - DGX Cloud USD 208K-414KAlerting | Algorithms | Ansible | Argo CD | CI/CDSenior-level Full TimeUS, CA, Remote, United States R19h ago
-
Azure Data | Azure Data Factory | Azure DevOps | CI/CD | Data Factory401k match | Disability insurance | Education benefit | Employee stock purchase plan | Life insuranceSenior-level Full TimePrudential Tower, 655 Broad Street, Newark, … R19h ago
-
Machine Learning Engineer USD 140K-220KApache Spark | Azure | Azure Machine Learning | CI/CD | Cloud StorageCareer development opportunities | High responsibilitySenior-level Full TimeUnited States - Remote R1d ago
-
Data Processing | GRPC | GraphQL | Large Scale Data | Large-scaleDirect product impact | Experimentation | Fast-paced startup culture | Rapid iteration | Remote OKMid-level Full TimeNew York, New York, United States R1d ago
-
Senior Data Engineer USD 150K-220KAWS | Anomaly Detection | DBT | Data Observability | Data QualityFully remoteSenior-level Full TimeRemote (U.S. based) R1d ago
-
Senior-level Full TimePennsylvania-Remote, United States R2d ago
-
Senior AI Engineer USD 147K-198KA/B | A/B Testing | API Development | Agentic Workflows | B testingSenior-level Full TimePennsylanvia-Remote, United States R2d ago
-
Senior Software Engineer (Pipeline team) USD 185K-259KA/B | A/B Testing | AWS Bedrock | AWS Lambda | AWS S3Senior-level Full TimeUnited States - Remote R2d ago
-
C# | MATLAB | NumPy | Numerical analysis | PandasFreelance opportunities | Part-time project-based workSenior-level FreelanceUnited States - Remote R2d ago
-
Senior GenAI Software Engineer (North America) USD 165K-230KA/B | A/B Testing | B testing | Debugging | EvaluationEquity | Health, dental, and vision benefits | In person team gatherings quarterly | Remote-first work | Wellness stipendsSenior-level Full TimeUnited States R2d ago
-
Machine Learning Scientist, BioML USD 200K-330KAWS | Azure | Bioinformatics | Cloud Computing | Computational Biology401k employer match | Equity participation | Health, dental, vision insurance | Paid time off | Professional developmentMid-level Full TimeEmeryville, California, United States; Hybrid (2-3 … R2d ago
-
Machine Learning Platform Engineer USD 135K-160KAmazon SageMaker | Apache Flink | C++ | CI/CD | Cloud PubSub401k match | Annual bonus | Company equipment provided | Company medical dental vision plans | Disability benefitsMid-level Full TimeAtlanta, GA preferred, Remote R2d ago
-
Machine Learning Engineer, Customer Support Engineering USD 162K-186KAgent Orchestration | Agent systems | Artificial Intelligence | Autonomous Reasoning | Fine TuningSenior-level Full TimeRemote-USA R2d ago
-
Senior Developer Advocate - Modern App Development USD 194K-237KAPI Integrations | AWS | Cloud platform | Code Quality | Google CloudCommunity groups | Employee stock purchase plan | Inclusion talks | Mental health benefits | Mentor/Buddy programSenior-level Full TimeCalifornia, USA, Remote; Nevada, USA, Remote; … R2d ago
-
Staff Software Engineer, AI Developer Tools USD 180K-245KAPI Design | Agent systems | CI/CD | Compliance | Data PrivacySenior-level Full TimeDenver, CO;San Francisco, CA;New York, NY;Seattle, … R2d ago
-
Staff Software Engineer, Big Data Storage USD 177K-364KApache Flink | Apache Hive | Apache Iceberg | Apache Spark | Column BackfillSenior-level Full TimePalo Alto, CA, US; Remote, US R2d ago
-
Senior AI Data Engineer USD 160K-200KAWS Glue | AWS Lambda | Amazon Kinesis | Amazon Redshift | Amazon S3401k matching | Dental insurance | Disability insurance | Life insurance | Medical insuranceSenior-level Full TimeSan Diego, California, United States R2d ago
-
Senior Embedded Software Engineer - Future Forward USD 153K-201KAuthentication | Board Bring-up | Bring-up | C# | C++Senior-level Full TimeSunnyvale, CA, United States R3d ago
-
Lead AI Engineer, Business Operations (Hybrid or Remote USD 150K-220KAPI Design | Backend Development | Cloud Platforms | Evaluation Frameworks | Fine Tuning401k company match | Career advancement opportunities | Dental insurance | Flexible time off policy | Life insuranceSenior-level Full TimeDallas, Texas, United States; United States R3d ago
-
AWS | Airflow | Apache Spark | Azure Synapse | Azure Synapse Analytics401k matching | Disability insurance | Employee assistance program | Life insurance | Medical/Dental/Vision insuranceMid-level Full TimeRemote, USA ; Remote, Canada R3d ago
-
Principal Data Engineer/ Technical Lead USD 219K-298KAWS | Access Layer | Aggregation pipelines | Apache Kafka | Apache Spark401k match | Employer paid medical/dental/vision | Flexible spending account | Paid parental leave | Remote first work from homeSenior-level Full TimeUnited States (Remote) R3d ago
-
Senior Software Engineer II - (AI Core Platform) USD 100K-177KAPI Development | API Gateway | AWS | Agile | AlertingMid-level Full TimeRemote, United States R3d ago
-
Senior Software Engineer I - AI/ML USD 145K-190KAPI Development | Agile | Alerting | CI/CD | Data ModelingSenior-level Full TimeRemote, United States R3d ago
-
AI Expert USD 148K-175KAWS | Agile | Batch Processing | Data Mapping | Data ModelingHybrid work | Public Trust Clearance | Remote workSenior-level Full TimeMemphis, TN, United States R3d ago
-
Machine Learning Engineer II USD 160K-210KAirflow | Apache_Spark | Autoscaling | C++ | CI_CDDental insurance | Disability insurance | Flexible vacation | Health insurance | Life insuranceSenior-level Full TimeRemote, USA R3d ago