ML Research Engineer
United States - Remote
Full Time Senior-level / Expert USD 155K - 196K
Accretive Technology Group
Founded in 1997, Accretive Technology Group is a global market leader in live streaming. We create live digital experiences that are almost tangible. Our goal is to push our technology and development even further, to provide better experiences...Company Overview:
Accretive Technology Group is an established, fast-paced web company based in the heart of Seattle searching for world-class talent for a variety of tech-based careers. We are a market leader in web-based live video streaming with over 20 years in the industry. In ATG's R&D Department, Lively Video, our specialty lies in offering a video streaming platform tailored for clients who aim to provide immersive real-time experiences to their users. Join us in shaping the future of streaming services with your expertise and passion for innovation.
About the Role
We're seeking an ML Research Engineer to work at the intersection of research and engineering, focusing on developing and implementing cutting-edge AI solutions. The ideal candidate will excel at translating complex research into deployable, business-impacting solutions while advancing our capabilities in large language models (LLMs), computer vision, and generative AI, with particular emphasis on optimizing models for resource-constrained environments.
The ideal candidate will thrive in a research-driven environment, work independently to pursue novel ideas, and have the engineering expertise to implement solutions at scale. We particularly value experience in optimizing and deploying models in resource-constrained environments. We seek individuals who can balance theoretical understanding with practical implementation and are passionate about pushing the boundaries of what's possible in AI.
Core Responsibilities:
- Drive independent research initiatives in natural language processing and computer vision, with a focus on LLMs and image-generation models
- Design and implement novel architectures for model fine-tuning and adaptation
- Develop and optimize training pipelines for both foundation models and domain-specific applications
- Optimize models for deployment in resource-constrained environments (edge devices, browsers, embedded systems)
- Collaborate with developers to design and guide the creation of robust tooling for model interaction, including interfaces for classification, data annotation, and model evaluation
- Work with infrastructure teams to develop scalable systems for model deployment and monitoring
- Contribute to the company's intellectual property through patent applications
- Conduct rigorous experimentation to advance our understanding of model behaviors
- Collaborate with product teams to translate research insights into practical applications
- MS/PhD in Computer Science, Machine Learning, or related field, or equivalent experience
- Demonstrated expertise in training and fine-tuning large language models
- Strong background in generative AI architectures
- Experience with distributed training systems and optimization techniques
- Experience with model optimization and deployment in resource-constrained environments
- Proven track record of independent research, evidenced by technical blog posts, projects, or patents
- Experience building tools and interfaces for model interaction and evaluation
- ML Frameworks: PyTorch (primary), TensorFlow, ONNX
- Model Optimization: TensorFlow Lite, ONNX Runtime, model quantization, pruning
- Edge Deployment: WebGL, WebAssembly, TensorFlow.js, ONNX.js
- ML Libraries: Transformers, Diffusers
- Infrastructure: Cloud platforms (AWS/GCP/Azure), Kubernetes, Docker
- ML Tools: Experiment tracking platforms (e.g., Weights & Biases, MLflow)
- Programming: Polyglot
- Distributed Training: Experience with distributed training frameworks and optimization
- Parameter-efficient fine-tuning methods (LoRA, QLoRA)
- Multi-modal model architectures
- Model compression, quantization, and optimization for constrained environments
- Edge deployment strategies and optimization techniques
- Prompt engineering and zero-shot learning
- Diffusion model optimization
- Ethical AI and responsible deployment
- Vector databases and similarity search
- Model evaluation and analysis tools
- Autonomy to pursue research directions that interest you
- Access to diverse deployment environments for experimentation
- Employer-paid Medical, Dental, and Vision benefits
- Life & Disability Insurance Coverage
- Health Care FSA
- Day Care FSA
- Tradition and Roth 401(k) with a 50% contribution match (no limit)
- Generous Vacation and PTO plan
- Paid Holidays
- Semi-Annual Profit Sharing
- Gym/Equivalent Exercise Program Reimbursement
- $175 Transportation Reimbursement ($100 of this may be used for home internet for remote and hybrid employees)
Employment opportunities and job offers at Accretive Technology Group will always come from Accretive Technology Group’s Talent Acquisition and hiring teams. Never provide sensitive, personal information to someone unless you’re confident who the recipient is. Accretive Technology Group does not extend job offers via email or any other messaging tools to individuals to whom we have not made prior contact. Accretive Technology Group will never send you money or request you return any money back to our company for any reason. Our email domain is @accretivetg.com. The official website to find and apply for job openings at Accretive Technology Group is https://accretivetg.com/
Accretive Technology Group is an Equal Employment Opportunity employer. All qualified candidates will receive consideration for employment without regard to race, color, religion, sex, or national origin.
- Unfortunately, we do not provide visa sponsorship, visa transfer, or corp-corp arrangements.
- Agencies - NO unsolicited submissions will be accepted and if any Agency does submit an unsolicited candidate that Agency shall have no recourse from Accretive Technology Group.
Tags: Architecture AWS Azure Classification Computer Science Computer Vision Docker Engineering Excel GCP Generative AI Kubernetes LLMs LoRA Machine Learning MLFlow Model deployment NLP ONNX PhD Pipelines Prompt engineering PyTorch R R&D Research Streaming TensorFlow Transformers Weights & Biases
Perks/benefits: 401(k) matching Career development Flex vacation Health care Insurance
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.