ML Research Engineer

United States - Remote

Full Time Senior-level / Expert USD 155K - 196K

Accretive Technology Group

Founded in 1997, Accretive Technology Group is a global market leader in live streaming. We create live digital experiences that are almost tangible. Our goal is to push our technology and development even further, to provide better experiences...

View all jobs at Accretive Technology Group

Apply now Apply later

Posted 5 hours ago

Location: Flexible — Remote or hybrid options available within the following states: Arizona, California, Colorado, Florida, Idaho, Michigan, Missouri, Nevada, South Carolina, Texas, or Washington.

Company Overview:
Accretive Technology Group is an established, fast-paced web company based in the heart of Seattle searching for world-class talent for a variety of tech-based careers. We are a market leader in web-based live video streaming with over 20 years in the industry. In ATG's R&D Department, Lively Video, our specialty lies in offering a video streaming platform tailored for clients who aim to provide immersive real-time experiences to their users. Join us in shaping the future of streaming services with your expertise and passion for innovation.

About the Role
We're seeking an ML Research Engineer to work at the intersection of research and engineering, focusing on developing and implementing cutting-edge AI solutions. The ideal candidate will excel at translating complex research into deployable, business-impacting solutions while advancing our capabilities in large language models (LLMs), computer vision, and generative AI, with particular emphasis on optimizing models for resource-constrained environments.

The ideal candidate will thrive in a research-driven environment, work independently to pursue novel ideas, and have the engineering expertise to implement solutions at scale. We particularly value experience in optimizing and deploying models in resource-constrained environments. We seek individuals who can balance theoretical understanding with practical implementation and are passionate about pushing the boundaries of what's possible in AI.

Core Responsibilities:

Drive independent research initiatives in natural language processing and computer vision, with a focus on LLMs and image-generation models
Design and implement novel architectures for model fine-tuning and adaptation
Develop and optimize training pipelines for both foundation models and domain-specific applications
Optimize models for deployment in resource-constrained environments (edge devices, browsers, embedded systems)
Collaborate with developers to design and guide the creation of robust tooling for model interaction, including interfaces for classification, data annotation, and model evaluation
Work with infrastructure teams to develop scalable systems for model deployment and monitoring
Contribute to the company's intellectual property through patent applications
Conduct rigorous experimentation to advance our understanding of model behaviors
Collaborate with product teams to translate research insights into practical applications

Required Qualifications:

MS/PhD in Computer Science, Machine Learning, or related field, or equivalent experience
Demonstrated expertise in training and fine-tuning large language models
Strong background in generative AI architectures
Experience with distributed training systems and optimization techniques
Experience with model optimization and deployment in resource-constrained environments
Proven track record of independent research, evidenced by technical blog posts, projects, or patents
Experience building tools and interfaces for model interaction and evaluation

Technical Skills:

ML Frameworks: PyTorch (primary), TensorFlow, ONNX
Model Optimization: TensorFlow Lite, ONNX Runtime, model quantization, pruning
Edge Deployment: WebGL, WebAssembly, TensorFlow.js, ONNX.js
ML Libraries: Transformers, Diffusers
Infrastructure: Cloud platforms (AWS/GCP/Azure), Kubernetes, Docker
ML Tools: Experiment tracking platforms (e.g., Weights & Biases, MLflow)
Programming: Polyglot
Distributed Training: Experience with distributed training frameworks and optimization

Research Areas of Focus:

Parameter-efficient fine-tuning methods (LoRA, QLoRA)
Multi-modal model architectures
Model compression, quantization, and optimization for constrained environments
Edge deployment strategies and optimization techniques
Prompt engineering and zero-shot learning
Diffusion model optimization
Ethical AI and responsible deployment
Vector databases and similarity search
Model evaluation and analysis tools

What We Offer:

Autonomy to pursue research directions that interest you
Access to diverse deployment environments for experimentation

Company Benefits/Perks:

Employer-paid Medical, Dental, and Vision benefits
Life & Disability Insurance Coverage
Health Care FSA
Day Care FSA
Tradition and Roth 401(k) with a 50% contribution match (no limit)
Generous Vacation and PTO plan
Paid Holidays
Semi-Annual Profit Sharing
Gym/Equivalent Exercise Program Reimbursement
$175 Transportation Reimbursement ($100 of this may be used for home internet for remote and hybrid employees)

A reasonable, good-faith estimate of the minimum and maximum base salary for this position is $155,000-$196,000. This position will also include a profit sharing that is dependent on a variety of factors.

Employment opportunities and job offers at Accretive Technology Group will always come from Accretive Technology Group’s Talent Acquisition and hiring teams. Never provide sensitive, personal information to someone unless you’re confident who the recipient is. Accretive Technology Group does not extend job offers via email or any other messaging tools to individuals to whom we have not made prior contact. Accretive Technology Group will never send you money or request you return any money back to our company for any reason. Our email domain is @accretivetg.com. The official website to find and apply for job openings at Accretive Technology Group is https://accretivetg.com/

Accretive Technology Group is an Equal Employment Opportunity employer. All qualified candidates will receive consideration for employment without regard to race, color, religion, sex, or national origin.

Unfortunately, we do not provide visa sponsorship, visa transfer, or corp-corp arrangements.
Agencies - NO unsolicited submissions will be accepted and if any Agency does submit an unsolicited candidate that Agency shall have no recourse from Accretive Technology Group.

Apply now Apply later

Job stats: 0 0 0

Categories: Engineering Jobs Machine Learning Jobs Research Jobs

Tags: Architecture AWS Azure Classification Computer Science Computer Vision Docker Engineering Excel GCP Generative AI Kubernetes LLMs LoRA Machine Learning MLFlow Model deployment NLP ONNX PhD Pipelines Prompt engineering PyTorch R R&D Research Streaming TensorFlow Transformers Weights & Biases