Applied AI Research Intern (USA)
North America/Remote
Articul8
Welcome to Articul8 - The GenAI platform that brings order to chaos.At Articul8 AI, we relentlessly pursue excellence and create exceptional AI products that exceed customer expectations. We are a team of dedicated individuals who take pride in our work and strive for greatness in every aspect of our business.
Job Description:Articul8 AI is seeking an exceptional Applied AI Researcher Intern to join us in shaping the future of Generative Artificial Intelligence (GenAI). As a member of our Applied Research team, you will be responsible for implementing novel algorithms and models capable of handling diverse modalities such as text, images, audio, video, and time series data.
Responsibilities:Serve as the subject matter expert in various domains such as data pipelines, pre-training and post-training, reinforcement learning, model architecture development and optimization, multi-expert systems, and multimodal models and techniques.
Play a pivotal role in pioneering technologies through all stages, from initial brainstorming and experimentation to validation and deployment.
Collaborate with cross-functional teams to seamlessly incorporate innovation and maintain our product technology leadership.
Continuously stay abreast of emerging trends and advancements in of GenAI and associated fields, while disseminating appropriate research results at top-tier conferences and journals.
Education: Enrolled in a Master's (MSc) or Doctoral (PhD) program focusing on Machine Learning, Deep Learning, Computer Science, Statistics, Mathematics, Engineering, or a closely related discipline.
Core technical skills:
Machine Learning: A solid understanding of machine learning algorithms, neural networks, and deep learning techniques. Familiarity with popular frameworks such as PyTorch and/or TensorFlow.
Mathematics: Strong foundations in algebra, calculus, optimization, graph theory, and numerical methods.
Deep expertise in at least one GenAI modality (e.g., text, image, audio, etc.).
Data Wrangling and Preparation: expertise in handling large datasets, data cleaning, normalizing, transforming, and preparing them for model training.
Model Evaluation and Interpretation: ability to assess model performance, compare different models, interpret results, and identify potential issues. Understanding evaluation metrics, bias-variance tradeoff, overfitting, underfitting, regularization techniques, and hyperparameter tuning.
Programming Skills: Proficiency in programming languages such as Python and experience working with version control systems (e.g., Git) and collaborating on code repositories is crucial.
Proven track record of publications in top-tier conferences and journals
Experience developing tools, libraries, and infrastructure for data preprocessing, model training/finetuning, and deployment of LLMs in research and production environments.
Experience with cloud computing platforms such as AWS, Azure, or GCP.
Problem Solving: ability to break down complex problems into manageable components, devising creative solutions, and iteratively refining ideas based on feedback and experimental evidence.
Collaboration and Communication: proficiency in working within cross-functional teams - communicating clearly, providing constructive criticism, delegating responsibilities, and respecting diverse perspectives.
Critical Thinking: ability to carefully evaluate assumptions, questioning established methodologies, challenging own biases, and maintaining skepticism when interpreting results.
Curiosity and Continuous Learning: ability to stay curious about advances in related fields and constantly seeking opportunities to expand knowledge base.
Emotional Intelligence and Intellectual Humility: capable of displaying empathy, resilience, adaptability, and self-awareness. Ability to recognize own limitations, embracing uncertainty, acknowledging mistakes, and valuing others' contributions.
If you're ready to join a team that's changing the game, apply now to become a part of the Articul8 team.
Tags: Architecture AWS Azure Computer Science Data pipelines Deep Learning Engineering GCP Generative AI Git LLMs Machine Learning Mathematics Model training PhD Pipelines Python PyTorch Reinforcement Learning Research Statistics TensorFlow
Perks/benefits: Career development Conferences
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.