AI/ML Engineer (NYC)

New York City


US Mobile is on a mission to revolutionize connectivity. Imagine a world where you can go into a single app and buy terabytes of data for every one of your devices: phone, smart devices, car, home broadband, and more. That’s the future that US Mobile is building: a software platform built truly for the 21st century and the age of 5G and IoT, with world-class engineering, best-in-class user experience, and features that will define the next generation of connectivity.
At the core of it all, we have a team and culture that has been recognized by Forbes as one of the top 500 best startup employers in the US. Our team spans diverse backgrounds, cultures, and stories, with employees coming from 20+ countries.
We're a venture-backed company entering hypergrowth, having recently ranked 94th on the Inc. 5000 list of the fastest-growing private companies in America, and we’re looking for someone exceptional to join our team.
Job Description:
We’re looking for an AI/ML Engineer who will develop, optimize, and scale machine learning models that power our next generation of user experiences. Working closely with product, engineering, and design, you’ll ensure our ML tools truly address user needs—whether they’re discovering new features, troubleshooting connectivity, or receiving proactive solutions to common issues.
The role is based out of our New York office.

Key Responsibilities:

  • Design & Deploy Conversational / Multi-Agent LLM Solutions
      • Craft multi-agent conversational flows capable of handling a wide range of user requests, both purely informational and action-oriented
      • Employ advanced LLM techniques (prompt engineering, context retrieval, multi-step reasoning) to ensure robust, context-aware dialogues

  • Multi-Modal & Multi-Model Integration
      • Explore different input/output formats (e.g., text, and potentially voice or image-based flows) to enrich user interactions
      • Evaluate different models based on their intended use case, considering both technical capabilities and cost efficiency

  • Platform & Pipeline Building
      • Work with cross-functional teams to design data pipelines that feed your models real-time or near-real-time data
      • Implement best practices around model lifecycle management: versioning, containerization, deployment orchestration, etc.

  • Optimization & Scale
      • Ensure the chat system can handle thousands (eventually millions) of concurrent interactions, maintaining low latency and high availability
      • Monitor performance, define metrics (latency, user success rate, fallback rate, etc.), and iteratively improve

  • Ongoing Innovation & Experimentation
      • Remain current on the rapidly evolving AI/ML landscape, especially in generative models, multi-agent orchestration, and knowledge retrieval
      • Propose new ways to extend AI across our platform (e.g., advanced personalization and proactive customer engagement)

Qualifications:

  • Core AI/ML Expertise
      • 3+ years of hands-on experience building and deploying machine learning solutions at scale
      • Solid understanding of NLP techniques, including transformer models and embeddings, with hands-on experience using modern tools like Hugging Face, AWS Bedrock, and OpenAI’s API

  • Backend & Data Infrastructure
      • Proficient in Python or a similar language for data pipelines and model development
      • Experience with cloud platforms (AWS strongly preferred), containerization (Docker, Kubernetes), and microservices

  • Research & Problem-Solving Mindset
      • Up-to-date on AI/ML trends, especially in multi-agent systems, generative modeling, or multi-modal approaches
      • Skilled at diagnosing bottlenecks, scaling solutions, and balancing innovation against real-world constraints

  • Collaboration & Communication
      • Comfortable presenting complex ML concepts to non-technical stakeholders
      • Passion for iterative development, with the ability to pivot based on user feedback and product metrics

Bonus Points:

  • Familiarity with vector search solutions (e.g., Pinecone, Weaviate, or Elasticsearch with vector plugins)
  • Familiarity with building or deploying large language models and related tooling in the AWS Bedrock ecosystem
  • Experience designing or contributing to multi-agent LLM frameworks or orchestrations (e.g., specialized agent-based approaches in advanced NLP)

Benefits:

  • Competitive salary: $180k-$240k (NYC-based)
  • Gym reimbursement
  • Free cellular service on the best network in the US
  • Free lunch in NYC office & fully stocked kitchen
  • MetroCard reimbursement
  • Flexible working hours
Think you’d be a great fit? Apply to learn more!


Region: North America
Country: United States
