Solution Architect - Agentic AI
Japan, Tokyo
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
NVIDIA
NVIDIA on grafiikkasuorittimen keksijä, jonka kehittämät edistysaskeleet vievät eteenpäin tekoälyn, suurteholaskennan.NVIDIA is a world leader in computer graphics, artificial intelligence, and accelerated computing. For over 25 years, NVIDIA has been at the forefront of research and engineering around the greatest advances in technology. Our history of innovation drives us to solve the world's hardest problems.
We are looking for a Solution Architect to work with our customers and partners in Japan, promoting the adoption of Agentic AI and providing technical support to enable them to use our portfolio of GPU-accelerated computing solutions—including machine learning, deep learning, and generative AI.
What you'll be doing:
Exploring the latest advancement in model training, fine tuning and customization, while supporting building agentic LLM applications.
Enabling NVIDIA strategic customers to build enterprise AI solutions using accelerated computing stack including NIMs and NeMo microserviecs.
Collaborate with developers and onboard them to NVIDIA AI platforms and services by providing deep technical guidance.
Establishing and building repeatable reference architecture, communicate standard processes and understand solution trade-offs. Share findings and feedback to improve products and services.
Drive pre-sales conversations, build architectures and demos to accelerate the customer AI journey based on NVIDIA products, and work closely with Sales Account Managers to secure design wins.
Create or run Proofs of Concept and demos that require presentation skills, the explanation of complex topics, and Python coding to execute data pipelines, train ML/DL models, and deploy them on container-based orchestrators.
What we need to see:
Excellent verbal, written communication, and technical presentation skills in Japanese. Business level English communication is also a requirement.
BS or MS in Computer Science, Engineering, Mathematics, or Physics (or equivalent experience)
5+ years of industry or academic experience related to Generative AI or Deep Learning
Strong coding development and debugging skills. Including experience with Python, C/C++, Bash, and Linux
Ability to multitask effectively in a dynamic environment
Strong analytical and problem-solving skills
Proactive and have a strong desire to share knowledge with clients, partners and co-workers
Ways to stand out from the crowd:
Expertise in deploying large-scale training and inferencing pipeline
Experience with pre-training, post-training of transformer-based architectures for language or vision
A deep understanding of the latest generative AI or deep learning methods and algorithms
Experience using or operating Kubernetes, as well as experience writing or customizing Kubernetes configurations
NVIDIA is widely considered to be one of the technological world’s most desirable employers. We have some of the most brilliant and talented people in the world working for us. If you're creative and autonomous, we want to hear from you!
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Computer Science Data pipelines Deep Learning Engineering Generative AI GPU Kubernetes Linux LLMs Machine Learning Mathematics Model training Physics Pipelines Python Research
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.