Software Engineer - AI

San Jose, California, United States

Supermicro

The premier provider of advanced Server Building Block Solutions® for 5G/Edge, Data Center, Cloud, Enterprise, Big Data, HPC and Embedded markets worldwide.

Job Req ID: 26294

About Supermicro:

Supermicro® is a top-tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary:

Supermicro is seeking an experienced and exceptional Application Software Engineer to work on web-based applications for business process automation. This is a key role that will give you the opportunity to expand your existing knowledge in programming.

Essential Duties and Responsibilities:

Includes the following essential duties and responsibilities (other duties may also be assigned):
• Integrate open-source LLMs (e.g., Llama 3.2 90B) with open-source vector databases, search indexing, and contextual query management
• Design and implement Retrieval-Augmented Generation (RAG) pipelines, incorporating embedding generation, vector search, re-ranking, and contextual retrieval techniques
• Optimize search and retrieval systems using Elasticsearch and vector databases
• Develop and deploy an intelligent AI Agent to assist customers in selecting and purchasing the correct servers based on their unique requirements and use cases
• Integrate AI Agents with backend databases, recommendation engines, and decision-making pipelines
• Design workflows for task automation, contextual reasoning, and real-time recommendations
• Design scalable web scraping pipelines using tools like Scrapy, Selenium, and BeautifulSoup to acquire structured and unstructured data
• Process and clean scraped data to integrate it seamlessly into databases and knowledge retrieval systems
• Design and manage relational databases (e.g., PostgreSQL, MySQL, MS SQL) for structured data storage and retrieval
• Work with document-based databases (e.g., MongoDB) for handling unstructured data sources
• Optimize database queries and structures to ensure efficient system performance
• Design, test, and optimize prompts for large language models (LLMs) to improve response accuracy, context management, and task completion
• Experiment with prompt tuning and contextual input adjustments to enhance LLM performance in specific use cases
• Extract, clean, and preprocess data from various sources, including relational databases, document databases, PDFs, and images
• Write code for parsing and processing non-text data formats
• Develop Python-based web services using a popular framework to enable backend APIs and real-time interactions
• Create interactive dashboards for data visualization and system control using Streamlit
• Collaborate with frontend developers to ensure seamless integration between APIs and user-facing interfaces
• Deploy system components in Linux environments using Docker for scalability and portability
• Optimize system performance for GPU-intensive tasks, ensuring efficient resource utilization
• Identify common user queries, challenges, and areas for improvement
• Test the system regularly from the user’s perspective to validate its performance and accuracy
• Analyze user feedback and satisfaction, iterating on system design, prompts, and workflows to improve response quality and relevance
• Collaborate with cross-functional teams to implement enhancements based on user behavior and feedback trends
• Handle intricate, repetitive, or time-consuming tasks, such as dataset cleaning, normalization, and troubleshooting
• Ensure data accuracy and reliability, understanding that these foundational tasks are critical for system success

Qualifications:

• BS or above in Computer/Information Science or other relevant degree 
• Minimum 5 years of working experience in software development preferred
• Programming skills in C#, SQL, Java, JavaScript, AJAX
• C# ASP.NET project experience is a plus

Salary Range

$147,000 - $168,000 

The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

EEO Statement

Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.

Tags: APIs Big Data Data visualization Docker Elasticsearch GPU Hadoop HPC Java JavaScript Linux LLaMA LLMs MongoDB MS SQL MySQL .NET Open Source Pipelines PostgreSQL Python RAG RDBMS Selenium SQL Streamlit Unstructured data

Perks/benefits: Equity / stock options Salary bonus

Region: North America
Country: United States
