Software Engineer II - Bing Platform - Microsoft AI

Redmond, Washington, United States

Microsoft

Explore Microsoft products and services for your home or business. Shop Microsoft 365, Copilot, Teams, Xbox, Windows, Azure, Surface, and more.

Why work at Bing Platform?


The Heterogeneous Computing Architecture team within Microsoft’s Bing Platform Engineering is seeking a highly motivated and independent Software Engineer II to join us in pioneering the development of world-leading AI infrastructure at scale. By leveraging the massive parallelism offered by current and future computing hardware, our team accelerates various ML workloads, including large-scale inference and training. We apply accelerated computing to challenging applications that are crucial to Microsoft and our customers, including Bing search.

Our responsibilities encompass developing, optimizing, deploying, and maintaining our software, infrastructure, and automated optimization tools. Collaborating with Microsoft model developers and data scientists to deploy their models provides us with the unique opportunity to work across the entire DNN stack, from cutting-edge DNN models, techniques, and frameworks to diverse accelerator architectures. This innovative development occurs within the context of a live, global-scale distributed system that impacts millions of users daily.

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

  • Understand different Deep Neural Network (DNN) workloads (e.g., Transformers and LLMs) and map them efficiently to the underlying system and hardware architecture.
  • Apply state-of-the-art AI and machine learning techniques to build and optimize the system to continuously achieve improved performance and user experience.
  • Develop ML system infrastructure in C/C++/C# and Python.
  • Build end-to-end optimized ML inference solutions, working across ML frameworks, runtimes, and distributed hardware devices.
  • Design and build model deployment processes leveraging automation and scripts.
  • Participate in or lead design work, and write production-quality code.
  • Deploy and maintain software to meet significant reliability and service level requirements.

Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • Familiarity with neural networks, ML frameworks (e.g., PyTorch and TensorFlow), and optimization techniques for inference and training.

Other Requirements:

 

Candidates must be able to meet the Microsoft, customer, and/or government security screening requirements for this role. These requirements include, but are not limited to, the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • Experience in GPU programming (e.g., CUDA) and familiarity with CUDA libraries, or in FPGA hardware and application development.
  • Experience with compilers and with automation tools and workflows.
  • Solid software development experience, particularly in high-performance, distributed, and fault-tolerant systems.

Software Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. A different range applies to specific work locations within the San Francisco Bay area and New York City metropolitan area, where the base pay range for this role is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until December 20, 2024.

 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.

If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

 

#Bing

Tags: Architecture, Computer Science, CUDA, Engineering, FPGA, GPU, Java, JavaScript, LLMs, Machine Learning, ML infrastructure, Model deployment, Python, PyTorch, Security, TensorFlow, Transformers

Perks/benefits: Career development, Medical leave

Region: North America
Country: United States
