

Overview
Microsoft Silicon Cloud Hardware Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox Live, Skype, OneDrive and the Microsoft Azure platform globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide and we are looking for passionate, dedicated engineers to help achieve that mission.
Azure Memory and Storage Center of Excellence (AMS COE) is part of the SCHIE organization focusing on Memory and Storage devices going into the cloud hardware servers. AMS provides memory and storage solutions to Azure, drives memory and storage suppliers to deliver high quality products meeting our requirements. We are looking for trusted engineers with a passion for customer-focused solutions, insight and industry knowledge to architect and specify memory and storage hardware solutions that optimize for quality, reliability, cost, and performance.
As a Principal Hardware Engineer in the AMS Memory Team, you will apply system and DRAM expertise to provide best-in-class support for Azure cloud servers. You will use technical expertise in DRAM and High Bandwidth Memory (HBM) to influence future hardware architecture, ensure proper technology integration, enable relevant telemetry, and monitor memory health in the Azure and Azure AI fleet. To achieve this, you will combine skills in data analysis, hardware debugging, subject-matter expertise, and collaborate closely with other hardware and software engineers at both Microsoft and outside entities worldwide. In these collaborations you will prepare and present data analyses, provide recommendations, develop solutions to address problems, and define requirements for others’ solutions.
Responsibilities
Provide principal-level memory subject-matter expertise and guidance in collaboration forums.
Stay up-to-date on memory-related hardware, capabilities, system-level usage, and industry trends.
Work with memory suppliers and the industry to establish memory requirements and specifications.
Identify and work with teams to isolate available, needed telemetry and enable analysis.
Identify and quantify the value of memory-related quality/reliability initiatives.
Pull, manipulate, and analyze available data for the purpose of debugging, trending, etc…
Identify and drive adoption of processes/tools to ensure positive initiative impact.