Microsoft Corporation Sr. Site Reliability Engineering in Redmond, Washington
We are looking for a senior Site Reliability Engineer (SRE) who is passionate about running world-class services at very large scale. Be part of a fun-loving global team whose mission is to build a product that customers love and to make Azure Data Explorer the technology for analytics.
Azure Data Explorer, a.k.a. Kusto, is a fully managed big data system hosted in Azure that has taken Microsoft and our clients by storm, empowering them to gain insights into their data like never before. Not only has it changed the big data landscape at Microsoft, but it is also powering mission-critical products like Azure Monitor, IoT, and Sentinel, to name just a few. Azure Data Explorer is a very large-scale service running on >158K nodes and growing rapidly. Ingesting trillions of rows and querying them in seconds is the norm in the land of Azure Data Explorer.
We are looking for someone who has technical depth, independence, and cross-team agility, with astute customer focus and a bias towards action. You will have the opportunity to collaborate with engineering counterpart teams and help define the vision of the product as well as improve the architecture on an ongoing basis.
You will be part of a passionate SRE team responsible for running the show and keeping the system working at high availability and performance. The team also provides guidance and consultancy to its customers on how best to overcome their analytical challenges.
This role provides learning, growth, and leadership opportunities while you become an expert on Azure Data Explorer. If this resonates with you, then this is an opportunity you don’t want to pass up.
As an SRE in Kusto, your primary responsibilities will be:
Live Site Management – Almost all of Microsoft has come to rely on Azure Data Explorer (Kusto) to keep its business running. This fuels a lot of passion in our team to keep the systems working at high availability and performance. As an SRE, you will be part of a global team that drives huge-scale live sites 24x7 and is passionate about delivering the best service within Microsoft as well as to external customers.
Customer Focus – Microsoft's core value is Customer First. We in Kusto take pride in that and bring unwavering customer focus and support to help our customers utilize, embed, and build deep solutions on top of Kusto tailored to their needs.
Automation – Because Kusto's scale is huge and is expected to keep growing rapidly, we are committed to delivering automation and tooling that improve our live site management and adhere to a scale-without-scale methodology.
Design – Evaluate and contribute to product and service design and architecture, help shape Site Reliability Engineering strategies, review specifications, and design and improve upon core processes.
Observability – Identify system problems and recommend monitoring solutions and automation to improve processing efficiency and stability.
Provide engineering design across different workloads, including incident and problem management, change management, security, and compliance.
Community Building – Help us build and contribute to a strong Azure Data Explorer community.
5+ years of scripting and programming experience (preferably .NET, PowerShell, Python, C#)
Excellent troubleshooting skills are a must to be successful in this role.
Deep understanding of cloud services is a big plus (preferably Azure)
Strong working knowledge of database as well as big data systems is a plus
Strong understanding of business continuity and disaster recovery (BCDR)
Strong understanding and working knowledge of CI/CD pipelines is a plus
Out-of-the-box, quick, and agile thinking to adapt to a fast-paced and changing environment
Deep knowledge of system design & architecture, and running of complex, large scale online services
Ability to work as part of an on-call rotation is a must
Ability to contribute to multiple projects/demands simultaneously
Ability to work effectively with customers both internal and external to Microsoft is a must
Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form (https://careers.microsoft.com/us/en/accommodationrequest).
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.