Oracle Senior Site Reliability Engineer (SRE) in Bellevue, Washington
Job Identification: 111577
Job Category: Product Development
Job Locations:
Bellevue, WA, United States
Burlington, MA, United States
Austin, TX, United States
About Oracle GBU Cloud Services
Oracle GBU Cloud Services (GBUCS) is responsible for managing cloud infrastructure for hundreds of SaaS products developed internally at Oracle. In particular, our team is responsible for building, maintaining, and operating the Monitoring-as-a-Service portfolio of monitoring solutions for the entire GBU organization. Our solutions monitor several million entities at high frequency, spanning all layers of the GBUCS stack: storage, network, server, and application services. The monitoring platform is a highly distributed, 24x7 system that collects, transports, and processes time-series and log events, and provides utilities to alert on and visualize massive volumes of telemetry data accurately and reliably.
About the Job
Do you want the challenge of working in a cutting-edge environment, solving technical problems, identifying improvements, and implementing your recommendations?
This role lets you design, develop, troubleshoot, and debug software for controlling and managing distributed services, multi-level abstractions, end-to-end automation, monitoring and telemetry, and all activities needed to deliver infrastructure services via code. If you have hands-on experience with analyzing, designing, testing, and implementing solutions, this key role might be for you.
What You'll Do
As an SRE on the monitoring services team, you will design and maintain the operational processes for hosting, processing, transforming, and analyzing telemetry. Your first mission will be to work closely with our software developers and cloud architects to define a sustainable operational model for monitoring services. This includes mechanisms to scale the systems by way of easy-to-use tooling and automation. You will work in concert with developers and other members of the organization to evolve systems and products for better scalability, reliability, and monitoring. Some of the responsibilities include:
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
Work as a member of the development team and share full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
Articulate technical characteristics of services and technology areas and guide development teams to engineer and add capabilities to internal Oracle services.
Understand and explain the effect of product architecture decisions on distributed systems.
Professional curiosity and a desire to develop a deep understanding of services and technologies.
Analyze best practices and emerging concepts in SRE.
What You Need to Have
You need to have the following knowledge, skills, and experience:
Education and Work Experience
Bachelor’s/Master’s degree in Computer Science or a similar field.
Infrastructure as code experience
Monitoring, orchestration, and configuration management experience in cloud infrastructure
Expertise in container management tools and solutions.
Expertise in building highly-scalable distributed solutions.
Experience implementing a continuous integration (CI) and continuous deployment (CD) pipeline with working knowledge of container management and orchestration tools.
Track record of delivering assigned projects on time with high quality, using Agile, DevOps, and SRE practices and toolsets.
Demonstrated experience mentoring team members, providing architectural guidance, and leading detailed code reviews.
Preference for demonstrated practical experience with the following technologies:
Programming skills in Java and/or Python
Expert-level skills in distributed computing.
Experience using streaming services (e.g., Kafka).
Hands-on experience with configuration management frameworks (Terraform, Chef, Ansible, etc.).
Know-how of scripting/automation languages (Perl, Python, Bash, etc.).
Experience with infrastructure or network automation tools and protocols (e.g., Chef, Ansible).
Hands-on experience with and understanding of container-based deployments.
Experience with large-scale code optimization and performance tuning.
Experience with development and use of large-scale monitoring systems.
Soft Skill Qualifications:
Good written and oral communication skills. You should be able to clearly convey your thoughts and ideas to others.
Committed self-starter who enjoys working in a collaborative environment with personnel at all levels in the organization.
Design, develop, troubleshoot, and debug software programs for databases, applications, tools, networks, etc.
As a member of the software engineering division, you will take an active role in the definition and evolution of standard practices and procedures. You will be responsible for defining and developing software for tasks associated with the developing, designing and debugging of software applications or operating systems.
Work is non-routine and very complex, involving the application of advanced technical/business skills in the area of specialization. Leading contributor individually and as a team member, providing direction and mentoring to others. BS or MS degree or equivalent experience relevant to the functional area. 7 years of software engineering or related experience.
If you are a Colorado resident, please contact us or email us at firstname.lastname@example.org to receive compensation and benefits information for this role. Please include this Job ID: 111577 in the subject line of the email.
Innovation starts with inclusion at Oracle. We are committed to creating a workplace where all kinds of people can be themselves and do their best work. It’s when everyone’s voice is heard and valued that we are inspired to go beyond what’s been done before. That’s why we need people with diverse backgrounds, beliefs, and abilities to help us create the future, and are proud to be an affirmative-action equal opportunity employer.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans status, age, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.