Full Time Job

Site Reliability Engineer

PlayStation

Adelaide, Australia 1 day ago
  • Paid
  • Full Time
  • Senior (5-10 years) Experience
Job Description
Sony Interactive Entertainment

Site Reliability Engineer

As part of Sony Interactive Entertainment, the Future Technology Group (FTG) is leading the cloud gaming revolution, putting console-quality video games on any device, from TVs to consoles to mobile devices and beyond.
Our Site Reliability Engineering team plays a significant role in delivering on the promise of a great cloud gaming experience to our customers. We do this by influencing design and operational decisions towards the overall stability of the gaming service. Our SREs focus on three main things: overall ownership of production, production code quality, and deployments. The successful candidate will be self-directed and able to participate in the way we make decisions at different levels.

We expect our SREs to have opinions on the state of our service and to provide critical feedback during different phases of the operational lifecycle. We are engaged throughout the software development lifecycle, ensuring operational readiness and stability.

Requirements
• Minimum of 5 years' working experience in a Software Development and/or Linux Systems Administration role.
• Strong interpersonal, written and verbal communication skills.
• Available to be scheduled in an on-call rotation.

Skills & Knowledge
• Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
• Development experience in one or more of the following programming languages:
  • Python (preferred)
  • Bash, Go, Java, C++, or Rust
• In addition, experience with at least 3 of the following topics:
  • Distributed data storage at scale (Hadoop, Ceph)
  • NoSQL at scale (MongoDB, Redis, Cassandra)
  • Data aggregation technologies (Elasticsearch, Kafka)
  • Scaling and running traditional RDBMS (PostgreSQL, MySQL) with High Availability
  • Monitoring & Alerting (Prometheus, Grafana), and Incident Management toolsets
  • Kubernetes and/or AWS (deployment and management)
  • Software Distribution (package management and distribution at scale)
  • Configuration Management (Ansible, SaltStack, Puppet, Chef)
  • Software performance analysis and load testing (QA or SDET experience is a plus)

Responsibilities
• Lead team technical discussions, especially around ongoing improvements in Reliability and Scalability
• Be involved in creating High Level Designs (HLDs) for new products and platforms
• Mentor junior SRE staff and enable them for success
• Lead incident response and post-mortem activities within your assigned service team
• Work with other Engineers in a cross-functional team to prioritise reliability improvements to address technical debt and toil
• Contribute code to improve reliability
• Implement automation to reduce ongoing toil

Equal Opportunity Statement:

Sony is an Equal Opportunity Employer. All persons will receive consideration for employment without regard to gender (including gender identity, gender expression and gender reassignment), race (including colour, nationality, ethnic or national origin), religion or belief, marital or civil partnership status, disability, age, sexual orientation, pregnancy, maternity or parental status, trade union membership or membership in any other legally protected category.

We strive to create an inclusive environment, empower employees and embrace diversity. We encourage everyone to respond.

PlayStation is a Fair Chance employer and qualified applicants with arrest and conviction records will be considered for employment.

Company Profile
PlayStation

Recognized as a global leader in interactive and digital entertainment, Sony Interactive Entertainment (SIE) is responsible for the PlayStation® brand and family of products and services.