Looking for Site Reliability Engineer: Remote (no CPT), Remote, USA

http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=164253&uid=

From:
Aakanksha Singh,
RCI
aakanksha@rconsultinginc.com
Reply to: aakanksha@rconsultinginc.com

Role: Site Reliability Engineer

Location: Fully remote

Duration: 6+ months

Visa: Any

The Production Infrastructure & Engineering (PI&E) organization provides the essential platforms and infrastructure hosting solutions that power EA's live services. Our charter is to make EA's games and services available to all players anytime and anywhere. To do this, we focus on the high availability of infrastructure, primary services, and studio services. We aim to help developers experiment and build new games quickly with on-demand infrastructure services and workflows that promote rapid development in the cloud. In all of this, we focus on being there for players where and when they want to play.
As a Site Reliability Engineer, your role covers the entire lifecycle of a product, from helping developers with architecture and delivery to on-call incident response and triage. You will be responsible for on-prem and cloud resources and should have a good understanding of cloud infrastructure fundamentals.

Responsibilities:

You will design and architect distributed systems in the cloud and understand how to move systems from on-prem data centers to the cloud.
You will create monitoring, alerting and dashboarding solutions that improve visibility into EA's application performance and business metrics.
You will develop and troubleshoot distributed, large-scale production systems spanning on-prem and cloud-based hosting.
You will perform root cause analysis and post-mortems with an eye towards future prevention.
You will use automation technologies to ensure repeatability, eliminate toil, reduce mean time to detection (MTTD) and mean time to resolution (MTTR), and repair services.
You will design CI/CD pipelines.
You will produce documentation and support tooling for online support teams.

Qualifications:

Experience monitoring infrastructure and application availability against service level indicators (SLIs) and service level objectives (SLOs).
Experience with virtualization, containerization, cloud computing (AWS preferred), VMware ecosystems, Kubernetes, or Docker.
Knowledge of Elasticsearch, Prometheus, Graphite, and Kafka.
Systems Administration experience, including an understanding of *nix.
Network experience, including an understanding of standard protocols/components.
Automation and orchestration experience including Chef, Puppet, Terraform, Packer, or Jenkins.
Experience writing code in Python, Golang, and/or Java.
Experience working with distributed systems.

Keywords: continuous integration, continuous deployment

02:45 AM 23-Nov-22
