Home

Urgent :: Requirement for Senior Site Reliability Engineer, Owing Mills, Hybrid, must be onsite day 1 (hybrid)-need local at Mills, Wyoming, USA
Email: [email protected]
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=2236170&uid=

From:

Akhand Singh,

Adventa Tech Inc

[email protected]

Reply to:   [email protected]

I hope you are doing great,

Please let me know if you have any consultants for that requirement.

Face to Face

Interview required for this role

Must need local candidate of MD

Title : Senior Site Reliability Engineer

Location: Owing Mills, Hybrid, must be onsite day 1 (hybrid)need local

Visa: No H1b/CPT/OPT

MOI: Skype + F2F

They will need to use their own device to connect via citrix. 

he Technology Engineering team is looking for an experienced Site Reliability Engineer to join us as we are reimagining the production application and infrastructure management. The team is responsible for engineering scalable and resilient hybrid cloud solutions (both AWS and On-prem). You will be responsible for creating tooling and software that monitors and improves the reliability of our systems. In this role, you will research problems, evaluate modern technologies, create prototypes, develop (integrated process, automation, define standards) observability tooling, and provide SRE consulting on complex projects.

Requires specialized in-depth knowledge and expertise in your own job discipline, Amazon Web Services (AWS) platform and/or other cloud-based platforms and deep experience in integrating related disciplinary knowledge

Works independently, receives minimal guidance

Accountable for work of yourself and others; sets standards around which others will operate

Proactively identifies problems and can present and implement solutions to these problems

Role summary and job responsibilities

Design and implement highly automated systems/services that ensure the availability, reliability, and scalability of infrastructure and applications.

Build and maintain monitoring and alerting to provide timely feedback on the performance and health of systems, network, and applications.

Design and implement automation tools to reduce manual toil, streamline repetitive tasks, and enhance overall operational efficiency.

Design and build Service Level Indicator (SLIs) metrics, including but not limited to Service Level Objectives (SLOs), Error Budget, Burn Rate Alerts

Work closely with development teams to embed reliability best practices into the software development process. Provide mentorship and training to cross-functional teams on SRE principles, encouraging a shared responsibility for the reliability of our services.

Collaborating with our support, operations and engineering teams to investigate and troubleshoot complex problems

Observe and monitor systems to make sure you have the insight into system performance, health, availability and what is happening internally in the system.

Understands what to monitor based on the system(s) you are managing, how the monitoring data is stored, and how to look at the data to make determinations about future actions.

Participates in continuous improvement efforts that span multiple multi-functional domains and informs the generation of new standards

Be a part of an on-call rotation, continuously enhance automation & documentation, and mentor others on the standard methodologies of infrastructure automation to encourage adoption.

Able to overcome differences of opinion and drive team alignment around a specific goal or solution

Holds associates and teams accountable for adhering to practices and policies

Business knowledge

Demonstrates deep knowledge of products/flows within supported businesses

Decomposes the most complex problems into discrete work units.

Identifies non-obvious relationships and anomalies often overlooked by others.

Balances strategic and pragmatic concerns when solving problems.

Makes sound decisions with limited facts or resources.

Makes decisions that are cognizant of the firms broader business strategy.

Demonstrates deep knowledge of products/flows within the businesses they support.

Articulates broader business concerns and/or regulatory landscape, including key risks and controls (e.g., GDPR, MIFID, SOX).

Requirements

Strong experience with Monitoring and Alerting tools such as Prometheus, Grafana, New Relic

Experience in container orchestration solutions in AWS with ECS, Fargate

Docker container development experience

Scripting languages like Python, Groovy, Power, Bash, Perl etc.

Skilled in building and maintaining dashboards using tools like Grafana, Prometheus and Statsd to provide critical insights

Worked with Service Reliability Engineering team to design SLI and SLO for respective applications

Strong experience with AWS cloud infrastructure and container orchestration operating in a GitOps framework

A solid core foundation in infrastructure and systems engineering including Unix/Linux compute, networking, storage, and monitoring stacks.

Have experience using automation tools such as Terraform, Ansible

Excellent written and oral communication skills

Strong interpersonal skills, adaptable and able to learn quickly

Off-hour implementations are required

Ability to build positive working relationships with the business contacts, within our IT team, and other IT departments

Ability to identify tasks and help develop project plans for medium and large-scale projects

Preferred

College degree in computer science or related technical field with 7+ years of systems design, programming, implementation, and integration experience

3+ years of experience within the Amazon Web Services platform

AWS, Kubernetes Certifications

Unfeigned Regards,

  |

Akhand Singh

Adventa Tech Inc.

Cell: +1 571 463 1138

E-Mail:

[email protected]

Website:

www.adventatech.com |

Disclaimer
:
This communication, along with any documents, files or attachments, is intended only for the use of the addressee and may contain confidential information. If you are not the intended recipient, you are hereby notified that any dissemination, distribution or copying of any information contained in or attached to this communication is strictly prohibited, To remove your email address permanently from future mailings, please send REMOVE to

[email protected]
.

  |

Keywords: information technology Maryland
Urgent :: Requirement for Senior Site Reliability Engineer, Owing Mills, Hybrid, must be onsite day 1 (hybrid)-need local
[email protected]
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=2236170&uid=
[email protected]
View All
05:24 AM 07-Mar-25


To remove this job post send "job_kill 2236170" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]


Time Taken: 2

Location: ,