Utkarsh pankaj - Site Reliability Engineering SRE |
[email protected] |
Location: Alpharetta, Georgia, USA |
Relocation: Yes |
Visa: H1B |
With 10 years of strong experience in Site Reliability Engineering (SRE), DevOps, AWS, and Build/Release Engineering, expertise has been developed in Software Configuration Management (SCM), Build/Release Management, Continuous Integration, and Continuous Delivery using a wide range of tools.
Skilled in configuring and deploying infrastructure and applications in the cloud using AWS services such as EC2, S3, RDS, EBS, VPC, SNS, IAM, Route 53, Auto Scaling, CloudFront, CloudWatch, CloudTrail, CloudFormation, OpsWorks, and Security Groups, with a focus on fault tolerance and high availability. Strong understanding of SCM processes, including compiling, packaging, and deploying applications. Proficient in Continuous Integration and Deployment methodologies using Jenkins, SonarQube, and GitLab. Use Infrastructure as Code (IaC) and CI/CD pipelines to automate deployment processes using Google Cloud Platform. Skilled in troubleshooting production issues related to CPU resource utilization, application performance, and code logic. Solid knowledge of Object-Oriented Design and Programming concepts in Java. Experienced in scripting with Shell, Python, C, Bourne, and Perl for maintaining and developing scripts, as well as troubleshooting. Proficient in using build automation tools like Jenkins and Maven to implement end-to-end automation and working experience with Dynatrace Hands-on experience with tools such as POSTMAN and SOAP in order to test the web-service. Utilized AWS CloudWatch to monitor environments for operational and performance metrics during load testing. Extensively worked with Docker for virtualization, deploying and securing applications for streamlined Build/Release Engineering processes. Describe the testing techniques you would use (e.g., black-box, white-box, boundary value analysis) to effectively cover different scenarios Skilled in creating Docker containers from scratch and leveraging Linux Containers and AMIs, along with Dockerfiles. Managed Docker containers with Kubernetes, automating container maintenance and working with REST APIs. Utilized Terraform for managing AWS Infrastructure as Code (IaC). Designed scalable and reliable systems on the Google Cloud Platform which includes providing efficiency such as Compute Engine, App Engine and Kubernetes. Integrated machine learning models into production environments using CI/CD pipelines, leveraging AWS services, Kubernetes, and Docker for automated deployment and monitoring. Collaborated with data scientists and developers to streamline model versioning, testing, deployment, and monitoring, ensuring the smooth transition of models from development to production. Leveraged AI-driven monitoring tools, such as Splunk and AWS CloudWatch, to automate incident detection, root cause analysis, and performance optimization, enhancing system reliability and operational efficiency. Actively mentored junior engineers, providing guidance on best practices for DevOps, AWS infrastructure, and Build/Release Engineering and Linux environment. Worked closely with development and operations teams to communicate chaos experiment results and coordinate necessary mitigation strategies. Created dashboards for log analysis and visualization using Prometheus and Grafana. Advocated for a culture of embracing failure and continuous improvement through chaos engineering practices Provided 24x7 production support, including on-call and weekend shifts. Experienced in troubleshooting, backup, and recovery processes. Utkarsh pankaj 4694989595 Keywords: cprogramm continuous integration continuous deployment artificial intelligence sthree Keywords: cprogramm continuous integration continuous deployment artificial intelligence sthree |