Senior Site Reliability Engineer

13 Oct | Talent Software Services | Puerto Rico

Job Summary: Talent Software Services is in search of a Senior Site Reliability Engineer for a contract position in VA (Remote). The opportunity will be six months with a strong chance for a long-term extension.

Position Summary: As a Senior Site Reliability Engineer, you will research, design, and implement solutions to attain high-quality process automation within the Information Technology division and across the client's Career Access business units. You have designed, developed, and implemented solutions that support business functionality as well as the underlying infrastructure required to run and deploy those solutions.

You must possess hands-on technical skills and experience with Amazon Web Services and continuous delivery systems. As an engineer, you must have excellent written and oral communication skills and adapt to the changing needs of the department and the organization. You must have experience building and maintaining highly effective relationships with team members and multiple stakeholders across multiple projects. In this position you will apply the knowledge you gained while earning your degree in Computer Science, Electrical Engineering, or a related engineering field.

Primary Responsibilities/Accountabilities:
- Design, develop, and implement automated solutions, based on a set of standards and processes that establish consistency across the enterprise, to reduce risk and promote efficiencies in support of the organization's goals and objectives.
- Take responsibility for the quality of your work: develop and implement quality criteria and the associated validation methods so that every deliverable meets the quality levels our customers expect; use quality management standards and metrics to maintain those levels; seek new approaches and techniques to improve quality; and analyze the impact of quality control and quality assurance on project performance.
- Actively review custom and COTS Observability products and implement improvements seen within the industry to drive continuous improvement of the Observability products' efficiency, scalability, and quality.
- Manage and resolve incidents, conduct incident reviews, and manage problems with a focus on proactivity.
- Incident management: act in key response roles during major incidents.
- Participate in an on-call rotation with other team members.

- Participate in the post-mortem review of incidents for Root Cause Analysis (RCA).
- Participate in system design consulting, AWS platform management, and capacity planning.
- Provide regular support (coaching and mentoring) for teammates' work activities.
- Use product SLAs and enterprise standards and metrics to maintain product availability and user experience quality levels, seek innovative approaches and techniques to improve quality levels, and analyze the impact of product changes on application performance and availability.
- Design and develop tools and processes that improve infrastructure reliability and allow for monitoring and reporting.
- Write complex code, build infrastructure as code, work with serverless cloud environments, and build the automated toolsets needed to support the continuous metric collection pipeline.
- Integrate COTS products across the continuous delivery pipeline to provide a comprehensive automated system, from epic definition through development, testing, and deployment of the client's applications within our data center and Amazon.
- Be a hands-on engineer who leads by doing.
- Take responsibility for creating design specifications, unit testing, and preparing technical documentation.
- Develop solutions from business initiation through operational integrity.

- Support the development of Observability standards by creating templates that make Observability capabilities easier to use and increase their adoption.
- Foster and build a community of practice for collective learning of the Observability tools and systems across all development teams.
- Be in an on-call rotation to respond to incidents that impact client availability, and support Development team engineers with customer-related incidents.
- Use your on-call experience to analyze incidents and prevent similar ones from happening.

Qualifications:
- A bachelor's degree, preferably in Computer Science, Engineering, or MIS.
- 5-8 years of experience in software systems, programming, and infrastructure development and administration.
- Preferred: strong, proven experience as a DevOps engineer in a scalable production environment, administering one or more products in the Atlassian suite (Jira, Confluence, Bitbucket, Crowd).
- Ability to operate in a high-pressure environment, quickly troubleshoot complex issues, and successfully handle multiple priorities.
- Strong practical Linux-based systems administration skills and scripting experience in a cloud-based environment.
- Experience with the Node.js/JavaScript programming language and its frameworks and design patterns.
- Experience working with APIs and microservices.
- Working knowledge of IP networking, VPCs, DNS, load balancing, and firewalls.
- Experience building infrastructure as code using AWS CDK, CloudFormation, or similar scripting techniques.
- Experience managing releases into production using AWS CodePipeline.
- Expertise with Git and Bitbucket, including branching workflows.
- Experience with monitoring suites (e.g., New Relic, Splunk, Sumo Logic) is a plus.

- Excellent interpersonal and collaboration skills, with the ability to work with a diverse set of colleagues.
- Strong decision-making, problem-solving, critical-thinking, and testing skills.
- Self-starter with the ability to set priorities, work independently, and attain goals.
- An ethos of continuous improvement and an interest in learning new things.
- Strong ability to understand and internalize the big picture and broader implications.

If this job is a match for your background, we would be honored to receive your application!

Providing consulting opportunities to TALENTed people since 1987, we offer a host of opportunities including contract, contract to hire and permanent placement. Let's talk!

Collections Cashier - Puerto Rico, Caquetá

A major company in the financial sector is seeking a collections cashier (8 hours) for its team in Puerto Rico, Caquetá, who will be responsible for handling money, customer service, and administrative processes; it is important to have at least [...]
Puerto Rico
12 Oct

Welder - Puerto Rico / Caquetá

At KMA CONSTRUCCIONES we are looking for a WELDER to work in PUERTO RICO, CAQUETÁ. JOB DESCRIPTION: Ensure the rebuilding and repair of certain machine parts by means of welding. EDUCATION: Basic [...]
Puerto Rico
15 Oct

Spanish VRI/OPI Interpretation Vacancy (Puerto Rico)

We are hiring Spanish-English OPI/VRI interpreters. If you are passionate about languages and interpretation, we need you! We are looking for professional remote interpreters who want to join an international company and be a great help [...]
Puerto Rico
16 Oct

12411 Comprehensive Health Promoter - Puerto Rico, Meta

A major company in the health sector requires Comprehensive Health Promoters for Puerto Rico, Meta. Nursing Assistant Technician, Public Health, Health Administration, Health Information, Community Health, or University Student [...]
Puerto Rico
21 Oct