Our Client is hiring a Sysadmin / Site Reliability Operator (SRO) to join our Site Operations team in Buenos Aires. You will help the team keep our customers' applications running at peak performance. You will be the first point of contact for external customers worldwide, and you will help identify, analyze, and resolve first-tier technical issues on large-scale production platforms. You will also assist in modifying and improving our monitoring infrastructure, which generates multiple metrics and graphs every minute from a very diverse environment.
Required Qualifications
Advanced Linux skills and troubleshooting experience in a production environment.
Experience with monitoring, metrics graphing, and alerting services.
Experience in tracking problems with ticketing systems.
Strong communication and teamwork skills.
Strong communication skills in English.
Willingness to learn from others and share knowledge with teammates. The ability to rapidly self-educate on new concepts and tools while actively seeking to expand your knowledge.
Preferred Qualifications
Solid experience managing on-premises and cloud-based infrastructure, particularly AWS.
Good understanding of scripting in Bash, Python, or similar.
Experience managing web servers.
Good understanding of the following tools: Nagios, Grafana, Zabbix, and JIRA.
An understanding of networking concepts, including DNS, routing, load balancers, and firewalls.
Job Duties And Responsibilities
Following incident management procedures in production environments.
Understanding root cause analysis and incident timeline creation.
Creating and maintaining documentation on installations, incidents, and procedures.
Analyzing and troubleshooting large-scale distributed systems.
Monitoring specific metrics for availability, latency, and overall system health.
Developing and implementing new IT infrastructure monitoring.
Benefits
Career Path:
Developing your monitoring skills by using complex systems such as Sensu or Zenoss.
Interacting with Cloud Services from AWS and receiving continuous training and courses from our AWS Specialists.
Using and deploying different applications with Containerization Software such as Docker Engine.
Learning to automate daily tasks using Orchestration Software such as Puppet, Ansible, or Salt.
What We Offer
Onboarding in San Francisco for approximately 3 weeks.
Direct contact with clients and the opportunity to share ideas.
Flexible compensation plan: you can adjust the composition of your compensation according to your needs.
Training and certifications.
Professional growth.
Flexible home office.
Trips to events…and more!
Location: Palermo, Buenos Aires, Argentina (100% Remote)
Shift
Night: 23:00 to 07:00 (Monday to Friday)