Site Reliability Engineer Job at Infosys, Washington DC

azFUVXlPZUhCamhLMDU0aElBR0pybVMwU1E9PQ==
  • Infosys
  • Washington DC

Job Description

Required Qualification:

  • Bachelor’s degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
  • At least 2 years of Information Technology experience.
  • SRE Mindset in Production support : Proactive issue identification using observability tools.
  • Skilled in using different monitoring & observability tools to track system performance
  • Incident commander: Ability to diagnose complex issues and actively drive incident calls working with technical, product SMEs, and Tier 2 SREs.
  • Experience in Splunk (including Splunk APM and Splunk O11y), AppDynamics,
  • Experience in DB, Network, Linux / Unix, Kubernetes
  • Experience in APM, NMON , Wireshark usage and analysis

Preferred Qualification:

  • Knowledge of Grafana, RedMetrics, 1000Eyes
  • Knowledge of VMs, Load balancers, Firewalls, API Gateways,
  • Knowledge of Containerization, Docker, AWS, PCF, GCP, ServiceNow (including AIOps, tools for Self-Heal and automated playbooks)
  • Experience in UEM and synthetic monitoring tools
  • System Administration: Strong knowledge of infrastructure, including command-line tools and system internals. (Kubernetes triage, linux administration)
  • Networking: Understanding of network protocols, configurations, and troubleshooting. (nmon, Wireshark)
  • Cloud Computing: Experience with cloud understanding, including cloud architecture (on-perm and public) and services. (AWS and Azure)
  • Application Management: Familiarity with continuous integration and continuous deployment processes and tools.
  • Advanced programming knowledge: Experience with triaging issues with application code. (Java, Python)
  • DB troubleshooting: Familiarity in troubleshooting issues with traditional and NoSQL databases (eg: Oracle, SQL Server, MySQL, MongoDB, Cassandra)
  • Monitoring and Observability: Skills in using monitoring tools to track system performance and detect issues including all the backend systems, database, and API's (Splunk, AppDynamics, Splunk o11y, Open Telemetry)
  • Ability to diagnose and resolve complex issues quickly and efficiently
  • Collaboration: Strong communication skills to work effectively with cross-functional teams
  • Adaptability: Flexibility to handle changing priorities and technologies
  • Attention to Detail: Precision in managing configurations and deployments to avoid errors
  • Communication : Excellent communicator who could interact with Director/Sr. Director and above.
  • Production support activities including proactive identification of issues leveraging observability tools with the aim of reducing MTTD and MTTR
  • Coordinate all activities required to lead incident triage in compliance with SLAs and OLAs. Corelating inputs from various dashboards & tools to drive resolution.
  • Flexibility to work in rotation (as and when needed)

Job Tags

Permanent employment,

Similar Jobs

UW Health in Northern Illinois

Call Center Customer Service Representative Job at UW Health in Northern Illinois

 ...Additional components of compensation may include:Evening & night shift differentialOvertimeOn-call payBenefits information: At UW Health in northern...  .../ABILITIES: Minimum one year experience in a call center environment; medical clinic or hospitality preferred.Minimum... 

HCA Healthcare

Regulatory and Accreditation Services Manager Job at HCA Healthcare

Introduction Do you want to join an organization that invests in you as a(an) Mgr RAS? At HCA Healthcare, you come first. HCA Healthcare has committed up to $300 million in programs to support our incredible team members over the course of three years. Benefits ...

TT&E Iron & Metal, LLC

Truck Driver Job at TT&E Iron & Metal, LLC

This Truck Driver position involves driving various types of trucks (roll off, lugger truck, tractor trailer) transporting material to/from...  ...locations. Qualifications ~ Have experience driving straight trucks, Roll offs, Lugger, Tractor-Trailers ~ Must be at... 

Casa Esperanza

Bilingual Peer Recovery Coach Job at Casa Esperanza

 ...homelessness; and achieve health and wellness through comprehensive, integrated care.Job summary statement: The role of the Peer Recovery Coach is to provide non-clinical services intended to aid individuals in establishing and maintaining individual recovery from... 

Clearwater Pool Management, LLC

Lifeguard Job at Clearwater Pool Management, LLC

 ...pool operation experience, Clearwater has on-staff Professional Pool Operators (PO), Aquatic Facility Operators (AFO), and Certified Lifeguard Instructors (LGI). As a family-owned and operated business, we serve the Tidewater Area. Role Description This is a part-...