Site Reliability Engineer Job at Covetus, Overland Park, KS

  • Covetus
  • Overland Park, KS

Job Description

Job Title: Lead SRE Engineer

Location: Overland Park, KS / Seattle, WA

Duration: Long-term contract

Job Overview:

  • The client is looking for a Lead SRE Engineer.
  • Experience leading SRE triage calls.
  • Ability to resolve tickets and communicate the resolutions to the team.
  • Ability to work across stakeholders and manage cross-stakeholder relationships.

Roles And Responsibilities

  • System Monitoring and Incident Response: SREs are responsible for implementing monitoring solutions to track system health, performance, and availability. They proactively monitor systems, identify issues, and respond to incidents promptly, working to minimize downtime and mitigate impact.
  • Post-Incident Analysis: SREs lead incident response efforts, coordinate with cross-functional teams, and conduct post-incident analysis to identify root causes and implement preventive measures.
  • Continuous Improvement and Reliability Engineering: SREs drive continuous improvement efforts by identifying areas for enhancement, implementing best practices, and fostering a culture of reliability engineering. They participate in post-mortems, conduct blameless retrospectives, and drive initiatives to improve system reliability, stability, and maintainability.
  • Collaboration and Knowledge Sharing: SREs collaborate closely with software engineers, operations teams, and other stakeholders to ensure smooth coordination and effective communication. They share knowledge, provide technical guidance, and contribute to the development of a strong engineering culture.
  • Support and maintain configuration management for various applications and systems.
  • Implement comprehensive service monitoring, including dashboards, metrics, and alerts.
  • Define, measure, and meet key service level objectives covering areas such as uptime, performance, incidents, and chronic problems.
  • Partner with application and business stakeholders to ensure high-quality product development and release.
  • Collaborate with the development team to enhance system reliability and performance.

Qualifications

  • Bachelor’s degree in Information Technology, Computer Science, or related field.
  • Strong knowledge of software development processes and procedures.
  • Strong problem-solving abilities.
  • Excellent understanding of computer systems, servers, and network systems.
  • Ability to work under pressure and manage multiple tasks simultaneously.
  • Strong communication and interpersonal skills.
  • Strong knowledge of programming languages such as Python, Java, and Go.

  • Experience with cloud computing platforms such as AWS, Azure, or Google Cloud.
  • Experience with DevOps tools such as Git, Jenkins, Ansible, Terraform, and Docker.
  • Experience with monitoring tools such as Splunk and Prometheus.

Skills: Problem solving, post-incident analysis, incident response, system monitoring, service monitoring, monitoring tools (Splunk, Prometheus), cloud computing (AWS, Azure, Google Cloud), key service level objectives, site reliability engineering, reliability engineering, configuration management, continuous improvement, software development processes, DevOps practices, DevOps tools (Git, Jenkins, Ansible, Terraform, Docker), Kubernetes, programming (Python, Java, Go, C/C++, Ruby, JavaScript).

Job Tags

Contract work
