Summary

THIS ROLE DOES NOT SUPPORT SPONSORSHIP CANDIDATES

As a Site Reliability Engineer (SRE) Level II, you will play a key role in maintaining the availability, scalability, and performance of critical infrastructure and services. You will be responsible for building and automating solutions that enhance system reliability and support continuous delivery. In this role, you will handle more complex operational tasks and incidents, provide mentorship to junior SREs, and collaborate with development teams to ensure systems are designed for reliability from the ground up.

Responsibilities

Incident Management
- Lead troubleshooting efforts for high-impact production issues, and provide detailed root cause analysis (RCA) and preventative measures.
- Participate in on-call rotations, acting as an escalation point for Level 1 SREs during major incidents.
- Manage complex incidents and ensure service uptime.

Automation & Infrastructure as Code (IaC)
- Develop and maintain automation scripts and infrastructure using tools like Terraform, Ansible, or CloudFormation.
- Implement automation solutions to eliminate manual tasks and improve system reliability, scalability, and performance.

Performance & Scalability
- Analyze system performance and recommend optimizations for scalability and reliability.
- Support capacity planning by monitoring system metrics, traffic patterns, and usage trends to predict future resource needs.

System Design & Architecture
- Collaborate with software engineering teams to influence the design of new services and applications, ensuring they are scalable, reliable, and resilient from the start.
- Contribute to architectural decisions, ensuring alignment with best practices in fault tolerance, redundancy, and recovery.

Monitoring & Observability
- Build and maintain robust monitoring, alerting, and observability solutions to proactively detect and resolve issues before they impact end users.
- Optimize existing monitoring tools (e.g., Prometheus, Grafana, Datadog, Dynatrace) and build custom dashboards for better visibility into system health.

Security & Compliance
- Ensure systems and infrastructure are secure, compliant, and aligned with organizational policies and industry best practices.
- Assist with vulnerability management, system patching, and implementing security measures to protect the integrity and availability of services.

Continuous Improvement
- Lead efforts to continuously improve operational processes, tools, and workflows.
- Implement and enforce best practices in deployment, monitoring, and incident management to improve overall system reliability and reduce downtime.

Basic Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
- 3 years of experience in site reliability engineering, DevOps, systems administration, or related roles.
- Proven track record of managing complex infrastructure, troubleshooting production issues, and optimizing system performance.

Preferred Qualifications
- Strong experience with Linux/Unix administration and proficiency in scripting (e.g., Python, Bash, Go).
- 5 years of experience in site reliability engineering, DevOps, systems administration, or related roles.
- Deep understanding of cloud platforms (AWS, Google Cloud Platform, Azure) and related services (EC2, S3, Lambda, Kubernetes, etc.).
- Experience with containerization and orchestration technologies like Docker and Kubernetes.
- Proficiency with monitoring and observability tools such as Dynatrace, Prometheus, Grafana, Datadog, ELK Stack, or similar platforms.
- Strong understanding of networking fundamentals (DNS, TCP/IP), load balancing, and CDNs.
- Experience with CI/CD tools (Jenkins, GitLab CI, CircleCI) and infrastructure automation (Terraform, Ansible, Puppet).
- Familiarity with distributed systems and microservices architecture.
- Excellent problem-solving and troubleshooting skills, especially in diagnosing production issues in high-scale environments.
- Microsoft Office experience.
- Experience working in a multi-platform environment.
- Ability to balance both development and support roles.
- Experience working on projects that involve business segments.
- Strong analytical and troubleshooting skills and excellent communication skills.
- Strong interpersonal skills, a focus on customer service, and the ability to work well with other IT, vendor, and business groups.

Notes

Exempt Status: Yes = not eligible for overtime pay; No = eligible for overtime pay.

Workplace Type: Office

Our Approach to Office Workplace Type: Certain positions outside our branch network may be eligible for a flexible work arrangement. We’re combining the best of both worlds: in-office and work from home. Remote roles will also have the opportunity to come together in our offices for moments that matter. Specific work arrangements will be provided by the hiring team.

Huntington is an Equal Opportunity Employer.

Tobacco-Free Hiring Practice: Visit Huntington’s Career Web Site for more details.

Note to Agency Recruiters: Huntington Bank will not pay a fee for any placement resulting from the receipt of an unsolicited resume. All unsolicited resumes sent to any Huntington Bank colleagues, directly or indirectly, will be considered Huntington Bank property. Recruiting agencies must have a valid, written and fully executed Master Service Agreement and Statement of Work for consideration.

Huntington National Bank