Site Reliability Engineer SRE Job at SOMERSET STAFFING, Washington DC

akxmVnhHVGl0QkU5K0pjZXd3bFFYWW8z
  • SOMERSET STAFFING
  • Washington DC

Job Description

Randstad is seeking a Site Reliability Engineer for a high-impact role with a premier client based in Washington, DC . In this position, you will bridge the gap between development and operations by applying a software engineering mindset to system administration and infrastructure. You will be responsible for ensuring the scalability, performance, and high availability of cloud-based services across AWS and Azure environments. By leveraging Infrastructure-as-Code, advanced observability with Dynatrace, and SRE principles like error budgets and SLOs, you will drive operational excellence and lead incident response efforts for mission-critical applications.

Key Responsibilities
  • Deployment & Automation: Architect and manage CI/CD pipelines (GitHub Actions, AWS CodePipeline) and automate global infrastructure using Terraform, CloudFormation, or CDK.
  • Performance & Capacity: Drive cost-optimization initiatives, manage auto-scaling thresholds, and execute resiliency/performance testing to ensure system durability.
  • Incident Management: Act as a primary on-call responder using ITIL frameworks and ServiceNow; develop Root Cause Analysis (RCA) documentation and maintain knowledge bases.
  • Observability & Monitoring: Implement distributed tracing and optimize monitoring via Dynatrace and Kibana to create advanced dashboards and anomaly detection.
  • Reliability Engineering: Define and monitor SLIs and SLOs while managing error budgets to balance feature velocity with system stability.
  • Security & Compliance: Oversee service accounts, manage digital certificates, and execute rapid remediation for security incidents.
Qualifications
  • Education: Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Experience: 2 to 4 years of professional experience in SRE, DevOps, or Infrastructure roles.
  • Cloud Proficiency: Practical, hands-on experience with both AWS and Azure platforms.
  • Technical Skills: Mid-level proficiency in Python (or similar scripting languages) and configuration management tools like Ansible.
  • Containerization: Solid understanding of Docker and orchestration via Kubernetes or ECS.
  • Infrastructure Fundamentals: Strong knowledge of Linux systems, networking protocols, and both Relational/NoSQL database architectures.
  • Soft Skills: Excellent written and verbal communication skills with the ability to manage competing priorities independently.
  • Flexibility: Ability to participate in a production on-call rotation, including work outside standard business hours.

Required Skills :

Basic Qualification :

Additional Skills :

This is a high PRIORITY requisition. This is a PROACTIVE requisition

Background Check : No

Drug Screen : No

Job Tags

Similar Jobs

Axiom Software Solutions Limited

Site Reliability Engineer (SRE) Job at Axiom Software Solutions Limited

 ...Role: Site Reliability Engineer (SRE) Location: Miami FL Onsite Position Type: Contract Required Skills & Qualifications 9+ years of experience in Site Reliability Engineering, DevOps, or similar role. Strong experience with Linux/Unix systems administration... 

Aramark

Valet Driver - Houston Methodist Hospital - Houston Methodist Hospital - Valet Job at Aramark

Job Description Position Summary: The Route Sales Driver is responsible for driving a company vehicle within an established route or territory and delivering goods and products to various customer locations. Essential functions and responsibilities of the position may...

Metropolitan Transportation Authority

Executive Agency Counsel, Litigation (A-B) Job at Metropolitan Transportation Authority

 ...This senior litigation attorney position is responsible for handling complex, high-exposure personal injury lawsuits brought against MTA agencies during all phases of litigation in state and federal trial courts from inception to resolution. Handle, complex, high-exposure... 

Gundersen Health System

Spanish Interpreter Job at Gundersen Health System

 ...are energized by working in a fast-paced environment as a Spanish speaking interpreter. What you will work: On Call (hours vary depending on need...  ...La Crosse and Onalaska locations What you will do: Relay medical information between speakers of two different languages Ensure... 

Chef Tanya's Kitchen

Kitchen Team Member Job at Chef Tanya's Kitchen

 ...Greetings future Team Members! Chef Tanya's Kitchen is a growing vegan company currently with 2 locations! This position is hourly PLUS tip pool!! We need some skilled and enthusiastic people to add to our kitchen team! Please have 1 year of high-volume of kitchen...