Site Reliability Engineer (SRE) Job at JustinBradley, Reston, VA

  • JustinBradley
  • Reston, VA

Job Description

JustinBradley’s client, a leading source of mortgage financing, is seeking a highly experienced Site Reliability Engineer (SRE) to design, implement, and maintain secure, scalable, and resilient cloud infrastructure. The ideal candidate brings a deep understanding of cloud platforms, DevOps practices, and software development methodologies to ensure operational excellence, high availability, and system reliability.

Responsibilities:

  • Design, build, and manage cloud-based infrastructure on AWS, Azure, or GCP.
  • Automate deployments and configurations using Infrastructure-as-Code tools like Terraform, CloudFormation, and Ansible.
  • Create automation for anomaly detection, self-healing systems, and recovery workflows to reduce toil and optimize cloud costs.
  • Develop and manage CI/CD pipelines using tools such as Jenkins, GitLab, SonarQube, Docker, and Nexus/Artifactory.
  • Implement DevSecOps best practices, including IAM roles, RBAC, SAST/DAST/SCA tooling, and vulnerability remediation.
  • Build observability solutions with monitoring, logging, and tracing tools like AWS CloudWatch, Splunk, SignalFx, Dynatrace, and OpenTelemetry.
  • Define and track reliability metrics including SLOs, SLIs, error budgets, MTTR, and MTTD.
  • Architect and support microservices, serverless applications, and RESTful APIs with resilience patterns like Circuit Breaker, Retry, and Timeout.
  • Conduct chaos engineering experiments using AWS FIS and Chaos Toolkit, and perform resiliency testing via AWS Resilience Hub.
  • Manage and optimize various databases including PostgreSQL, MongoDB, DynamoDB, Oracle, and Redshift.
  • Support production systems including incident response, problem management, and runbook creation as part of on-call rotations.
  • Collaborate with cross-functional teams to embed shift-left testing strategies (e.g., BDD, TDD, unit, regression).
  • Maintain architecture documentation, disaster recovery plans, and internal knowledge articles.

Requirements:

  • 8+ years of experience in site reliability engineering or a related field with demonstrated leadership in complex projects.
  • Strong expertise in cloud platforms (AWS, Azure, or GCP), container orchestration, and infrastructure automation.
  • Proficiency in scripting/programming languages such as Python, Java, Bash, Node.js, and PowerShell.
  • Experience with DevOps and observability tools (e.g., Jenkins, Docker, Splunk, Dynatrace, OpenTelemetry).
  • Deep knowledge of databases including PostgreSQL, MongoDB, DynamoDB, Oracle, and Redshift.
  • Familiarity with event-driven architecture, distributed systems, and AI/ML integrations.
  • Strong understanding of security best practices, compliance frameworks, and incident management.
  • Hands-on experience with chaos engineering, resiliency assessments, and performance testing (e.g., JMeter, LoadRunner).
  • Excellent communication and collaboration skills.
  • AWS Solutions Architect or related cloud certification; Agile Certified Practitioner (ACP) a plus.
  • Experience with AI/ML frameworks such as spaCy, Transformers, SciPy, and tools like SageMaker and GenAI.
  • Familiarity with project management and ITSM tools (e.g., JIRA, Confluence, ServiceNow).
  • Experience with utilities and developer tools like AWS CLI, Postman, and curl.

JustinBradley is an EO employer – Veterans/Disabled and other protected employees.

Job Tags

Shift work
