Senior Site Reliability Engineer
 
                            Company : Apex Systems
Location : Fairfax, VA, 22030
Posted Date : 28 October 2025
Job Details
Senior Site Reliability Engineer
Our client is seeking a Senior Site Reliability Engineer.
Our client is seeking talented professionals to join our successful and growing team in building the next-generation Continuous Diagnostics and Mitigation (CDM) Cyber data solution. The CDM Program is the Cybersecurity and Infrastructure Security Agency's (CISA) dynamic approach to strengthening the cybersecurity of Federal networks and systems through better awareness and visibility into their security posture and cyber threats. ECS is responsible for designing, building, deploying, operating, and maintaining a complete 'Data Services' solution which includes the collection, normalization, visualization, and sharing of cyber data from more than 100 Federal agencies. The CDM Data Services product is an integrated suite of multiple Commercial Off the Shelf (COTS) products, software configuration packages, and custom code which work together to operate as an integrated solution tailored to meet Department of Homeland Security (DHS) requirements.
We are seeking professionals who thrive in a dynamic, fast-paced, and highly collaborative environment where problem-solving, critical thinking, and a holistic approach to serving the mission are key. Our program operates within the Scaled Agile Framework (SAFe). An aptitude and enthusiasm for continuous learning, improvement, and cyber security is a must!
Role & Responsibilities:
ECS is seeking a talented Senior Site Reliability Engineer (SRE) to play a key role in defining, implementing, and growing our SRE practice to ensure the reliability, availability, and performance of our critical production environments.
The Senior SRE will contribute to a culture of continuous improvement, identifying areas for enhancement, and driving initiatives to improve system reliability, scalability, and efficiency.
The successful candidate will have demonstrated hands-on experience designing, implementing, and maintaining solutions to ensure that systems, including infrastructure and applications, are resilient, highly available, and performant. The Senior SRE will also play a critical role in defining and measuring the Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for our solution.
The Senior SRE will be responsible for setting up comprehensive logging, monitoring, and alerting solutions using the Elastic stack and other tools as necessary to ensure the continuous performance of services. Additionally, they will respond to incidents, perform root cause analyses, and implement solutions to prevent reoccurrences. The Senior SRE will work in close collaboration with other SRE team members, developers, testers, infrastructure engineers, DevOps engineers, and other stakeholders to integrate reliability and observability into the software development lifecycle.
Required Skills:
- US citizenship with ability to obtain Public Trust Suitability
- 6+ years of experience as a Site Reliability Engineer (SRE) or equivalent
- 6+ years of demonstrated experience designing, implementing, and maintaining observability solutions to include logging, monitoring, and alerting
- 6+ years of hands-on experience with SRE tools (e.g., Elastic, Prometheus, Grafana, Splunk, etc.)
- 3+ years defining and measuring SLOs and SLIs
- 3+ years of relevant experience using cloud platforms (AWS GovCloud preferred)
- 3+ years of hands-on programming or scripting (e.g., Python, Bash, etc.)
- Strong knowledge of microservices, containerization, and orchestration tools (Docker, Kubernetes)
- Proven ability to collaborate with cross-functional teams (development, testing, and product) to integrate reliability and observability into the software development lifecycle
- Strong problem-solving and analytical skills
- Proactive, detail-oriented approach to identifying inefficiencies and implementing improvements
Desired Skills:
- Bachelor's degree in Computer Science, Engineering, or a related field (or 4 additional years of related experience)
- Experience working in an Agile/SAFe environment using ALM tools (Jira, Confluence, or similar)
- Strong understanding of CI/CD principles and platforms (Jenkins, CircleCI, GitLab, GitHub Actions, Argo, Travis CI, etc.)
- Expertise in configuration management tools (Ansible, Puppet, Chef)
- Experience with infrastructure as code (Terraform, CloudFormation)
- In-depth understanding of networking, security, and system administration of Linux operating systems
- Knowledge of version control platforms and branching strategies
- Knowledge of disaster recovery planning, backup strategies, and data replication
- Experience supporting large Federal programs ($200M+)
EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at employeeservices@apexsystems.com or 844-463-6178.
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico.
Apex Benefits Overview: Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401K program which allows you to contribute typically within 30 days of starting, with a company match after 12 months of tenure. Apex also offers a HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program and other discounts. In terms of professional development, Apex hosts an on-demand training program, provides access to certification prep and a library of technical and leadership courses/books/seminars once you have 6+ months of tenure, and certification discounts and other perks to associations that include CompTIA and IIBA. Apex has a dedicated customer service team for our Consultants that can address questions around benefits and other resources, as well as a certified Career Coach. You can access a full list of our benefits, programs, support teams and resources within our 'Welcome Packet' as well, which an Apex team member can provide.
Trending Searches in Fairfax, VA
- Full time jobs near me Fairfax, VA
- Local job openings
- Places hiring near me
- Job vacancies near me
- Site reliability engineer jobs near me Fairfax, VA
- Site reliability engineer jobs hiring near me Fairfax, VA
- Site reliability engineer jobs hiring near Fairfax, VA
- Site reliability engineer jobs near Fairfax, VA
- Site reliability engineer jobs near me in Fairfax, VA
- Site reliability engineer jobs in Fairfax, VA
Top trending job titles hiring now
Popular Searches for Site Reliability Engineer
- Site reliability engineer jobs
- Engineering jobs near me
- Site reliability engineer jobs in the last 3 days
- Jobs near me
- Site reliability engineer jobs since yesterday
- Sre jobs near me
- Senior Site Reliability Engineer
- Jobs near me in the last 3 days
- Jobs hiring near me in the last 3 days
- Real estate jobs near me
Other Jobs You May Like
Lead Data Engineer (Enterprise Platform Technology)
Company : Capital One
Location : Falls Church, VA
Lead Software Engineer, Back End (Enterprise Platforms Technology)
Company : Capital One
Location : Falls Church, VA
Top searches
- Jobs hiring immediately
- Part time jobs near me
- Full time jobs near me
- Jobs that are hiring near me
- Jobs near me hiring now
- Site reliability engineer jobs near me
- Site reliability engineer jobs
- Site reliability engineer jobs hiring near me
- Site reliability engineer openings near me
- Site reliability engineer vacancies near me
Employment opportunities at Apex Systems
- Apex Systems jobs near me Fairfax, VA
- Apex Systems jobs hiring near me Fairfax, VA
- Apex Systems jobs near Fairfax, VA
- Apex Systems jobs hiring near me
- Apex Systems openings near me
- Apex Systems jobs near me in Fairfax, VA
- Apex Systems jobs hiring in Fairfax, VA
- Employment opportunities near me
- Job openings near me
- Jobs hiring immediately
 Jobs USA
Jobs USA