Description
Leidos seeks a Network Reliability Engineer to support NASA’s Network and Telecommunications organizations. This position will serve as the Network Reliability and Availability Lead, ensuring that NASA’s critical network systems and services are dependable, resilient, and meet defined availability and performance targets.
The successful candidate will combine deep technical expertise in network engineering with strong analytical and reliability-focused problem-solving skills. This role is vital to maintaining the operational integrity and uptime of NASA’s LAN, WAN, Mission, Firewall and Voice networks, supporting mission success across NASA centers and facilities.
Key Responsibilities
- Lead network reliability and availability engineering initiatives across NASA’s LAN, WAN, Mission, FW and Voice infrastructure.
- Develop and implement reliability frameworks, including uptime metrics, service availability targets, and performance baselines.
- Monitor and analyze network performance and fault data to identify trends, predict failures, and proactively mitigate risks.
- Collaborate with engineering, operations, and cybersecurity teams to ensure fault-tolerant and resilient architectures.
- Conduct root cause analysis (RCA) for network incidents and implement corrective actions to improve overall reliability.
- Design and maintain redundancy, disaster recovery, and failover strategies that meet NASA’s mission needs.
- Develop and maintain availability dashboards, performance reports, and SLA tracking for NASA and Leidos leadership.
- Support capacity planning and modernization efforts to ensure scalability and performance consistency.
- Provide clear communication and technical reporting to both NASA technical teams and leadership stakeholders.
Basic Qualifications
- Bachelor’s degree in Computer Science, Engineering, Information Technology, or related field (Master’s preferred).
- 7+ years of experience in network or IT systems engineering with a focus on availability, reliability, and performance optimization.
- U.S. Citizenship required; must be eligible for a NASA or Federal security clearance.
- Hands-on experience with network monitoring and observability tools (e.g., SolarWinds, Splunk, NetScout, or equivalent).
- Strong understanding of LAN/WAN architectures, fault tolerance, redundancy, and failover technologies.
- Demonstrated success improving system uptime and service delivery performance in complex enterprise environments.
- Excellent analytical, problem-solving, and documentation skills.
Preferred Qualification
- Experience supporting NASA or other Federal mission-critical networks.
- Familiarity with SRE (Site Reliability Engineering) principles and frameworks.
- Knowledge of network automation, scripting, or AIOps tools.
- Industry certifications such as CCNP, DevNet, ITIL, or equivalent.
- Experience with high-availability architectures, performance baselining, and SLA management.
At Leidos, we don’t want someone who "fits the mold"—we want someone who melts it down and builds something better. This is a role for the restless, the over-caffeinated, the ones who ask, “what’s next?” before the dust settles on “what’s now.”
If you’re already scheming step 20 while everyone else is still debating step 2… good. You’ll fit right in.
Original Posting:
October 29, 2025
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
Pay Range:
Pay Range $72,150.00 - $130,425.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.