Platform & HPC Data Engineer (TS/SCI with CI Poly)
Company: Maxar Technologies
Location: Herndon
Posted on: January 23, 2025
Job Description:
Please review the job details below. Maxar is seeking a skilled
Platform and HPC Data Engineer --to support the design,
implementation, and optimization of data management solutions in
high-performance computing (HPC) environments. The ideal candidate
will have extensive experience working with various file systems,
data labeling/tagging systems, and the configuration of a wide
range of storage appliances. This role involves ensuring that data
workflows, storage configurations, and metadata management are
efficient, scalable, and aligned with organizational and government
security requirements. The successful candidate will work within a
cross-disciplinary team to support the technical needs of HPC
platforms, data management, and large-scale computational
workflows. Key Responsibilities:
- Platform and HPC Data Engineering: --Design and implement data
management systems and architectures for HPC platforms, focusing on
optimizing data flow, storage, and access in large-scale computing
environments.
- File System Management: Oversee the configuration, maintenance,
and optimization of distributed file systems (e.g., Lustre, IBM
Spectrum Scale, NFS, GPFS) and storage solutions used in HPC
environments to ensure efficient performance, scalability, and
reliability.
- Data Labeling and Tagging: --Implement and manage
metadata-driven systems for data labeling/tagging. This includes
the development of strategies for classifying, indexing, and
organizing datasets to enhance data discoverability, access
control, and auditing.
- Storage Appliance Configuration: Configure and maintain various
storage appliances (e.g., NetApp, Dell EMC, HPE) and integrated
storage solutions. Ensure that storage devices are optimized for
performance, capacity, and availability within the HPC
ecosystem.
- Data Integration and Workflow Optimization: Integrate data
storage and management systems with HPC clusters, ensuring seamless
data flow between compute nodes and storage appliances. Optimize
data pipelines to support high-throughput workloads and minimize
bottlenecks in I/O performance.
- Performance Tuning: --Monitor and improve the performance of
storage systems, focusing on I/O throughput, latency, and efficient
resource allocation. Use performance metrics to guide optimizations
across storage appliances and file systems.
- Security and Compliance: --Implement security best practices
for data access, protection, and management, ensuring compliance
with government regulations and internal data governance policies.
Configure encryption, access control, and secure data sharing
methods.
- A utomation and Scripting: Develop and maintain automation
scripts (e.g., using Python, Bash, or Perl) to streamline storage
configurations, data labeling/tagging, and system monitoring tasks.
Automate processes related to data integration and HPC platform
management.
- Collaboration and Support: Work closely with data scientists,
HPC administrators, software developers, and other technical staff
to support ongoing projects. Provide expertise in troubleshooting
data storage issues and ensuring optimal system performance.
- Documentation and Reporting: --Maintain thorough documentation
for storage configurations, file system setups, data
labeling/tagging procedures, and performance optimization
strategies. Provide regular reports on system health, data
management processes, and any improvements made. Required
Qualifications:
- Education:--Bachelor's degree in Computer Science, Information
Technology, Engineering, or a related field. A Master's degree or
higher is a plus.
- Experience:
- 7+ years of experience in managing data infrastructure in HPC
environments, with expertise in file systems, storage appliances,
and data workflows.
- Hands-on experience with distributed file systems, including
Lustre, IBM Spectrum Scale (GPFS), NFS, and others commonly used in
HPC settings.
- Proven experience with storage appliance configuration (e.g.,
NetApp, Dell EMC, HPE, or similar systems), including performance
tuning, capacity management, and reliability.
- Strong experience in implementing data labeling/tagging
systems, metadata management, and structuring large datasets for
efficient access and compliance.
- Knowledge of high-performance networking protocols (e.g.,
InfiniBand, RDMA) and their role in data transfer and storage
optimization.
- Familiarity with data access protocols like GridFTP, rsync, and
NFS for large-scale data transfer. Desired Skills:
- Experience with cloud storage integration or hybrid cloud
environments, with knowledge of cloud-native storage solutions
(e.g., AWS S3, Ceph, OpenShift).
- Familiarity with high-performance computing (HPC) schedulers
(e.g., SLURM, PBS, Torque) and their interaction with data storage
systems.
- Understanding of data protection mechanisms, including data
replication, backup strategies, and disaster recovery in HPC
environments.
- Experience with containerization (Docker, Singularity) in an
HPC context for data processing and application deployment.
- Experience with machine learning or data science workflows in
HPC environments. #cjpost #LI-RD In support of pay transparency at
Maxar, we disclose salary ranges on all U.S. job postings. The
successful candidate's starting pay will fall within the salary
range provided below and is determined based on job-related
factors, including, but not limited to, the experience,
qualifications, knowledge, skills, geographic work location, and
market conditions. Candidates with the minimum necessary
experience, qualifications, knowledge, and skillsets for the
position should not expect to receive the upper end of the pay
range. --- The base pay for this position within the Washington, DC
metropolitan area is: $131,000.00 - $219,000.00 annually.
For all other states, we use geographic cost of labor as an input
to develop market-driven ranges for our roles, and as such, each
location where we hire may have a different range. We offer a
comprehensive package of benefits including paid time off, health
and welfare insurance, and 401(k) to eligible employees. You can
find more information on our benefits at: -- The application window
is three days from the date the job is posted and will remain
posted until a qualified candidate has been identified for hire. If
the job is reposted regardless of reason, it will remain posted
three days from the date the job is reposted and will remain
reposted until a qualified candidate has been identified for
hire.-- The date of posting can be found on Maxar's Career page at
the top of each job posting. To apply, submit your application via
Maxar's Career page. Maxar Technologies values diversity in the
workplace and is an equal opportunity/affirmative action employer.
All qualified applicants will receive consideration for employment
without regard to sex, gender identity, sexual orientation, race,
color, religion, national origin, disability, protected veteran
status, age, or any other characteristic protected by law.
Keywords: Maxar Technologies, Towson , Platform & HPC Data Engineer (TS/SCI with CI Poly), Engineering , Herndon, Maryland
Didn't find what you're looking for? Search again!
Loading more jobs...