Descrizione Lavoro
HPC Cloud Architect primary responsibilities are to establish the approach for integrating information applications, projects and programs and ensuring alignment with business strategies and drivers, and priorities relevant to Baker Hughes High Performance Computing (HPC) portfolio. This role needs to have deep knowledge of cloud computing platforms. This position is critical for Baker Hughes to run large-scale computational workloads, such as scientific simulations, data analytics, and machine learning, on the cloud. In this role the Cloud HPC Architect designs, implements, and manages the infrastructure, software, and processes to ensure these demanding workloads run efficiently, securely, and cost‑effectively.
Responsibilities
Architectural Design: Design and architect high-performance computing clusters and platforms on public cloud providers (e.g., AWS, Azure, Google Cloud). This includes selecting appropriate compute resources (CPUs, GPUs, FPGAs), storage systems, and high-speed networking solutions.
Solution Implementation: Implement and deploy cloud HPC solutions using Infrastructure as Code (IaC) tools like Terraform or Ansible. This involves configuring virtual networks, setting up distributed file systems (like Lustre, GPFS, or Ceph), and deploying cluster management and job scheduling software (e.g., Slurm, PBS).
Performance Optimization: Conduct performance analysis and benchmarking to identify and eliminate bottlenecks in the HPC stack. Tune application code for parallel processing using paradigms like MPI (Message Passing Interface) and OpenMP, and optimize hardware and software configurations to maximize performance.
Collaboration and Consultation: Work as a technical advisor to researchers, engineers, and data scientists, helping them migrate their workloads to the cloud, adopt best practices, and troubleshoot application‑specific issues.
Automation and Management: Develop and maintain automated solutions for provisioning, software deployment, monitoring, and scaling HPC resources. Implement robust backup, disaster recovery, and security protocols to ensure the integrity and availability of the systems.
Strategic Planning: Stay current with emerging cloud and HPC technologies, influencing product roadmaps and developing strategic plans for the organization’s cloud HPC adoption and growth.
Qualifications
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field with 10+ years experience.
Extensive experience with HPC system design and management, including hands‑on experience with hardware (CPUs, GPUs, InfiniBand) and software (compilers, schedulers, parallel programming libraries).
Deep understanding of parallel computing concepts and programming models (MPI, OpenMP, CUDA).
Proficiency in performance analysis tools and techniques.
Hands‑on experience with at least one major public cloud platform (AWS, Azure, or Google Cloud).
Knowledge of cloud services relevant to HPC, such as compute instances, high-performance storage, and networking.
Proficiency with Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible).
Strong Linux/Unix system administration skills.
Proficiency in scripting and automation languages (e.g., Python, Bash).
Experience with containerization and orchestration technologies like Docker and Kubernetes.
Familiarity with distributed file systems (Lustre, GPFS, Ceph) and job schedulers (Slurm, PBS).
Benefits
Contemporary work‑life balance policies and wellbeing activities.
Comprehensive private medical care options.
Safety net of life insurance and disability programs.
Tailored financial programs.
Additional elected or voluntary benefits.
Seniority level
Mid‑Senior level
Employment type
Full‑time
Job function
Information Technology
Baker Hughes Company is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
#J-18808-Ljbffr