Lead Infra Engr, Infra Hybrid IT

Date: 15 Apr 2024

Location: Singapore, Singapore

Company: Singtel Group

PRIMARY PURPOSE

  • The Lead System Engineer is responsible to lead the team in implementing change requests to system changes, manage budget and resources meeting client’s requirements and quality standards. He is also required to evaluate internal and external business environment and develop long term strategies and propose integration strategies for the various technology components to achieve a holistic solution.

 

RESPONSIBILITIES
Business Development

  • Evaluates internal & external business environment to develop long-term strategy for the unit/organization 
  • Advice on the selection, design and justification, implementation & operation of technology, information security controls and security management techniques adoption based on business & technical consideration. Propose integration strategies for the various technology components to achieve a holistic solution

 

Project Delivery

  • Gather business and/or application requirements on the infrastructure, conduct impact analysis, suggest design/re-design to integrate the change into the existing environment 
  • Test systems in accordance to specifications & service level. Where relevant, perform the necessary system programming & configuration 
  • Manage systems changes through established change request process & provide status reports to the relevant parties 
  • Respond promptly to incident, investigate & provide temporary &/or permanent resolution of incidents escalated. Provide timely status updates to relevant parties 
  • Conduct root cause analysis & implement pro-active measures. Monitor effectiveness of implemented measures 
  • Monitor & measure the performance & availability of systems proactively; implement corrective actions identified to improve performance & availability 
  • Monitor the agreed service level (.e.g. service request, system availability), document & maintain the configuration of the systems; provide regular reporting to relevant parties 
  • Execute service continuity measures, e.g. backup/restore procedures & disaster recovery plan dry run to ensure continuous operation of the business 
  • Ensure the management of ICT systems adhere to established ISO20000 & ISO27001 processes / procedures and ITIL best practices / ITSM and methodologies (e.g. change management, release management, incident management, problem management, configuration management) 
  • Manage budget, resources and schedule for system implementation / operation activities; ensure deliverables meet client requirements and quality standards 
  • Established facility management standards/best practices to ensure operation consistency across project/facility management teams.

 

Customer /Team Management

  • Provide systems related technical advice to customers or project team. Lead and assist in the development of bids and proposals 
  • Manage support resources to prioritize task allocation and schedule resource to ensure that services are not impacted
  • Liaison with other associated project teams to ensure that the service is monitored, secured and operating at its optimal level.

 

REQUIREMENTS

  • Bachelor’s or master’s degree in computer science, Information Technology, or a related field.
  • Proven experience as a Systems Engineer with a focus on High-Performance Computing.
  • Knowledge of HPC architectures, technologies, and parallel programming languages.
  • Technical Proficiency: 
  • Familiarity with cluster management tools, job scheduling systems, and distributed file systems.
  • Experience with high-speed interconnects (e.g., InfiniBand) and networking in HPC environments.
  • Problem-Solving Skills: Strong analytical and problem-solving skills to address complex HPC challenges.
  • Communication: Excellent communication and collaboration skills to work effectively in interdisciplinary teams.