Snr Service Delivery Engineer (Cloud)
Date: 21 Nov 2024
Location: Singapore, Singapore
Company: Singtel Group
At Singtel, our mission is to Empower Every Generation. We are dedicated to fostering an equitable and forward-thinking work environment where our employees experience a strong sense of Belonging, to make meaningful Impact and Grow both personally and professionally. By joining Singtel, you will be part of a caring, inclusive and diverse workforce that creates positive impact and a sustainable future for all.
To design, implement, and manage cloud data processing pipelines (e.g., AWS or equivalent), perform Tier 2 & 3 service delivery and operations for Network Analytics Engineering systems, develop big data solutions for real-time and batch processing of large telco datasets, plan hybrid cloud strategies for the data lake, manage data governance and protection, and collaborate with business domain experts, data scientists, and operational teams to create E2E data system solutions.
Tier 3 Systems SME
- Accountable for overall System Performance and Design.
- Accountable for Change Management outcomes, executing minor to major software upgrades as well as solution changes independently.
- Responsible to manage vendors and technically debate on optimum solution performance while ensuring robust and cost-efficient architecture.
- Drives use of automation for deployment and implement near real-time alerts for security or critical issues.
- Devise UAT plan and conduct testing for the new features/ bug fixes towards acceptance criteria.
- Develop Method of Procedures facilitating Change Management with any system impact.
- Manage project initiatives for sensitive systems with minimal supervision and ability to deliver project within the timeline.
Tier 2 Systems Administration & Operation Support
- Accountable for overall System Health, Recovery and Security.
- Accountable to manage Classified Keys, which has root access to Sensitive Systems.
- Responsible to drive improvements in System’s security posture.
- Execute planned night activities for production system signature update and bug fixes.
- Ensure the System / Service uptime to the SLA.
- Drive daily system health check and monitoring and reporting on daily basis to Director, and raise any issues observed with vendor, follow-up to work on the fixes.
- Resolve Trouble tickets independently and resolve data dispute issues within SLA.
- Manage Vendors and conduct recurring operations reviews to track the Tickets SLA.
- Prepare data for monthly Operations meeting with Director.
- Drive quarterly system audit reviews and highlight any abuse on subscriber or sensitive workspace user activities in meeting with Director.
Manage System Security (Cloud and on-premises)
- Management of Anti-Malware systems and perform monthly scanning for any security threats and system administration and perform monthly scanning for systems.
- Analyse security reports from Anti-Malware scans, determining best course of action.
- Managing the implementation and provisioning of firewalls, switches for the Department systems.
- Administrator for the firewalls and perform the quarterly review for the rules to ensure the system defence-in-depth and security.
Support InfoSec and implement Security Governance initiatives.
- Manage PIAM (Physical access and Identity management) for the user account administration to improve security and avoid manual password changes/management and control and keep track of the elevated privileged user accounts logins.
- Design, deploy and Operate a Centralise log server to store all logs and process to expeditiously support investigations into security incidents.
- Deploy, implement, and operate vulnerability scanning tool and analyse reports and closely working with security governance and planners to conduct remediation activities which he/ she will perform.
- Achieve zero SL1 and cyber-security incidents and to resolve trouble tickets within the given SLA. Represent NAE as member of NSC Security Rep and ISMS Networks Security working Committee.
- Lead Incident Management and provide timely update to the Management and accountable for the RCA of the managed platforms.
- Maintain high level of Data pipelines and system availability (Data Integrity & Systems 95%+)
- Ensure operational processes for the respective systems are well documented. This includes system inventories, solution doc, IP/Network design, SOPs etc.
- Update NAE systems to the latest stable Software version. Keep the system updated with the Periodic patch updates (OS, software, Firmware, Switches, Firewall) – Half yearly/yearly.
- Conduct monthly operations review and quarterly Audits on time and Tickets resolution within SLA.
- Ensure no gaps in security monitoring. Perform bi-weekly Antimalware signature updates and monthly Scans. Complete and report IOC scans and security patches to network Security Operation Centre. Support various security initiatives to achieve full audit compliance.
- Use Ansible and scripting to automate system operations.
Skills for Success:
- Degree in Engineering or IT
- Good to be certified in Redhat architect certification, Cloudera CDP certification or Cyber security (CISSP) certification.
- Good to have deep knowledge of mobile and broadband network technologies.
- At least 5 years’ experience in data solution and administration
- Proficient in either 3 of the following:
- Cloud Data solution (AWS)
- Big data and data warehousing operations.
- System administration
- System integration
- Data architect
- Hadoop and Spark (or equivalent)
- Linux/Unix, Ansible automation, Shell Scripting
- Kubernetes, Docker, serverless functions, APIs and Kafka bus
- Network and Security design
- Good technical writing skills and presentation skills
Your Career Growth Starts Here. Apply Now!
We are committed to a safe and healthy environment for our employees & customers and will require all prospective employees to be fully vaccinated.