Summary
Overview
Work History
Education
Skills
Websites
Timeline
Generic

Alden Liaw Yee Liang

Cyberjaya,Selangor

Summary

Proactive and detail-oriented IT Operations & Data Center Team Lead with 8+ years of experience managing GPU/AI compute racks, storage servers, and enterprise CPU platforms. Skilled in hardware troubleshooting and replacement (HDD/SSD, CPU, RAM, PSU, backplanes), firmware/BMC/BIOS/FRU updates, structured cabling, ticketing systems, and SLA-driven support. Adept at leading 24/7 teams, enforcing SOPs, and ensuring 99.99% uptime in mission-critical environments. Proven record in vendor coordination, audits, and large-scale server deployments for global clients including Alibaba, Tencent, ByteDance, Bigo, China Telecom, as well as on-site helpdesk support for universities and VIPs in Singapore.

Overview

9
9
years of professional experience

Work History

IT Operation Team Lead

Aolani Cloud
08.2025 - Current
  • - Lead IT operations for high-density AI and GPU-based compute racks in a 24/7 production environment.
  • - Coordinate with clients, vendors, and internal teams to ensure uptime, efficiency, and compliance.
  • - Develop SOPs for troubleshooting, escalation, and preventive maintenance.
  • - Perform hardware troubleshooting and replacement (HDD/SSD, CPUs, RAM, motherboards, PSUs, backplanes, cabling).
  • - Execute firmware updates (BMC, BIOS, FRU) to maintain vendor compliance.
  • - Manage inventory control for spare parts and coordinate RMA with vendors.
  • - Support infrastructure audits and client inspections, achieving 100% compliance.
  • - Oversee incident handling via ticketing systems, ensuring SLA compliance and timely resolution.
  • Key Achievements:
  • Deployed & integrated multiple GPU clusters supporting production AI workloads.
  • Reduced rack setup time by 20% via optimized cabling/labeling workflows.
  • Improved incident response efficiency, cutting downtime incidents by 30%.

IDC Team Lead

Titanicom Tech Limited
07.2024 - 07.2025
  • - Directed a rotating 24/7 team handling AI compute deployments across multi-rack environments.
  • - Oversaw structured cabling projects (MPO/AOC), ensuring labeling accuracy, airflow optimization, and signal integrity.
  • - Deployed, configured, and validated GPU clusters and NVLink interconnects for AI/ML workloads.
  • - Optimized rack layouts, PDUs, and airflow, reducing cooling inefficiencies.
  • - Performed component-level replacements (disks, DIMMs, PSUs, cabling) during live operations with minimal downtime.
  • - Maintained 99.99% uptime through proactive monitoring and preventive maintenance.
  • - Controlled on-site inventory, logged spare part usage, prepared RMA shipments, and tracked assets.
  • - Managed incidents using ticketing systems, ensuring SLA adherence and accurate reporting.
  • - Tools used: Winterm, MobaXterm, BMC/IPMI, Redfish APIs.
  • Key Achievements:
  • Deployed & integrated multiple GPU clusters supporting production AI workloads.
  • Reduced rack setup time by 20% via optimized cabling/labeling workflows.
  • Improved incident response efficiency, cutting downtime incidents by 30%.

Computer Engineer

Aspeed Infotech Pte Ltd
01.2017 - 12.2022
  • - Deployed and maintained enterprise servers (1U–4U) across multi-vendor platforms: Supermicro, Foxconn, Huawei, Dell, Inspur.
  • - Implemented BMC/IPMI remote management, improving recovery speed.
  • - Supported enterprise clients (Alibaba, Tencent, ByteDance, Bigo, China Telecom) in high-availability environments.
  • - Provided on-site helpdesk support for Singapore Management University (SMU) and Singapore Institute of Management (SIM University), assisting students, staff, and VIPs.
  • - Delivered Lenovo helpdesk services, including troubleshooting, escalation, SLA-based support, and hardware replacement (desktops, laptops, and peripherals).
  • - Led relocation of 2,000+ servers across borders with zero downtime.
  • - Troubleshot complex hardware issues: BIOS/BMC mismatches, NIC failures, PSU/NVMe faults, and backplane errors.
  • - Executed firmware and FRU updates to ensure fleet compatibility.
  • - Maintained hardware inventory, managing spare stock, RMAs, and asset documentation.
  • - Managed support tickets, prioritized issues according to SLA requirements, and delivered timely resolutions.
  • Key Achievements:
  • Reduced deployment time by 30% via workflow optimization.
  • Maintained 99.99% uptime across data centers.
  • Authored troubleshooting SOPs adopted by client operations teams.
  • Delivered consistent VIP helpdesk support at SMU & SIM University, ensuring uninterrupted service for faculty, executives, and senior staff.

Education

Diploma - Information Technology

Erican College
Kuala Lumpur Malaysia
01.2017

Skills

  • Data Center Operations & Team Leadership – 24/7 shift management, incident response, vendor coordination, client support
  • Server Expertise – GPU servers (AI/ML workloads, NVLink validation), Storage servers (disk arrays, RAID, backplanes), CPU servers (enterprise x86 multi-vendor)
  • Hardware Troubleshooting & Replacement – HDD/SSD, CPUs, RAM, motherboards, PSUs, hard disk backplanes, PCIe backplanes, NICs, structured cabling (MPO/AOC & copper)
  • Firmware & Configuration Updates – BIOS, BMC/IPMI, FRU, Redfish APIs, lifecycle management across heterogeneous fleets
  • Deployment & Validation – Rack integration, airflow optimization, PDU setup, acceptance testing, NVLink/PCIe connectivity checks
  • Inventory & Asset Management – Spare parts tracking, stock level monitoring, RMA handling, lifecycle documentation
  • Helpdesk & SLA Management – Ticketing systems, incident prioritization, SLA compliance, and VIP user support
  • Remote Management & Tools – IPMI, BMC CLI, Winterm, MobaXterm, remote diagnostics
  • Compliance & SOPs – Standard operating procedures, preventive maintenance, audit readiness, client inspections

Timeline

IT Operation Team Lead

Aolani Cloud
08.2025 - Current

IDC Team Lead

Titanicom Tech Limited
07.2024 - 07.2025

Computer Engineer

Aspeed Infotech Pte Ltd
01.2017 - 12.2022

Diploma - Information Technology

Erican College
Alden Liaw Yee Liang