Singapore · Coinbase Cold Storage
Blockchain infrastructure for institutions that can't afford downtime.
Penghian Ang is a blockchain infrastructure and DevOps engineer with 5+ years operating validator and RPC nodes across Coinbase, Crypto.com, and Tencent. Currently a Cold Storage Engineer III at Coinbase — approving institutional transactions and operating staking infrastructure under 24×7 compliance protocols.
Work
-
2025 — Now
Cold Storage Engineer III · Coinbase
- Built pre-call automation integrating Calendly, Snowflake, Datadog, and Slack to auto-surface institutional client context — transactions, duress status, identity, approver eligibility — before high-stakes approval calls.
- Automated cold-to-hot wallet balance restoration with multi-source enrichment (Snowflake, Datadog, Glean, Messari) via Slack, cutting team AI token spend by $500–$1,000 USD/month.
- Approved high-value institutional cold-storage transactions and conducted video verification calls under Coinbase Prime and Custody operational protocols.
- Provided white-glove support to institutional clients on staking, cryptographic, and transaction issues across multiple chains.
- Operated in 24×7 global rotation under strict compliance and security standards. Partnered with Security, Product, and Engineering on operational improvements.
- Snowflake
- Datadog
- Slack
- Custody ops
- Compliance
-
2023 — 2025
Senior Blockchain Security DevOps Engineer · Crypto.com
- Provisioned and operated validator and RPC nodes across TON, Conflux, Celestia, Oasis, Fetch.ai, NEO3, Neutron, ZetaChain, and Vara — including upgrades and incident response.
- Built node deployment pipeline using Terraform, Docker, Ansible, and Azure DevOps CI/CD with auto-pull of updated container images.
- Standardized Cosmos-family deployments by consolidating multiple Ansible playbooks into one modular template, reducing rollout time for new chains.
- Built developer environment POC: Terraform-provisioned Azure VMs accessed via Teleport from VS Code Remote, with automated idle-instance shutdown after 6pm to reduce developer cloud spend.
- Wrote custom Prometheus exporters for chain-specific node health monitoring.
- Terraform
- Ansible
- Docker
- Cosmos
- Azure DevOps
- Prometheus
-
2022 — 2023
DevOps Engineer · Tencent
- Primary point-of-contact for PUBG Mobile production infrastructure: 6,000+ VMs supporting 4–10M daily concurrent users.
- Reduced alert noise from 600+ rules to 140 actionable alerts by reclassifying and rewriting system and network monitoring logic.
- Built FastAPI-based alert automation integrating Bash scripts and CI/CD pipelines for incident auto-triage and WeCom responder notification.
- Solution presented to the Director of Tencent Overseas Games and adopted as best-practice reference across teams.
- FastAPI
- Prometheus
- Grafana
- Bash
- CI/CD
-
2021 — 2022
DevOps Engineer · UP DevLabs
- Managed environments across AWS, Aliyun, and Tencent Cloud. Built CI/CD pipelines for repeatable deployments.
- Implemented ELK Stack for developer self-service log search, reducing ops support load.
- Established Grafana and Prometheus monitoring for system health and capacity planning.
- AWS
- Aliyun
- ELK
- Grafana
-
2021
System Analyst · Kenrich Partners
- Day-1 phishing incident response. Coordinated with legal counsel (Rajah & Tann) and performed root cause analysis.
- Led MAS TRM compliance initiatives: asset tracking, system hardening, SIEM setup.
- Migrated on-prem infrastructure to Azure.
- MAS TRM
- SIEM
- Azure
- Incident response
-
2020 — 2021
SNOC Engineer · Government Technology Agency (GovTech)
- Monitored and maintained uptime for critical government systems using AWS CloudWatch, Grafana, Splunk, and SolarWinds.
- Supported real-time incident triage, root cause analysis, and cyber threat detection for national digital services.
- CloudWatch
- Splunk
- SolarWinds
- Grafana
-
2020
NOC Engineer · Netpluz Asia
- Provisioned and troubleshot circuits (Baccess, GPON, MetroE) and configured Cisco and Sophos network devices.
- Built Python automation tools to streamline operations, reducing manual ITSM reporting by 80%.
- Cisco
- Sophos
- Python
- Networking
Projects
-
2025
Site Reliability Engineer · Valigator
- Built production-grade Solana staking infrastructure for a white-glove validator service.
- Authored Ansible playbooks to automate validator provisioning, upgrades, and monitoring setup.
- Developed a custom Solana Node Exporter integrated with Prometheus and Grafana dashboards.
- Solana
- Ansible
- FastAPI
- Prometheus
- Grafana
-
2024 — 2025
Senior Infrastructure Support Engineer · Thoughtworks
- Designed and deployed enterprise-scale cloud infrastructure using Infrastructure as Code, improving scalability and resilience.
- Led end-to-end build and maintenance of SNTC.org.sg, a public-good project supporting Singapore's special needs community.
- Delivered government-requested website updates within 2 weeks, ensuring compliance and uninterrupted service.
- Partnered with CTO/CIO/COO stakeholders to align technical delivery with organizational goals.
- AWS
- Terraform
- IaC
- Compliance
Open source · github.com/angpenghian
-
2025
Agent Times
x402 pay-per-request news API designed for AI agents to consume. Node.js, Express, Docker, SearXNG, SQLite on DigitalOcean. 28,000+ articles indexed across 1,100+ RSS feeds. Received first real USDC payment on Base mainnet.
- Node.js
- Express
- Docker
- SearXNG
- x402
- Base
-
Ansible
solana-ansible-kit
Production-grade Ansible automation provisioning Solana validator fleets with security hardening, performance tuning, and zero-downtime upgrades.
- Ansible
- Solana
- Linux
-
CI/CD
solana-repro-builds
Automated CI/CD pipeline (GitHub Actions) publishing reproducible Solana validator binaries with hermetic Docker builds and checksum verification.
- GitHub Actions
- Docker
- Solana
-
Prometheus
solana-exporter
Prometheus exporter (Python/FastAPI) for Solana validator observability — exposes 26+ metrics, ships with a 21-panel Grafana dashboard.
- Python
- FastAPI
- Prometheus
- Grafana
Skills
-
Blockchain
Solana · TON · Cosmos (Neutron, Fetch.ai, Vara) · Celestia · Oasis · Conflux · NEO3 · ZetaChain · validator and RPC node operations · staking infrastructure
-
Cloud & IaC
AWS · GCP · Azure · Tencent Cloud · Terraform · Ansible
-
Containers & CI/CD
Docker · Kubernetes (CKA) · Helm · GitHub Actions · Azure DevOps · Jenkins · ArgoCD
-
Data & Observability
Snowflake · Prometheus · Grafana · Datadog · ELK · Splunk · PagerDuty
-
Scripting & Automation
Python (FastAPI, Flask) · Bash · JavaScript
-
Integrations
REST APIs · Slack · Calendly · Glean · Messari
-
Security & Compliance
SIEM · MAS TRM · incident response · root cause analysis · post-mortems
-
AI Tooling
OpenAI API · Claude Code · Cursor · n8n · agentic patterns (tool orchestration, context management) · AI-assisted development workflows
Certifications
-
CKA
Certified Kubernetes Administrator · CNCF
-
Terraform
HashiCorp Certified — Terraform Associate
-
AWS SAA
AWS Certified Solutions Architect — Associate
-
AWS CCP
AWS Certified Cloud Practitioner
-
Tencent SA
Tencent Cloud Solutions Architect — Associate
-
Tencent SysOps
Tencent Cloud SysOps — Associate
-
Azure AZ-900
Microsoft Certified — Azure Fundamentals
-
Sophos CE
Sophos Certified Engineer
Contact
Available for blockchain infrastructure work. penghian@gmail.com · LinkedIn · GitHub · Resume (PDF)