Connecting...

Location
シンガポール
Salary
Competitive Salary
Job Type
Permanent
Ref
BH-23700
Contact
Joyce Chia
Contact email
Email Joyce
Contact phone
65 6692 9341
Posted
約7時間前
For an exciting early-stage AI venture we are seeking an accomplished tech leader to build the development and scaling of an advanced hybrid cloud and on-prem software platform. This role represents a unique opportunity to lead the software and engineering function of a transformative platform situated at the nexus of artificial intelligence, high-performance computing, and scientific innovation.
 
We’re looking for a visionary, hands-on technology leader to architect and own a secure internal developer platform that unifies HPC-aware MLOps, hybrid (on-prem + cloud) control planes, and best-in-class developer experience. From git-push to GPU, you’ll be responsible for the entire pipeline. We require a strong technology leader and people leader for the below position:
 
Key Responsibilities:
  • Drive platform strategy: Lead the design and evolution of the internal developer platform, including service catalogs, scaffolding, automated quality checks, and policy enforcement.
  • Architect hybrid compute: Oversee a unified infrastructure that integrates GPU clusters, on-prem schedulers, and cloud resources, optimizing for advanced scheduling, checkpointing, and data locality.
  • Shape data architecture: Guide multi-tenant database management, scalable object storage, data pipelines, and lakehouse solutions tailored to business domains.
  • Lead release engineering: Champion modern release practices—trunk-based workflows, blue-green deployments, GPU-aware testing, and continuous performance monitoring.
  • Uphold security & compliance: Ensure robust security frameworks and compliance with ISO-27001, SOC-2, and other standards, implementing service mesh, mTLS, secrets management, and vulnerability scanning.
  • Advance AI innovation: Spearhead federated and confidential AI solutions to meet enterprise data sovereignty needs.
  • Align with leadership: Translate business goals into actionable platform roadmaps and communicate progress to executives and stakeholders.
  • Build & mentor teams: Cultivate a high-performing engineering team, manage vendor partnerships, and ensure operational excellence, including 24/7 support and SLOs.
 
Qualifications:
  • Extensive platform and people leadership: Over 12 years building distributed or hybrid software platforms, including 5+ years in senior roles managing engineering teams of 30–80.
  • SaaS & security delivery: Proven track record delivering secure, multi-tenant SaaS or on-prem solutions with robust APIs, tenant isolation, and comprehensive audit trails.
  • Cloud-native & HPC expertise: Deep hands-on experience with Kubernetes, Argo CD, Terraform, service mesh, HPC schedulers (Slurm, PBS), GPU scheduling, and data-plane encryption.
  • Developer experience leadership: Drove initiatives for internal developer platforms, mono/poly-repo strategies, and automated engineering health metrics.
  • Advanced data engineering: Strong proficiency in databases (MongoDB), streaming pipelines (Kafka), OLAP systems (DuckDB, Snowflake), and scientific schema/ontology design.
  • Security & compliance success: Led teams through ISO-27001, SOC-2, or equivalent security and compliance audits.
  • Exceptional communicator: Skilled at engaging technical teams, executives, and global enterprise clients across multiple time zones.
 
Stack:

Orchestration & Infrastructure
  • Kubernetes
  • Argo CD
  • Terraform
  • Slurm
  • AWS / GCP / on-prem DGX

Networking & Service Mesh

  • Istio

Security & Policy

  • OPA/Rego
  • Vault

Developer Experience & CI/CD

  • Backstage
  • GitHub Actions

Observability

  • Prometheus/Grafana

Application Layer

  • Python & Go micro-services
  • FastAPI
  • gRPC
  • GraphQL

Data & Storage

  • MongoDB
  • MinIO / S3
  • Kafka
  • DuckDB


Reg. No. R1109395
BeathChapman Pte Ltd
Licence no. 16S8112