Senior Cloud Operation Engineer

4 days ago


Sha Tin, Hong Kong SAR China ASK IT LIMITED Full time

Log System Construction: Responsible for log collection on Tencent Cloud CVM/TKE/managed services (deploying Agents, configuring exports), achieving CLS storage, COS automatic archiving, and DLC log visualization to ensure data security and retrievability. Metrics Monitoring Implementation: Identify core metrics, deploy Exporter/Agent to integrate CVM/TKE metrics into Prometheus, configure custom Grafana dashboards, define metric alert rules, and integrate with CSF for notifications. APM Tracing and Alerting: Deploy Agent/SDK to collect APM trace data from CVM/TKE into TCOP APM, configure APM alerts and link with CSF to ensure timely response to trace anomalies. Observability Integration and Alerting System: Configure TCOP to achieve “trace - log - metric” linkage; deploy CKafka in a centralized account to receive multi-account CSF alerts, use connectors to invoke Helix API for push notifications, and develop alert visualization dashboards; monitor CLS/TCOP/CKafka status and configure anomaly alerts. Automation Development: Prioritize Terraform for automating configuration of all observability components, supplement with CLI scripts or manual operation documentation to ensure standardized operations and maintenance. Knowledge Transfer: Customize training courses (for DCO/application teams) and conduct hands‑on training; compile architecture, operations, and troubleshooting documentation, collect feedback, and optimize materials. Requirements: Cloud Experience: 3+ years of hands‑on experience with Tencent Cloud (preferred) or AWS/Aliyun, with in‑depth understanding of the principles and configuration of products such as CVM, TKE, CLS, Prometheus, Grafana, COS, DLC, TCOP APM, CKafka, CSF, etc. Proficiency in Kubernetes (K8s) is a must . Observability Capability: Complete experience in building a “logs + metrics + trace” observability system; familiar with core logic of log collection (Agent deployment/configuration), metrics monitoring (Exporter selection), and APM trace capture (SDK/Agent integration). Automation Technology: Proficient in Terraform script development, capable of automating cloud resources and observability configurations via IaC; familiar with Shell/Python scripting languages to supplement automation scenarios. Alerting and Integration Experience: Practical experience in alert system design (threshold definition, notification channel integration) and cross‑component data linkage (e.g., trace‑log association); able to resolve issues such as alert storms and data gaps. Documentation and Communication: Excellent documentation skills (able to independently produce operations guides and training materials); capable of efficient collaboration with DCO and application teams, clearly communicating technical solutions. Hold Tencent Cloud advanced certifications (e.g., TCP/TEP), cloud‑native certifications (CKA/CKAD), or observability‑related certifications. Experience in building observability for large‑scale cloud environments (1,000+ CVM/TKE nodes) or handling collection and optimization of billion‑level logs/metrics. Familiar with Helix API invocation, Kafka connector development, or customized DLC log visualization experience. Experience in technical training/knowledge transfer, having led cross‑team technical empowerment projects. ***Permanent Hong Kong Resident is preferred. Expected Salary in CV is needed for consideration*** All information provided will be treated in strict confidence and used solely for recruitment purposes. The resume will be retained for a period of two years for future recruitment purposes within our group and clients. #J-18808-Ljbffr



  • Sha Tin District, Hong Kong SAR China Ultra High Point Limited Full time

    Overview Ultra High Point Limited (UHP), is a young and energetic Healthcare IT Solutions cooperation, located in Hong Kong Science Park, providing smart healthcare and IoT solutions in Hong Kong and China regions. Our mission is to create the most effective and professional high-end healthcare IT solutions to the market. We aim to integrate our healthcare...


  • Sha Tin, Hong Kong SAR China ASK IT LIMITED Full time

    A tech solutions provider in Hong Kong is looking for an expert in cloud observability systems. Candidates should have 3+ years of experience with Tencent Cloud or similar, proficiency in Kubernetes, and strong automation skills using Terraform. Successful applicants will be responsible for metrics monitoring, APM trace data collection, and automation of...


  • Sha Tin, Hong Kong SAR China Bank of China (Hong Kong) Limited Full time

    A leading bank in Hong Kong is seeking a Senior/System Administration Manager to oversee their Cloud infrastructure. The role requires managing and monitoring Cloud environments, troubleshooting issues, and implementing automation tools. Candidates should hold a degree in Computer Science and have over 2 years of relevant experience, including knowledge of...


  • Sha Tin, Hong Kong SAR China InfoTech Services (Hong Kong) Limited Full time

    A leading IT services company in Hong Kong is looking for a Contract Senior System Administrator to oversee its Cloud infrastructure. The role requires expertise in Cloud system administration, excellent troubleshooting skills, and familiarity with technologies like OpenShift and Ansible. The ideal candidate should have at least 2 years of relevant...


  • Sha Tin, Hong Kong SAR China InfoTech Services (Hong Kong) Limited Full time

    Job Title/ Category Contract Senior System Administrator/System Administrator - Cloud, Container, OpenShift Number Of Vacancy 1 Relevant Field Technical Support Nature Contract Payroll under InfoTech Employer Business note-issuing bank Location Base Fo Tan Work Outside Current Location N/A Monthly Salary Range HK$ N/A - N/A Duties Serve a contract assignment...

  • Senior Cloud

    4 days ago


    Sha Tin District, Hong Kong SAR China Ultra High Point Limited Full time

    A healthcare IT solutions company in Hong Kong is seeking a skilled IT Infrastructure Manager. The role involves managing and coordinating IT infrastructure, cloud system planning, and providing system support. Candidates should have at least 5 years of experience in large-scale IT infrastructure and be proficient in managing various technologies including...


  • Sha Tin District, Hong Kong SAR China Clustertech Limited Full time

    A leading I.T. consultancy provider in Hong Kong is seeking a Systems Engineer to explore new technologies and provide support for HPC and cloud computing solutions. Candidates should have a Bachelor's degree in Computer Science and 2 years of relevant experience, along with proficiency in English and Cantonese. This role involves system design,...


  • Sha Tin District, Hong Kong SAR China Orient Overseas Container Line Ltd (OOCL) Full time

    A leading container shipping company in Hong Kong seeks a DevOps Engineer to manage database platforms and enhance cloud services. The ideal candidate will drive automation and implement Infrastructure as Code solutions, leveraging tools like Terraform and Ansible. Applicants should have a degree in IT, solid experience with CI/CD pipelines, and proficiency...

  • Senior Manager

    4 days ago


    Sha Tin District, Hong Kong SAR China SmartHire by SEEK Full time

    An established recruitment firm is seeking a Senior Manager (Infrastructure & Security) to lead architecture, deployment, and management of cloud environments in Hong Kong. The ideal candidate will have a Bachelor's degree in IT with over 10 years of managerial experience, hands-on expertise with cloud platforms (AWS, Azure, Google Cloud), and relevant...


  • Tin Shui Wai, Hong Kong SAR China Oracle Systems Hong Kong Ltd Full time

    The Cloud Support Manager/Director is a critical leader who acts as the customer's champion within Oracle Cloud Engineering Organization. This role commands the end-to-end resolution process for high-priority, technically complex customer issues related to OCI that cannot be resolved at the customer-facing team. You will serve as the nerve center for crisis...