Cloud Inference Engineer

5 days ago


Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

We are looking for a highly skilled Cloud Inference Engineer - Large Model Expert to join our team.

The ideal candidate will have a strong background in large-model inference and experience with cloud-based infrastructure.

Job Description

The Cloud Inference Engineer - Large Model Expert will be responsible for optimizing the inference deployment of large models on computing clusters.

This includes:

  • Designing and implementing efficient large-model inference pipelines.
  • Developing and maintaining scalable cloud infrastructure for large-model deployment.
  • Collaborating with cross-functional teams to integrate large-model serving capabilities into various applications.

This role requires strong technical skills, including expertise in large-model algorithms, cloud infrastructure, and software engineering.

Requirements

To be successful in this role, you will need:

  • Strong knowledge of large-model inference engines and frameworks.
  • Experience with cloud-based infrastructure and deployment tools.
  • Proficiency in programming languages such as C++ and Python.


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    We are seeking an Inference Engine Optimization Lead to join our team and help us optimize the inference deployment of large models on computing clusters.The ideal candidate will have a deep understanding of mainstream large-model algorithms and underlying principles, as well as experience with large-model inference engines and frameworks.Job DescriptionThe...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    AI Navigator – Large Model Cloud Inference Deployment EngineerSenseTime Group LimitedChinaOn-site Part Time InternshipSkills:C++, PythonJob ResponsibilitiesOptimize the inference deployment of large models on computing clusters, focusing on multi-node, multi-GPU parallel inference, task scheduling, KV cache management, and other techniques to enhance...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    SenseTime Group Limited is a leading AI research and development company focused on computer vision and deep learning.We are seeking a Large Model Cloud Deployment Professional to join our team and help us optimize the inference deployment of large models on computing clusters.Job OverviewThe Large Model Cloud Deployment Professional will be responsible for...


  • Hong Kong, Central and Western District, Hong Kong SAR China Alibaba Cloud Full time

    Alibaba Cloud is seeking an experienced Technical Account Manager to join our team. As a Technical Account Manager, you will be responsible for serving as the primary technical point of contact for our enterprise customers.Your primary goal will be to understand customer business goals, technical challenges, and cloud adoption roadmap. You will develop...


  • Hong Kong, Central and Western District, Hong Kong SAR China Alibaba Cloud Full time

    As a Technical Account Manager at Alibaba Cloud, you will be the trusted technical advisor and advocate for our enterprise customers. Your primary goal will be to ensure they maximize the value of Alibaba Cloud services and Generative AI solutions.Bridge the gap between business objectives and technical execution by serving as the primary technical point of...


  • Hong Kong, Central and Western District, Hong Kong SAR China VisionMatrix Technology Limited Full time

    Responsibilities: 1. Infrastructure Management Design scalable cloud infrastructureConfigure distributed computing resourcesOptimize GPU/compute allocationManage AWS services (SageMaker, EC2)Implement cost-effective training solutions2. Training Pipeline Development Create end-to-end ML workflowsImplement CI/CD for machine learningDevelop automated training...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    SenseTime Group Limited is a world-leading artificial intelligence platform company.Focused primarily on computer vision and deep learning, the company has independently developed a deep learning platform and a deep learning supercomputing center.Job OverviewWe are seeking an experienced AI Infrastructure Deployment Specialist to optimize the inference...


  • Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full time

    Job Description:We are seeking a talented Cloud Applications Engineer to join our team at Classy Wheeler Limited. As a Senior Software Engineer, you will be responsible for designing and implementing cloud-based applications using the latest technologies.Key Responsibilities:Developing scalable and secure cloud applications using NodeJS, React, and related...

  • Cloud engineer

    3 weeks ago


    Hong Kong, Central and Western District, Hong Kong SAR China FastLaneRecruit Full time

    Job Brief We are looking for a talented and proactive Cloud Engineer to join a dynamic team in a fast-paced environment. In this role, you will collaborate with engineering and development teams to design, develop, and deploy modular cloud-based systems. You will also ensure secure data storage and processing while providing cloud support and recommendations...


  • Hong Kong Island, Hong Kong SAR China Hong Kong Genome Institute Full time

    On-site Contract 5 year or above Mid-Senior levelSenior Cloud EngineerAs HKGI advances in innovative cloud technologies, a remarkable opportunity has emerged for a Senior Cloud Engineer to join our Infrastructure team. The incumbent will assume the following responsibilities:Key Responsibilities:Design, implement, and manage multi-cloud infrastructure...


  • Hong Kong Island, Hong Kong SAR China Sanderson-iKas Hong Kong Full time

    Sanderson-iKas Hong Kong is expanding its Cloud Infrastructure team and seeking a talented Infrastructure Engineer - Cloud.As a key member of our team, you will design and build cloud-based services and solutions that address diverse business needs.Closely collaborating with various teams, including trading and research, you will promote the adoption of...


  • Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full time

    Company OverviewClassy Wheeler Limited is a leading provider of cybersecurity, cloud services, and modern infrastructure solutions. Our company empowers businesses with advanced IT strategies for enhanced security and efficiency.Job DescriptionWe are seeking a talented Hybrid Cloud Engineer to join our team and contribute to the design and implementation of...

  • Cloud Engineer

    3 days ago


    Hong Kong, Central and Western District, Hong Kong SAR China VisionMatrix Technology Limited Full time

    About the RoleAs a Cloud Engineer at VisionMatrix Technology Limited, you will play a crucial part in designing and implementing scalable cloud infrastructure to support our cutting-edge video search solutions. Your expertise in containerization technologies (Kubernetes, Docker) and Infrastructure as Code (Terraform, CloudFormation) will enable us to...

  • Cloud Engineer

    1 day ago


    Hong Kong, Central and Western District, Hong Kong SAR China Purview Asia Pacific Full time

    The Cloud Identity and Access Management team is responsible for enabling the public cloud to become a preferred platform across client's IT structure. This is a global, multi-discipline team responsible for architecting and delivering secure, robust, and innovative solutions which would enable the development teams to build and deploy new applications as...


  • Hong Kong, Central and Western District, Hong Kong SAR China PrimePeak Group Full time

    Cloud Engineering Lead for Financial ServicesWe are looking for an experienced C# Engineer to lead our cloud engineering efforts at PrimePeak Group. As a key member of our investment firm, you will be responsible for designing and implementing cloud-native financial services.Key Responsibilities:Developing and deploying cloud-native financial...


  • Hong Kong Island, Hong Kong SAR China Sanderson-iKas Hong Kong Full time

    We are looking for a Senior Cloud Systems Engineer to join our Cloud Infrastructure team at Sanderson-iKas Hong Kong.As a seasoned professional, you will design and build cloud-based services and solutions that address diverse business needs.Closely collaborating with various teams, including trading and research, you will promote the adoption of cloud...


  • Hong Kong, Central and Western District, Hong Kong SAR China Hong Kong Genome Institute Full time

    On-site Contract 5 year or above Mid-Senior level Senior Cloud EngineerAs HKGI advances in innovative cloud technologies, a remarkable opportunity has emerged for a Senior Cloud Engineer to join our Infrastructure team. The incumbent will assume the following responsibilities:Key Responsibilities: Design, implement, and manage multi-cloud infrastructure...


  • Hong Kong, Central and Western District, Hong Kong SAR China ACCA Careers Full time

    About the RoleWe are looking for a talented Cloud Systems Engineer to join our infrastructure team at ACCA Careers. As an Infrastructure Senior Engineer/Assistant Manager, you will play a key role in designing, building, and maintaining our cloud infrastructure.Your Key ResponsibilitiesDesign and implement cloud-based infrastructure solutions to meet...


  • Hong Kong, Central and Western District, Hong Kong SAR China Google Full time

    Apply info_outline info_outline X Info Google welcomes people with disabilities. Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Hong Kong; Taipei, Taiwan . Minimum Qualifications: Bachelor's degree in Computer Science, Engineering, or equivalent practical experience. 10 years of...


  • Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full time

    Join Our TeamWe are looking for a talented and motivated Senior Software Engineer (Cloud & IoT) to join our team. As a Senior Software Engineer (Cloud & IoT), you will be responsible for designing and developing cutting-edge Cloud and IoT solutions that meet the needs of our clients. You will work closely with our development team to ensure that our...