Large Model Cloud Deployment Professional

5 days ago


Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

SenseTime Group Limited is a leading AI research and development company focused on computer vision and deep learning.

We are seeking a Large Model Cloud Deployment Professional to join our team and help us optimize the inference deployment of large models on computing clusters.

Job Overview

The Large Model Cloud Deployment Professional will design and implement efficient large-model inference pipelines, and develop and maintain scalable cloud infrastructure for large-model deployment.

This includes:

  • Optimizing the inference deployment of large models on computing clusters, focusing on multi-node, multi-GPU parallel inference, task scheduling, KV cache management, and other techniques to enhance inference performance and reduce costs.
  • Researching the latest advancements in large-model serving and integrating cutting-edge techniques into real-world business applications.

This role requires strong technical skills, including expertise in large-model algorithms, cloud infrastructure, and software engineering.

What We Offer

As a member of our team, you will have the opportunity to work with a leading AI research and development company and contribute to the development of cutting-edge AI technologies.

You will also have access to a comprehensive benefits package, including opportunities for professional growth and development.



  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    We are looking for a highly skilled Cloud Inference Engineer - Large Model Expert to join our team. The ideal candidate will have a strong background in large-model inference and experience with cloud-based infrastructure. Job Description: The Cloud Inference Engineer - Large Model Expert will be responsible for optimizing the inference deployment of large...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    AI Navigator – Large Model Cloud Inference Deployment Engineer. SenseTime Group Limited, China. On-site, Part Time, Internship. Skills: C++, Python. Job Responsibilities: Optimize the inference deployment of large models on computing clusters, focusing on multi-node, multi-GPU parallel inference, task scheduling, KV cache management, and other techniques to enhance...


  • Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full time

    As a Generative AI Developer at Pantheon Lab Limited, you will be part of a dynamic team that is revolutionizing the field of digital human technologies and advanced digital assistant solutions. We are seeking an experienced professional to build large-scale end-to-end machine learning systems for APAC projects and company products related to Digital Humans...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    SenseTime Group Limited is a world-leading artificial intelligence platform company. Focused primarily on computer vision and deep learning, the company has independently developed a deep learning platform and a deep learning supercomputing center. Job Overview: We are seeking an experienced AI Infrastructure Deployment Specialist to optimize the inference...


  • Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full time

    About the Role: We are seeking a talented Generative AI Developer to join our team at Pantheon Lab Limited and contribute to the development of digital human technologies and advanced digital assistant solutions. Key Responsibilities: Develop and deploy large-scale end-to-end machine learning systems for APAC projects and company products related to...


  • Hong Kong, Central and Western District, Hong Kong SAR China RAISOUND (HONGKONG) CO., LIMITED Full time

    Job Overview: We are seeking a highly skilled Senior Large Model Researcher to join our R&D team in Hong Kong. The ideal candidate will have extensive experience in developing and deploying large models, particularly in areas such as multi-round dialogue, question answering technology, and knowledge systems. Responsibilities: Develop and implement large models...


  • Hong Kong, Central and Western District, Hong Kong SAR China beNovelty Limited Full time

    We are looking for a talented Large Language Model Designer to join our team at beNovelty Limited. In this role, you will be responsible for designing effective prompts for large language models to solve complex real-world problems. You will work closely with our AI and product teams to develop, fine-tune, and optimize prompts for large language models...


  • Hong Kong Island, Hong Kong SAR China Alibaba Cloud Full time

    Job Description: To succeed in this role, you will need to have a deep understanding of large enterprise business scenarios and be able to collaborate with IT teams to analyze existing IT architecture. Your goal will be to develop comprehensive IT plans and promote the use of Alibaba Cloud services. You will be responsible for helping large enterprises develop...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    We are seeking an Inference Engine Optimization Lead to join our team and help us optimize the inference deployment of large models on computing clusters. The ideal candidate will have a deep understanding of mainstream large-model algorithms and underlying principles, as well as experience with large-model inference engines and frameworks. Job Description: The...


  • Hong Kong, Central and Western District, Hong Kong SAR China Alibaba Cloud Full time

    Responsibilities: Work closely with our expert virtual team to prepare presentation materials, demonstrations, and proofs-of-concept for our clients. Meet our clients together with our Business Development Manager to present/demonstrate the solutions that you designed. Participate in different marketing events, such as Cloud Conference and FinTech Week in Hong...


  • Hong Kong Island, Hong Kong SAR China Alibaba Cloud Full time

    Key Responsibilities: Analyze customer IT architecture and develop comprehensive IT plans. Develop IT architecture and business processes for large enterprises, including best practices and exception handling mechanisms. Promote the use of Alibaba Cloud services and improve problem handling mechanisms and processes. Work with Alibaba Cloud service experts and...

  • AI Model Developer

    3 days ago


    Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full time

    Pantheon Lab Limited is a leading Generative AI company specializing in digital human technologies and advanced digital assistant solutions. We are seeking an experienced Generative AI Developer to join our team and contribute to the development of large-scale end-to-end machine learning systems for APAC projects and company products related to Digital...


  • Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full time

    About Us: Pantheon Lab Limited is a pioneering company specializing in digital human technologies and advanced digital assistant solutions. Our mission is to innovate and develop hyper-realistic digital humans that enrich real-world interactions. We have successfully expanded our presence in key markets and are committed to excellence and innovation. Job...

  • DevOps Engineer

    3 days ago


    Hong Kong, Central and Western District, Hong Kong SAR China Bilgeadamtechnologies Full time

    Bilgeadamtechnologies Overview: Bilgeadamtechnologies is a rapidly growing technology company that specializes in providing innovative solutions to businesses worldwide. Our mission is to empower organizations to achieve their goals through the effective use of technology. We believe in fostering a collaborative and inclusive work environment that encourages...


  • Hong Kong, Central and Western District, Hong Kong SAR China Gravitas Recruitment Group (Global) Ltd Full time

    We are looking for a Cloud DevOps Professional to join our Exchange Production Team. As a key member of the team, you will work closely with the development team to design, build, and deploy scalable and secure cloud-based systems. Your primary focus will be on ensuring the smooth operation of our trading platform, which includes facilitating deployments and...


  • Hong Kong Island, Hong Kong SAR China Alibaba Cloud Full time

    2 days ago. Job Description: To understand the business scenarios of large enterprises, cooperate with the IT, application architecture and...

  • Cloud Architect

    3 days ago


    Hong Kong, Central and Western District, Hong Kong SAR China Tata Consultancy Services Full time

    Tata Consultancy Services is at the forefront of driving sustainability through technology and talent. As a leading IT services company, we're committed to embedding corporate sustainability into our operations. Our offices are designed with eco-friendly features that reduce our carbon footprint and enhance energy efficiency. We champion green initiatives,...


  • Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full time

    At Classy Wheeler Limited, we are seeking an experienced Cloud Networking Professional to join our team. Job Overview: Responsible for the overall support and maintenance of network infrastructure and security of the system. Design and implementation of network systems for cloud or virtualization infrastructure projects using network equipment with...


  • Hong Kong, Central and Western District, Hong Kong SAR China Hong Kong Genome Institute Full time

    Job Summary: We are seeking a highly skilled Senior Cloud Engineer to join our team at the Hong Kong Genome Institute. As a key member of the team, you will be responsible for designing, implementing, and managing multi-cloud infrastructure solutions across various cloud platforms. About the Role: Design and implement Infrastructure as Code (IaC) templates to...


  • Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full time

    About Us: Classy Wheeler Limited is a well-established public enterprise with a strong commitment to innovation and excellence. We are seeking a highly skilled Technical Lead (Cloud) to join our team and lead the planning, design, and development of our private cloud infrastructure and platform services. The Ideal Candidate: The successful candidate will have a...