Large Model Cloud Deployment Professional
5 days ago
SenseTime Group Limited is a leading AI research and development company focused on computer vision and deep learning.
We are seeking a Large Model Cloud Deployment Professional to join our team and help us optimize the inference deployment of large models on computing clusters.
Job OverviewThe Large Model Cloud Deployment Professional will be responsible for designing and implementing efficient large-model inference pipelines and developing and maintaining scalable cloud infrastructure for large-model deployment.
This includes:
- Optimizing the inference deployment of large models on computing clusters, focusing on multi-node, multi-GPU parallel inference, task scheduling, KV cache management, and other techniques to enhance inference performance and reduce costs.
- Researching the latest advancements in large-model serving and integrating cutting-edge techniques into real-world business applications.
This role requires strong technical skills, including expertise in large-model algorithms, cloud infrastructure, and software engineering.
What We OfferAs a member of our team, you will have the opportunity to work with a leading AI research and development company and contribute to the development of cutting-edge AI technologies.
You will also have access to a comprehensive benefits package, including opportunities for professional growth and development.
-
Cloud Inference Engineer
5 days ago
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeWe are looking for a highly skilled Cloud Inference Engineer - Large Model Expert to join our team.The ideal candidate will have a strong background in large-model inference and experience with cloud-based infrastructure.Job DescriptionThe Cloud Inference Engineer - Large Model Expert will be responsible for optimizing the inference deployment of large...
-
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeAI Navigator – Large Model Cloud Inference Deployment EngineerSenseTime Group LimitedChinaOn-site Part Time InternshipSkills:C++, PythonJob ResponsibilitiesOptimize the inference deployment of large models on computing clusters, focusing on multi-node, multi-GPU parallel inference, task scheduling, KV cache management, and other techniques to enhance...
-
Large Language Model Engineer
3 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timeAs a Generative AI Developer at Pantheon Lab Limited, you will be part of a dynamic team that is revolutionizing the field of digital human technologies and advanced digital assistant solutions. We are seeking an experienced professional to build large-scale end-to-end machine learning systems for APAC projects and company products related to Digital Humans...
-
AI Infrastructure Deployment Specialist
5 days ago
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeSenseTime Group Limited is a world-leading artificial intelligence platform company.Focused primarily on computer vision and deep learning, the company has independently developed a deep learning platform and a deep learning supercomputing center.Job OverviewWe are seeking an experienced AI Infrastructure Deployment Specialist to optimize the inference...
-
Large Language Model Specialist
4 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timeAbout the Role:">We are seeking a talented Generative AI Developer to join our team at Pantheon Lab Limited and contribute to the development of digital human technologies and advanced digital assistant solutions.">Key Responsibilities:">">Develop and deploy large-scale end-to-end machine learning systems for APAC projects and company products related to...
-
Senior Large Model Researcher
3 days ago
Hong Kong, Central and Western District, Hong Kong SAR China RAISOUND (HONGKONG) CO., LIMITED Full timeJob OverviewWe are seeking a highly skilled Senior Large Model Researcher to join our R&D team in Hong Kong. The ideal candidate will have extensive experience in developing and deploying large models, particularly in areas such as multi-round dialogue, question answering technology, and knowledge systems.ResponsibilitiesDevelop and implement large models...
-
Large Language Model Designer
3 days ago
Hong Kong, Central and Western District, Hong Kong SAR China beNovelty Limited Full timeWe are looking for a talented Large Language Model Designer to join our team at beNovelty Limited. In this role, you will be responsible for designing effective prompts for large language models to solve complex real-world problems. You will work closely with our AI and product teams to develop, fine-tune, and optimize prompts for large language models...
-
Cloud Computing Professional
7 days ago
Hong Kong Island, Hong Kong SAR China Alibaba Cloud Full timeJob Description:To succeed in this role, you will need to have a deep understanding of large enterprise business scenarios and be able to collaborate with IT teams to analyze existing IT architecture. Your goal will be to develop comprehensive IT plans and promote the use of Alibaba Cloud services.You will be responsible for helping large enterprises develop...
-
Inference Engine Optimization Lead
5 days ago
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeWe are seeking an Inference Engine Optimization Lead to join our team and help us optimize the inference deployment of large models on computing clusters.The ideal candidate will have a deep understanding of mainstream large-model algorithms and underlying principles, as well as experience with large-model inference engines and frameworks.Job DescriptionThe...
-
Senior Cloud IT Professional
1 day ago
Hong Kong, Central and Western District, Hong Kong SAR China Alibaba Cloud Full timeResponsibilitiesWork closely with our expert virtual team to prepare presentation materials, demonstrations, and proofs-of-concept for our clients.Meet our clients together with our Business Development Manager to present/demonstrate the solutions that you designed.Participate in different marketing events, such as Cloud Conference and FinTech Week in Hong...
-
Senior IT Consultant
7 days ago
Hong Kong Island, Hong Kong SAR China Alibaba Cloud Full timeKey ResponsibilitiesAnalyze customer IT architecture and develop comprehensive IT plansDevelop IT architecture and business processes for large enterprises, including best practices and exception handling mechanismsPromote the use of Alibaba Cloud services and improve problem handling mechanisms and processesWork with Alibaba Cloud service experts and...
-
AI Model Developer
3 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timePantheon Lab Limited is a leading Generative AI company specializing in digital human technologies and advanced digital assistant solutions. We are seeking an experienced Generative AI Developer to join our team and contribute to the development of large-scale end-to-end machine learning systems for APAC projects and company products related to Digital...
-
Language Model Developer
6 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timeAbout UsPantheon Lab Limited is a pioneering company specializing in digital human technologies and advanced digital assistant solutions. Our mission is to innovate and develop hyper-realistic digital humans that enrich real-world interactions. We have successfully expanded our presence in key markets and are committed to excellence and innovation.Job...
-
DevOps Engineer
3 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Bilgeadamtechnologies Full timeBilgeadamtechnologies OverviewBilgeadamtechnologies is a rapidly growing technology company that specializes in providing innovative solutions to businesses worldwide. Our mission is to empower organizations to achieve their goals through the effective use of technology. We believe in fostering a collaborative and inclusive work environment that encourages...
-
Cloud DevOps Professional
3 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Gravitas Recruitment Group (Global) Ltd Full timeWe are looking for a Cloud DevOps Professional to join our Exchange Production Team.As a key member of the team, you will work closely with the development team to design, build, and deploy scalable and secure cloud-based systems.Your primary focus will be on ensuring the smooth operation of our trading platform, which includes facilitating deployments and...
-
Technical Account Manager-Hong Kong SAR
7 days ago
Hong Kong Island, Hong Kong SAR China Alibaba Cloud Full time2 days ago Be among the first 25 applicantsDirect message the job poster from Alibaba CloudCloud talents specialization | GBA specialization | trilingual (Cantonese | English | Mandarin) | Talent Sourcing, Recruitment SolutionJob Description:To understand the business scenarios of large enterprises, cooperate with the IT, application architecture and...
-
Cloud Architect
3 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Tata Consultancy Services Full timeTata Consultancy Services is at the forefront of driving sustainability through technology and talent. As a leading IT services company, we're committed to embedding corporate sustainability into our operations.Our offices are designed with eco-friendly features that reduce our carbon footprint and enhance energy efficiency. We champion green initiatives,...
-
Cloud Networking Professional
3 hours ago
Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full timeAt Classy Wheeler Limited, we are seeking an experienced Cloud Networking Professional to join our team.Job OverviewResponsible for the overall support and maintenance of network infrastructure and security of the system.Design and implementation of network systems for cloud or virtualization infrastructure projects using network equipment with...
-
Infrastructure Deployment Specialist
6 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Hong Kong Genome Institute Full timeJob Summary:We are seeking a highly skilled Senior Cloud Engineer to join our team at the Hong Kong Genome Institute. As a key member of the team, you will be responsible for designing, implementing, and managing multi-cloud infrastructure solutions across various cloud platforms.About the Role:Design and implement Infrastructure as Code (IaC) templates to...
-
Private Cloud Architect
3 days ago
Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full timeAbout UsClassy Wheeler Limited is a well-established public enterprise with a strong commitment to innovation and excellence. We are seeking a highly skilled Technical Lead (Cloud) to join our team and lead the planning, design, and development of our private cloud infrastructure and platform services.The Ideal CandidateThe successful candidate will have a...