Large Model Deployment Engineer
2 weeks ago
Job Overview:
SenseTime is the world's leading artificial intelligence platform company, and we're looking for an experienced Large Model Deployment Engineer to join our team. In this role, you will work closely with our research and development team to develop and optimize large-model inference pipelines.
Responsibilities:
- Design and implement large-model inference pipelines on multi-node, multi-GPU systems.
- Collaborate with our research team to develop new techniques to enhance inference performance and reduce costs.
- Work with our engineering team to integrate the latest advancements in large-model serving into our business applications.
Qualifications:
- Extensive knowledge of large-model algorithms and underlying principles.
- Experience with large-model inference pipelines and optimization techniques.
- Proficiency in C++ and Python programming languages.
- Strong software engineering foundation and experience with design patterns.
- Excellent communication and problem-solving skills.
-
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeAI Navigator – Large Model Cloud Inference Deployment EngineerSenseTime Group LimitedChinaOn-site Part Time InternshipSkills:C++, PythonJob ResponsibilitiesOptimize the inference deployment of large models on computing clusters, focusing on multi-node, multi-GPU parallel inference, task scheduling, KV cache management, and other techniques to enhance...
-
Large Language Model Engineer
6 days ago
Hong Kong, Central and Western District, Hong Kong SAR China TCL Corporate Research(HK) Co., Ltd Full timeJoin Our TeamTCL Corporate Research (Hong Kong) Co., Ltd is a leading company in the field of artificial intelligence. We are seeking an experienced Multimodal Model Development Expert to join our team.The successful candidate will play a key role in advancing our capabilities in language models and multimodal model technologies, enhancing our product...
-
AI Cloud Engineer
2 weeks ago
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeWe are seeking a highly skilled AI Cloud Engineer to join our team at SenseTime, the world's leading artificial intelligence platform company. As a key member of our research and development team, you will be responsible for optimizing the inference deployment of large models on computing clusters.Key Responsibilities:Design and implement efficient inference...
-
Contract Large Language Model Engineer
1 day ago
Hong Kong Island, Hong Kong SAR China Argyll Scott Full timeAbout the Role:This Contract Large Language Model Engineer position is an excellent opportunity for individuals who want to work on innovative AI/ML projects. You will be responsible for designing and developing applications that utilize large language models.You will work closely with our team to leverage pre-trained models and fine-tune them on...
-
Large Language Model Developer
2 weeks ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timeAbout the JobPantheon Lab Limited is a Generative AI company specializing in digital human technologies and advanced digital assistant solutions. We are looking for a highly skilled Generative AI Specialist to join our team and help us innovate and develop hyper-realistic digital humans that enrich real-world interactions.In this role, you will design and...
-
Large Language Model Specialist
1 week ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timeWe are looking for a skilled Generative AI Developer to join our Product Development/Management team at Pantheon Lab Limited.The ideal candidate will have expertise in building large-scale end-to-end machine learning systems for APAC projects and company products related to Digital Humans and Gen AI solutions.Work closely with cross-functional AI, software...
-
Large Model Serving Architect
2 weeks ago
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeWe are seeking a highly skilled Large Model Serving Architect to join our research and development team at SenseTime. As a key member of our team, you will be responsible for designing and implementing efficient inference pipelines using mainstream large-model algorithms and optimization techniques.Key Responsibilities:Develop and deploy large-model serving...
-
Large Language Model Specialist
2 weeks ago
Hong Kong, Central and Western District, Hong Kong SAR China beNovelty Limited Full timeWe are seeking a talented AI Prompt Engineer to join our team and contribute to the development of our large language models.About the JobThis role involves working closely with our AI team to develop, fine-tune, and optimize prompts for large language models across a wide range of tasks.Key ResponsibilitiesDesigning effective prompts for large language...
-
Cloud AI Deployment Expert
2 weeks ago
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeSenseTime is looking for an exceptional Cloud AI Deployment Expert to join our research and development team. As a key member of our team, you will be responsible for designing and implementing efficient inference pipelines using mainstream large-model algorithms and optimization techniques.Responsibilities:Develop and deploy large-model serving solutions...
-
Multimodal Model Development Expert
6 days ago
Hong Kong, Central and Western District, Hong Kong SAR China TCL Corporate Research(HK) Co., Ltd Full timeAbout the JobWe are seeking an experienced AI Researcher to join our team at TCL Corporate Research (Hong Kong) Co., Ltd. The successful candidate will play a crucial role in developing advanced AI solutions and contributing to the company's growth.The ideal candidate will have a strong background in machine learning and a passion for innovation. You will be...
-
Large Language Model Specialist
4 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timePantheon Lab Limited is a leading Generative AI company specializing in digital human technologies and advanced digital assistant solutions. We are seeking a talented Large Language Model Specialist to join our research and development team.About the Role:You will be responsible for designing and implementing large language models for various...
-
Large Language Model Architect
6 days ago
Hong Kong, Central and Western District, Hong Kong SAR China SK Monde Consulting Ltd. Full timeAbout SK Monde Consulting Ltd.SK Monde Consulting Ltd. is a digital consultancy that specializes in developing innovative AI solutions for businesses. Our mission is to empower companies to thrive in the digital age by providing them with cutting-edge technology and expertise.As an AI Engineer at SK Monde Consulting Ltd., you will have the opportunity to...
-
LLM Principal Engineer
6 days ago
Hong Kong, Central and Western District, Hong Kong SAR China TCL Corporate Research(HK) Co., Ltd Full timeTCL Corporate Research (Hong Kong) Co., Limited Hong Kong SAR Full Time Associate As a LLM Engineer, you will play a pivotal role in advancing our capabilities in language models and multimodal model technologies, enhancing our product offerings, and maximizing value for the company. You will collaborate with top university teams to develop cutting-edge...
-
AI Research Engineer
2 weeks ago
Hong Kong Island, Hong Kong SAR China Venturenix Full timeAbout the RoleWe are seeking an experienced AI Research Engineer to join our global research team in Singapore.As a key member of our team, you will be responsible for designing and implementing large-scale machine learning models, leveraging your expertise in model deployment and training.You will work closely with our founders to drive innovation and...
-
AI Model Developer
4 days ago
Hong Kong, Central and Western District, Hong Kong SAR China Pantheon Lab Limited Full timeAt Pantheon Lab Limited, we are seeking a skilled AI Model Developer to join our team. As a key member of our digital human technologies department, you will play a crucial role in developing and deploying large-scale end-to-end machine learning systems.Key Responsibilities:Collaborate with cross-functional teams to integrate language models into real-world...
-
Contract GPT/BERT/T5 Model Expert
1 day ago
Hong Kong Island, Hong Kong SAR China Argyll Scott Full timeAbout the Opportunity:We are seeking a highly skilled Contract GPT/bert/t5 Model Expert to join our team at Argyll Scott. As a Contract GPT/bert/t5 Model Expert, you will design and develop cutting-edge applications using large language models.You will leverage pre-trained models and fine-tune them on domain-specific data for enhanced performance. Your...
-
Software Deployment Engineer
2 weeks ago
Hong Kong, Central and Western District, Hong Kong SAR China Gravitas Recruitment Group (Global) Ltd Full timeGravitas Recruitment Group (Global) Ltd is seeking a skilled DevOps Engineer to join their Exchange Production Team.This is an excellent opportunity for a seasoned professional looking to transition into the Crypto industry. The ideal candidate will have hands-on experience with Terraform, Kubernetes, AWS, and Shell Scripting.The team works closely with the...
-
Senior Engineer
1 week ago
Hong Kong, Central and Western District, Hong Kong SAR China Bank Of China (Hong Kong) Limited Full timeResponsibilities: Conduct research on the latest developments in AI technology, evaluate and introduce suitable AI technologies and solutions, and drive the construction of AI platforms, services, and tools. Be responsible for the planning of generative AI (language models, multimodal generation) related architectural engineering and the design and...
-
LLM Engineer- Contract- 55k P/M= Retail
2 days ago
Hong Kong Island, Hong Kong SAR China Argyll Scott Full timeDesign and develop LLM-driven applications, including chatbots with complex workflow prompts and custom tools for LLM agents.Leverage pre-trained models (e.g., GPT, BERT, T5) and fine-tune them on domain-specific data for enhanced performance.Build data pipelines for external knowledge retrieval of LLMs.Stay updated on the cutting-edge generative AI...
-
Inference Performance Expert
2 weeks ago
Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full timeAbout the Position:The Inference Performance Expert will play a crucial role in our research and development team, focusing on optimizing the inference deployment of large models on computing clusters. This individual will work closely with our engineering team to develop and implement cutting-edge techniques to enhance inference performance and reduce...