Inference Performance Expert

1 week ago


Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

About the Position:

The Inference Performance Expert will play a crucial role in our research and development team, focusing on optimizing the inference deployment of large models on computing clusters. This individual will work closely with our engineering team to develop and implement cutting-edge techniques to enhance inference performance and reduce costs.

Key Responsibilities:

  • Develop and implement novel techniques to optimize large-model inference performance on multi-node, multi-GPU systems.
  • Collaborate with our research team to integrate the latest advancements in large-model serving into our business applications.
  • Work with our engineering team to develop and optimize large-model inference pipelines.

Requirements:

  • Advanced degree in Computer Science, related field, or equivalent experience.
  • Extensive knowledge of large-model algorithms and underlying principles.
  • Experience with large-model inference pipelines and optimization techniques.
  • Proficiency in C++ and Python programming languages.
  • Strong software engineering foundation and experience with design patterns.


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    Inference Performance Optimizer needed! Join our research and development team at SenseTime, the world's leading artificial intelligence platform company, as we strive to push the boundaries of AI technology. As a key member of our team, you will be responsible for optimizing the inference deployment of large models on computing clusters.Key...

  • AI Cloud Engineer

    2 weeks ago


    Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    We are seeking a highly skilled AI Cloud Engineer to join our team at SenseTime, the world's leading artificial intelligence platform company. As a key member of our research and development team, you will be responsible for optimizing the inference deployment of large models on computing clusters.Key Responsibilities:Design and implement efficient inference...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    AI Navigator – Large Model Cloud Inference Deployment EngineerSenseTime Group LimitedChinaOn-site Part Time InternshipSkills:C++, PythonJob ResponsibilitiesOptimize the inference deployment of large models on computing clusters, focusing on multi-node, multi-GPU parallel inference, task scheduling, KV cache management, and other techniques to enhance...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    SenseTime is looking for an exceptional Cloud AI Deployment Expert to join our research and development team. As a key member of our team, you will be responsible for designing and implementing efficient inference pipelines using mainstream large-model algorithms and optimization techniques.Responsibilities:Develop and deploy large-model serving solutions...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    About the Role:We are seeking a skilled Cloud AI Optimization Specialist to join our team at SenseTime. As a key member of our research and development team, you will be responsible for optimizing the inference deployment of large models on computing clusters.Key Responsibilities:Optimize large-model inference performance on multi-node, multi-GPU parallel...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    Job Overview:SenseTime is the world's leading artificial intelligence platform company, and we're looking for an experienced Large Model Deployment Engineer to join our team. In this role, you will work closely with our research and development team to develop and optimize large-model inference pipelines.Responsibilities:Design and implement large-model...


  • Hong Kong Island, Hong Kong SAR China Ultimate Performance Full time

    Join our vibrant team at Ultimate Performance as a Sales and Operations Manager in Hong Kong! In this challenging yet rewarding role, you'll be responsible for driving business growth, optimizing operations, and delivering exceptional client experiences.About the Position:Responsible for driving business growth through effective sales strategies and client...

  • Fitness Coach

    6 days ago


    Hong Kong, Central and Western District, Hong Kong SAR China Ultimate Performance Full time

    About Ultimate PerformanceUltimate Performance (UP) is a dynamic and innovative company that specializes in providing top-notch fitness solutions. Our team of expert trainers and coaches are dedicated to helping individuals achieve their wellness goals and reach their full potential.We take pride in creating a supportive and motivating environment where our...


  • Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full time

    We are looking for a Performance Marketing Expert to join our team at Classy Wheeler Limited.The ideal candidate will have experience in digital marketing, particularly in SEO, SEM, Facebook and affiliate marketing. They will assist in developing and implementing a comprehensive digital marketing strategy, including SEO, SEM, Facebook and affiliate...


  • Hong Kong Island, Hong Kong SAR China SenseTime 商汤科技 Full time

    We are seeking a highly skilled Large Model Serving Architect to join our research and development team at SenseTime. As a key member of our team, you will be responsible for designing and implementing efficient inference pipelines using mainstream large-model algorithms and optimization techniques.Key Responsibilities:Develop and deploy large-model serving...


  • Hong Kong Island, Hong Kong SAR China Venturenix Full time

    Machine Learning Engineer (3 roles) | Relocation opportunity to Singapore | USD 100K - USD 130K per annumDirect message the job poster from Venturenix.About the CompanyOur client is a fast-growing, well-funded startup with a mission to make a significant impact in the world through AI based in Singapore. The founders of this venture have a proven track...


  • Hong Kong, Central and Western District, Hong Kong SAR China GUM | Your MPF & EB Expert | Hong Kong Full time

    About GUM | Your MPF & EB Expert | Hong KongGUM, a pioneering boutique consulting firm in the Health and Wealth Industry, has re-started our brands as an expert in providing MPF solutions and financial consulting services for over 42 years of market experience. We are dedicated to giving you more inspiration, wealth, and happiness.Fast-Learning EnvironmentWe...


  • Hong Kong Island, Hong Kong SAR China Venturenix Full time

    About the JobWe are looking for a highly skilled Senior Machine Learning Specialist to join our team in Singapore.As a senior member of our team, you will be responsible for developing and deploying large-scale machine learning models, leveraging your expertise in model deployment and training.You will work closely with our founders to drive innovation and...


  • Hong Kong Island, Hong Kong SAR China Venturenix Full time

    About the PositionWe are seeking a highly skilled Full Stack ML Developer to join our team in Singapore.As a key member of our team, you will be responsible for designing and implementing large-scale machine learning models, leveraging your expertise in model deployment and training.You will work closely with our founders to drive innovation and develop...

  • AI Research Engineer

    2 weeks ago


    Hong Kong Island, Hong Kong SAR China Venturenix Full time

    About the RoleWe are seeking an experienced AI Research Engineer to join our global research team in Singapore.As a key member of our team, you will be responsible for designing and implementing large-scale machine learning models, leveraging your expertise in model deployment and training.You will work closely with our founders to drive innovation and...

  • Cloud AI Architect

    2 weeks ago


    Hong Kong Island, Hong Kong SAR China Venturenix Full time

    About the Job DescriptionWe are looking for a highly skilled Cloud AI Architect to join our team in Singapore.As a senior member of our team, you will be responsible for designing and implementing large-scale machine learning models, leveraging your expertise in model deployment and training.You will work closely with our founders to drive innovation and...


  • Hong Kong, Central and Western District, Hong Kong SAR China ConnectedGroup Full time

    Driving Business Growth through Performance OptimizationWe are seeking an experienced Senior Manager, FP&A to drive business growth through performance optimization. As a key member of our finance department, you will be responsible for analyzing business performance, identifying areas for improvement, and developing strategies to enhance business outcomes....


  • Hong Kong Island, Hong Kong SAR China Adaptive Frontier Full time

    About the OpportunityJoin our team at Adaptive Frontier as a High Performance Computing Developer and take on the challenge of designing and optimizing high-speed trading systems. You will work closely with experienced engineers to develop innovative solutions and push the boundaries of what is possible in this exciting field.What We OfferWe provide a...


  • Hong Kong, Central and Western District, Hong Kong SAR China GroupM Full time

    About GroupMGroupM is the world's largest media investment company and is part of WPP. We are responsible for one in every three ads you see globally. Our vision is to be the industry leader in performance marketing, driving innovation and effectiveness across all forms of media.We are currently looking for a talented individual to join our team as a Digital...


  • Hong Kong Island, Hong Kong SAR China Classy Wheeler Limited Full time

    Classy Wheeler Limited is hiring!We're on the lookout for a High-Performance Database Specialist to join our dynamic IT team. As a key contributor to our database strategy, you'll design, implement, and maintain high-performance database systems that drive business success.About the Job:Design and implement database systems that meet the evolving needs of...