LLM Engineer
1 week ago
Overview LLM Engineer (Data and Optimization) role at TCL Corporate Research(HK) Co., Ltd. Responsibilities Responsible for training, fine-tuning (SFT), and system deployment of vertical domain large models, promoting efficient application of large models in industrial environments. Research and implement compression and optimization techniques for large models, including pruning, quantization, and knowledge distillation, to improve inference efficiency and deployment performance. Participate in the algorithm design and development of RAG (Retrieval-Augmented Generation) and Agent modules to enhance reasoning capabilities in dynamic and complex environments. Research and apply multimodal understanding technologies to optimize the application of Large Vision Models (LVM) in industrial vision and other fields. Translate business rules into efficient workflow code and participate in the design and implementation of Agentic Workflow to enhance workflow intelligence. Build industry datasets to support large model training and applications, including data preprocessing, pretraining data construction, and training/application/evaluation dataset setup. Research and implement large model merging techniques, exploring collaborative optimization solutions for multiple models. Develop and maintain validation, evaluation, and performance monitoring processes for large models to ensure system stability and usability. Participate in the development and optimization of large model application platforms (microservices) to enhance system modularity and usability. Qualifications Master’s degree or higher in Mathematics, Electrical Engineering, Computer Science, Data Science, or a related field is preferred. Proficient in machine learning, deep learning, and Transformer architecture, with hands-on experience in end-to-end training and development of large models. Familiar with large model compression and optimization methods such as pruning, quantization, and knowledge distillation. Strong capabilities in large-scale data processing and familiarity with big data tools (e.g., Hadoop, Spark), with experience in data preprocessing, cleaning, and building training datasets. Skilled in Python, C/C++, and Linux programming, with a solid foundation in algorithms and data structures. Familiar with mainstream large model training and inference frameworks such as PyTorch, Hugging Face (HF), DeepSpeed, PEFT, vLLM, TRL, etc. Knowledge of Triton or other high-performance inference tools, with experience in applying model optimization to real-world deployments. Proficient in Docker and Linux shell scripting; experience with FastAPI development is a plus. Experience in enterprise-level large model development, optimization, deployment, and tool development is preferred. Strong teamwork and communication skills, capable of collaborating with cross-domain teams. Passionate about cutting-edge large model technologies and their applications in industrial vertical domains. Bonus Points Experience in RAG systems and Agent module development and optimization. Familiarity with CUDA programming, distributed computing, or related high-performance computing technologies. Publications in top-tier conferences (e.g., NeurIPS, ICLR, CVPR). Knowledge of hardware acceleration technologies (e.g., GPU, TPU) and their applications in model optimization. Seniority level Mid-Senior level Employment type Full-time Job function Engineering, Science, and Information Technology Industries Technology, Information and Media #J-18808-Ljbffr
-
AI Engineer
2 weeks ago
hong kong, Hong Kong SAR China FCC ANALYTICS Full timeResponsibilities Develop LLM applications for tasks within the AML context Research and develop prototype implementations of new algorithms and methodologies related to LLMs Work with product development team to uncover customer needs and aspirations Job Requirements Proficiency in programming and project collaboration tools, such as Python, PyTorch,...
-
AI Researcher/ Engineer
7 days ago
hong kong, Hong Kong SAR China Pantheon Lab Limited Full timeOverview We are seeking an experienced AI Researcher/Engineer (LLM) to manage the design, deployment, and optimization of production‑grade language model systems. This role involves building applications using both commercial LLM APIs and self‑hosted open‑source models, implementing RAG pipelines, and creating end‑to‑end LLM workflows. The ideal...
-
AI Research Engineer: Production LLMs
7 days ago
hong kong, Hong Kong SAR China Pantheon Lab Limited Full timeA prominent AI research firm in Hong Kong is seeking an AI Researcher/Engineer to manage LLM systems. Responsibilities include designing high-throughput architectures, deploying models, and collaborating with teams. Applicants should have 2+ years in software engineering with a focus on LLM applications, combining technical and practical experience. This...
-
LLM Algorithm Engineer
1 week ago
Hong Kong Island, Hong Kong SAR China CoinMarketCap Full timeOverview Join to apply for the LLM Algorithm Engineer role at CoinMarketCap . Responsibilities Advanced post-training of large language models (e.g. SFT, RLHF/RLAIF, continual pretraining) Aligning models for reliable JSON-schema function calls and external tool usage Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint...
-
hong kong, Hong Kong SAR China Technology Excellence Technical Lead Engineer Full timeTechnology Excellence Technical Lead Engineer Technology Excellence Technical Lead Engineer View all jobs Add expected salary to your profile for insights Main Content Technology Excellence Technical Lead / Engineer, Information Technology Group Responsibilities Agile Project Leadership Act as Scrum Master: Facilitate Agile ceremonies, manage backlogs, and...
-
Senior Data Scientist
2 weeks ago
hong kong, Hong Kong SAR China Binance Full timeSenior Data Scientist / Analyst (AI/LLM) Join to apply for the Senior Data Scientist / Analyst (AI/LLM) role at Binance Binance is the leading global blockchain ecosystem and cryptocurrency infrastructure provider whose suite of financial products includes the world’s largest digital-asset exchange. Our mission is to accelerate cryptocurrency adoption and...
-
Business Development Manager
2 weeks ago
Hong Kong Island, Hong Kong SAR China Fortinet Full timeBusiness Development Manager (AIDC/LLM) – Hong Kong Job Description Location: Hong Kong Join Fortinet, a cybersecurity pioneer with over two decades of excellence, as we continue to shape the future of cybersecurity and redefine the intersection of networking and security. At Fortinet, our mission is to safeguard people, devices, and data everywhere. We...
-
Business Development Manager
3 days ago
Hong Kong Island, Hong Kong SAR China Fortinet, Inc. Full timeLocation: Hong Kong Join Fortinet, a cybersecurity pioneer with over two decades of excellence, as we continue to shape the future of cybersecurity and redefine the intersection of networking and security. At Fortinet, our mission is to safeguard people, devices, and data everywhere. We are currently seeking a dynamic Business Development Manager (AIDC/LLM)...
-
Senior Data Analyst
2 days ago
hong kong, Hong Kong SAR China Binance Full timeSenior Data Analyst – Recommendation, Feeds, Growth (AI/LLM) Join to apply for the Senior Data Analyst – Recommendation, Feeds, Growth (AI/LLM) role at Binance . Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+...
-
AI Engineer
5 days ago
hong kong, Hong Kong SAR China MegazoneCloud Full timeMEGAZONE CLOUD is looking for a talented and hands-on AI Engineer with a strong focus on Generative AI (GenAI) to join our growing technical team. In this role, you will be responsible for designing, building, deploying, and maintaining advanced AI solutions powered by Large Language Models (LLMs). You will work with cutting‑edge frameworks to build...