More about HKUST
Qwen: Towards a Generalist Model
Speaker: Junyang Lin
Staff Engineer
and
Leader of Qwen team in Alibaba
Title: "Qwen: Towards a Generalist Model"
Date: Tuesday, 5 December 2023
Time: 4:00pm - 5:00pm
Venue: Lecture Theater H
(Chen Kuan Cheng Forum)
near lift 27/28, HKUST
Abstract:
This talk introduces the large language and multimodal model series Qwen,
which stands for Tongyi Qianwen (通义千问), published and opensourced by
Alibaba Group. It will provide a brief review of the development of LLMs
and LMMs, and delve into details about building such models, including
pretraining, alignment, multimodal extension, as well as the opensource.
Additionally, it points out the limitations of recent work, and discusses
future work for both the research community and industry.
***************
Biography:
Junyang Lin is a staff engineer of Alibaba Group, and he is now a leader
of Qwen Team. He has been doing research in natural language processing
and multimodal representation learning, with a focus on large-scale
pretraining, and he has around 3000 citations. Recently his team released
and opensourced the Qwen series, including large language model Qwen,
large vision-language model Qwen-VL, and large audio-language model
Qwen-Audio. Previously, he focused on building large-scale pretraining
with a focus on multimodal pretraining, and developed opensourced models
OFA, Chinese-CLIP, etc. Now, he aims at building a multimodal AI system
towards a generalist agent.