Pre-trained Language Model | Yequan's Academic

52B to 1T: Lessons Learned via Tele-FLM Series

Wed, 03 Jul 2024 00:00:00 +0000

Masked Structural Growth for 2x Faster Language Model Pre-training

Tue, 07 May 2024 00:00:00 +0000

Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales

Fri, 14 Apr 2023 00:00:00 +0000

The overview of Big Language Model and ChatGPT

Thu, 23 Feb 2023 09:00:00 +0000

This talk aims to lay out the challenges and the powerful abilities of ChatGPT and the GPT-series language models behind it. More importantly, we hope to discuss future directions for both academia and industry.

Specifically, we introduce language models, Wudao 2.0, and academic research on language models. The slides will be released after the anonymous appraisal stage.