Damai Dai

Cited by

	All	Since 2019
Citations	2224	2219
h-index	15	15
i10-index	19	19

Citations per year

Year	Citations
2019	11
2020	29
2021	64
2022	130
2023	735
2024	1244

Public access

7 articles available
0 articles not available

Based on funding mandates

Co-authors

Baobao CHANG	Peking University	Verified email at pku.edu.cn
Qingxiu Dong	Peking University	Verified email at stu.pku.edu.cn
Xu Sun	Associate Professor, Peking University	Verified email at pku.edu.cn
Li Dong	Microsoft Research	Verified email at microsoft.com
Furu Wei	Partner Research Manager, Microsoft Research	Verified email at microsoft.com
Tianyu Liu	Alibaba	Verified email at pku.edu.cn
Shuming Ma	Microsoft Research Asia	Verified email at microsoft.com
Peiyi Wang	Peking University	Verified email at stu.pku.edu.cn
Fuli Luo (罗福莉)	DeepSeek	Verified email at pku.edu.cn
Wei Li	Beijing Language and Culture University	Verified email at blcu.edu.cn

Damai Dai

Other names: 代达劢

Peking University, DeepSeek AI

Verified email at pku.edu.cn

Deep Learning, Natural Language Processing, Large Language Model, Mixture-of-Experts


Title	Cited by	Year
A survey on in-context learning. Q Dong, L Li, D Dai, C Zheng, Z Wu, B Chang, X Sun, J Xu, Z Sui. arXiv preprint arXiv:2301.00234, 2022	840	2022
Knowledge neurons in pretrained transformers. D Dai, L Dong, Y Hao, Z Sui, C Baobao, F Wei. Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	357	2022
Why can GPT learn in-context? Language models implicitly perform gradient descent as meta-optimizers. D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei. Findings of the Association for Computational Linguistics: ACL 2023, 4005-4019, 2023	270	2023
DeepSeek LLM: Scaling open-source language models with longtermism. X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024	95	2024
Calibrating Factual Knowledge in Pretrained Language Models. Q Dong, D Dai, Y Song, J Xu, Z Sui, L Li. Findings of the Association for Computational Linguistics: EMNLP 2022, 2022	77	2022
Preliminary study on the construction of Chinese medical knowledge graph. O Byambasuren, Y Yang, Z Sui, D Dai, B Chang, S Li, H Zan. Journal of Chinese Information Processing 33 (10), 1-9, 2019	72*	2019
Learning to control the fine-grained sentiment for story ending generation. F Luo, D Dai, P Yang, T Liu, B Chang, Z Sui, X Sun. Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019	62	2019
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning. L Wang, L Li, D Dai, D Chen, H Zhou, F Meng, J Zhou, X Sun. (EMNLP 2023 Best Long Paper) Proceedings of the 2023 Conference on Empirical …, 2023	60	2023
LiveBot: Generating live video comments based on visual and textual contexts. S Ma, L Cui, D Dai, F Wei, X Sun. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6810-6817, 2019	59	2019
On the representation collapse of sparse mixture of experts. Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ... Advances in Neural Information Processing Systems 35, 34600-34613, 2022	56	2022
DeepSeekMoE: Towards ultimate expert specialization in mixture-of-experts language models. D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	47	2024
StableMoE: Stable Routing Strategy for Mixture of Experts. D Dai, L Dong, S Ma, B Zheng, Z Sui, B Chang, F Wei. Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	46	2022
Math-Shepherd: Verify and reinforce LLMs step-by-step without human annotations. P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui. Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	40*	2024
Sememe prediction: Learning semantic knowledge from unstructured textual wiki descriptions. W Li, X Ren, D Dai, Y Wu, H Wang, X Sun. arXiv preprint arXiv:1808.05437, 2018	20	2018
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions. D Dai, H Zheng, F Luo, P Yang, T Liu, Z Sui, B Chang. Proceedings of the 6th ACL Workshop on Representation Learning for NLP …, 2021	18	2021
Behind the scenes: An exploration of trigger biases problem in few-shot event classification. P Wang, R Xun, T Liu, D Dai, B Chang, Z Sui. Proceedings of the 30th ACM International Conference on Information …, 2021	15	2021
Neural knowledge bank for pretrained transformers. D Dai, W Jiang, Q Dong, Y Lyu, Z Sui. CCF International Conference on Natural Language Processing and Chinese …, 2023	13	2023
Hierarchical Curriculum Learning for AMR Parsing. P Wang, L Chen, T Liu, D Dai, Y Cao, B Chang, Z Sui. Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	12	2022
Leveraging word-formation knowledge for Chinese word sense disambiguation. H Zheng, L Li, D Dai, D Chen, T Liu, X Sun, Y Liu. Findings of the Association for Computational Linguistics: EMNLP 2021, 918-923, 2021	11	2021
Coarse-to-fine entity representations for document-level relation extraction. D Dai, J Ren, S Zeng, B Chang, Z Sui. CCF International Conference on Natural Language Processing and Chinese …, 2023	8	2023

Articles 1–20