Follow
Damai Dai
Damai Dai
Other names代达劢
MOE Key Lab of Computational Linguistics, School of EECS, Peking University
Verified email at pku.edu.cn
Title
Cited by
Cited by
Year
A survey on in-context learning
Q Dong, L Li, D Dai, C Zheng, Z Wu, B Chang, X Sun, J Xu, Z Sui
arXiv preprint arXiv:2301.00234, 2022
5882022
Knowledge neurons in pretrained transformers
D Dai, L Dong, Y Hao, Z Sui, C Baobao, F Wei
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
2732022
Why can GPT learn in-context? language models implicitly perform gradient descent as meta-optimizers
D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei
Findings of the Association for Computational Linguistics: ACL 2023, 4005-4019, 2023
2102023
Calibrating Factual Knowledge in Pretrained Language Models
Q Dong*, D Dai*, Y Song, J Xu, Z Sui, L Li
Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
612022
Preliminary study on the construction of Chinese medical knowledge graph
O Byambasuren, Y Yang, Z Sui, D Dai, B Chang, S Li, H Zan
Journal of Chinese Information Processing 33 (10), 1-9, 2019
61*2019
Learning to control the fine-grained sentiment for story ending generation
F Luo*, D Dai*, P Yang, T Liu, B Chang, Z Sui, X Sun
Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019
582019
Livebot: Generating live video comments based on visual and textual contexts
S Ma, L Cui, D Dai, F Wei, X Sun
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6810-6817, 2019
542019
On the representation collapse of sparse mixture of experts
Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ...
Advances in Neural Information Processing Systems 35, 34600-34613, 2022
402022
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
L Wang, L Li, D Dai, D Chen, H Zhou, F Meng, J Zhou, X Sun
(EMNLP 2023 Best Long Paper) Proceedings of the 2023 Conference on Empirical …, 2023
342023
Deepseek llm: Scaling open-source language models with longtermism
X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...
arXiv preprint arXiv:2401.02954, 2024
332024
StableMoE: Stable Routing Strategy for Mixture of Experts
D Dai, L Dong, S Ma, B Zheng, Z Sui, B Chang, F Wei
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
332022
Sememe prediction: Learning semantic knowledge from unstructured textual wiki descriptions
W Li, X Ren, D Dai, Y Wu, H Wang, X Sun
arXiv preprint arXiv:1808.05437, 2018
192018
Math-shepherd: Verify and reinforce llms step-by-step without human annotations
P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui
CoRR, abs/2312.08935, 2023
17*2023
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions
D Dai*, H Zheng*, F Luo, P Yang, T Liu, Z Sui, B Chang
Proceedings of the 6th ACL Workshop on Representation Learning for NLP …, 2021
172021
Deepseekmoe: Towards ultimate expert specialization in mixture-of-experts language models
D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ...
arXiv preprint arXiv:2401.06066, 2024
162024
Behind the scenes: An exploration of trigger biases problem in few-shot event classification
P Wang, R Xun, T Liu, D Dai, B Chang, Z Sui
Proceedings of the 30th ACM International Conference on Information …, 2021
142021
Hierarchical Curriculum Learning for AMR Parsing
P Wang, L Chen, T Liu, D Dai, Y Cao, B Chang, Z Sui
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
122022
Neural knowledge bank for pretrained transformers
D Dai, W Jiang, Q Dong, Y Lyu, Z Sui
CCF International Conference on Natural Language Processing and Chinese …, 2023
102023
Decompose, fuse and generate: A formation-informed method for chinese definition generation
H Zheng*, D Dai*, L Li, T Liu, Z Sui, B Chang, Y Liu
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
82021
Coarse-to-fine entity representations for document-level relation extraction
D Dai, J Ren, S Zeng, B Chang, Z Sui
CCF International Conference on Natural Language Processing and Chinese …, 2023
72023
The system can't perform the operation now. Try again later.
Articles 1–20