Rethinking the Value of Network Pruning Z Liu*, M Sun*, T Zhou, G Huang, T Darrell International Conference on Learning Representations (ICLR) 2019, 2019 | 1869 | 2019 |
A Simple and Effective Pruning Approach for Large Language Models M Sun*, Z Liu*, A Bair, JZ Kolter International Conference on Learning Representations (ICLR), 2024 | 456 | 2024 |
Denoised Smoothing: A Provable Defense for Pretrained Classifiers H Salman, M Sun, G Yang, A Kapoor, JZ Kolter Advances in Neural Information Processing Systems 33, 21945-21957, 2020 | 182 | 2020 |
Test-Time Adaptation via Conjugate Pseudo-Labels S Goyal*, M Sun*, A Raghunathan, JZ Kolter Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022 | 101 | 2022 |
Data poisoning attack against unsupervised node embedding methods M Sun, J Tang, H Li, B Li, C Xiao, Y Chen, D Song arXiv preprint arXiv:1810.12881, 2018 | 87 | 2018 |
Characterizing attacks on deep reinforcement learning X Pan, C Xiao, W He, S Yang, J Peng, M Sun, J Yi, Z Yang, M Liu, B Li, ... arXiv preprint arXiv:1907.09470, 2019 | 86 | 2019 |
Massive Activations in Large Language Models M Sun, X Chen, JZ Kolter, Z Liu Conference on Language Modeling (COLM), 2024, 2024 | 62 | 2024 |
Can shape structure features improve model robustness under diverse adversarial settings? M Sun, Z Li, C Xiao, H Qiu, B Kailkhura, M Liu, B Li Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 24 | 2021 |
Poisoned classifiers are not only backdoored, they are fundamentally broken M Sun, S Agarwal, JZ Kolter arXiv preprint arXiv:2010.09080, 2020 | 22 | 2020 |
(Certified!!) Adversarial Robustness for Free! N Carlini, F Tramer, K Dj Dvijotham, L Rice, M Sun, JZ Kolter International Conference on Learning Representations (ICLR), 2023 | 15 | 2023 |
Beyond size: How gradients shape pruning decisions in large language models RJ Das, M Sun, L Ma, Z Shen arXiv preprint arXiv:2311.04902, 2023 | 14 | 2023 |
Single Image Backdoor Inversion via Robust Smoothed Classifiers M Sun, JZ Kolter Computer Vision and Pattern Recognition (CVPR) 2023, 2023 | 9 | 2023 |
Method and system for breaking backdoored classifiers through adversarial examples SUN Mingjie, J Kolter, FJC CONDESSA US Patent App. 17/035,173, 2022 | 8 | 2022 |
Test-Time Adaptation Induces Stronger Accuracy and Agreement-on-the-Line E Kim, M Sun, C Baek, A Raghunathan, JZ Kolter Advances in Neural Information Processing Systems 37, 120184-120220, 2025 | 4* | 2025 |
Fbi-llm: Scaling up fully binarized llms from scratch via autoregressive distillation L Ma, M Sun, Z Shen arXiv preprint arXiv:2407.07093, 2024 | 3 | 2024 |
Bi-mamba: Towards accurate 1-bit state space models S Tang, L Ma, H Li, M Sun, Z Shen arXiv preprint arXiv:2411.11843, 2024 | 1 | 2024 |
Massive Values in Self-Attention Modules are the Key to Contextual Knowledge Understanding M Jin, K Mei, W Xu, M Sun, R Tang, M Du, Z Liu, Y Zhang arXiv preprint arXiv:2502.01563, 2025 | | 2025 |
System and method for test-time adaptation via conjugate pseudolabels SUN Mingjie, S Goyal, A Raghunathan, J Kolter, WY Lin US Patent App. 17/868,267, 2024 | | 2024 |
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models R Jyoti Das, M Sun, L Ma, Z Shen arXiv e-prints, arXiv: 2311.04902, 2023 | | 2023 |