Follow
Qingqing Cao
Qingqing Cao
AIML Research Scientist, Apple
Verified email at apple.com - Homepage
Title
Cited by
Cited by
Year
MobiRNN: Efficient recurrent neural network execution on mobile GPU
Q Cao, N Balasubramanian, A Balasubramanian
Proceedings of the 1st International Workshop on Deep Learning for Mobile …, 2017
742017
Efficient methods for natural language processing: A survey
M Treviso, JU Lee, T Ji, B Aken, Q Cao, MR Ciosici, M Hassid, K Heafield, ...
Transactions of the Association for Computational Linguistics 11, 826-860, 2023
552023
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
Q Cao, H Trivedi, A Balasubramanian, N Balasubramanian
ACL 2020, 2020
552020
Towards Accurate and Reliable Energy Measurement of NLP Models
Q Cao, A Balasubramanian, N Balasubramanian
SustaiNLP@EMNLP 2020, 2020
262020
Uiwear: Easily adapting user interfaces for wearable devices
J Xu*, Q Cao*, A Prakash, A Balasubramanian, DE Porter
Proceedings of the 23rd Annual International Conference on Mobile Computing …, 2017
252017
A survey for efficient open domain question answering
Q Zhang, S Chen, D Xu, Q Cao, X Chen, T Cohn, M Fang
arXiv preprint arXiv:2211.07886, 2022
232022
Deqa: On-device question answering
Q Cao, N Weber, N Balasubramanian, A Balasubramanian
Proceedings of the 17th Annual International Conference on Mobile Systems …, 2019
172019
IrEne: Interpretable Energy Prediction for Transformers
Q Cao, YK Lal, H Trivedi, A Balasubramanian, N Balasubramanian
ACL 2021, 2021
152021
Pumer: Pruning and merging tokens for efficient vision language models
Q Cao, B Paranjape, H Hajishirzi
arXiv preprint arXiv:2305.17530, 2023
72023
MobiVQA: Efficient On-Device Visual Question Answering
Q Cao, P Khanna, ND Lane, A Balasubramanian
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous …, 2022
62022
Are mobile dnn accelerators accelerating dnns?
Q Cao, AE Irimiea, M Abdelfattah, A Balasubramanian, ND Lane
Proceedings of the 5th International Workshop on Embedded and Mobile Deep …, 2021
62021
Faster and just as accurate: A simple decomposition for transformer models
Q Cao, H Trivedi, A Balasubramanian, N Balasubramanian
52019
Efficiency pentathlon: A standardized arena for efficiency evaluation
H Peng, Q Cao, J Dodge, ME Peters, J Fernandez, T Sherborne, K Lo, ...
arXiv preprint arXiv:2307.09701, 2023
42023
Apt: Adaptive pruning and tuning pretrained language models for efficient training and inference
B Zhao, H Hajishirzi, Q Cao
arXiv preprint arXiv:2401.12200, 2024
22024
IrEne-viz: Visualizing Energy Consumption of Transformer Models
YK Lal, R Singh, H Trivedi, Q Cao, A Balasubramanian, ...
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
12021
Bew: Towards Answering Business-Entity-Related Web Questions
Q Cao, O Riva, A Balasubramanian, N Balasubramanian
arXiv preprint arXiv:2012.05818, 2020
12020
Efficiency Pentathlon: A Standardized Benchmark for Efficiency Evaluation
H Peng, Q Cao, J Dodge, ME Peters, J Fernandez, T Sherborne, K Lo, ...
2023
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
Q Cao, S Min, Y Wang, H Hajishirzi
arXiv preprint arXiv:2310.01329, 2023
2023
[TACL] Efficient Methods for Natural Language Processing: A Survey
M Treviso, JU Lee, T Ji, B van Aken, Q Cao, M Ciosici, M Hassid, ...
The 61st Annual Meeting Of The Association For Computational Linguistics, 2023
2023
Efficient Natural Language Processing for Heterogeneous Platforms
Q Cao
State University of New York at Stony Brook, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–20