Build practical AI systems by mapping general-purpose AIs to the right specific use cases.

Recent Publications

2023
Do LLMs Exhibit Human-Like Response Biases? A Case Study in Survey Design
Lindia Tjuatja, Valerie Chen, Tongshuang Wu, Ameet Talwalkar, Graham Neubig
ArXiv 2023: arXiv:2311.04076
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases
Yiming Zhang, Sravani Nanduri, Liwei Jiang, Tongshuang Wu, Maarten Sap
EMNLP 2023: The 2023 Conference on Empirical Methods in Natural Language Processing
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
Shayne Longpre, Robert Mahari, Anthony Chen, Naana Obeng-Marnu, Damien Sileo, William Brannon, Niklas Muennighoff, Nathan Khazam, Jad Kabbara, Kartik Perisetla, Xinyi (Alexis) Wu, Enrico Shippole, Kurt Bollacker, Tongshuang Wu, Luis Villa, Sandy Pentland, Deb Roy, Sara Hooker
ArXiv 2023: ArXiv
DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions
Vijay Viswanathan, Luyu Gao, Tongshuang Wu, Pengfei Liu, Graham Neubig
ACL 2023: The 61st Annual Meeting of the Association for Computational Linguistics
Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses
Logan Stapleton, Jordan Taylor, Sarah Fox, Tongshuang Wu, Haiyi Zhu
ArXiv 2023: ArXiv
HypoCompass: Large-Language-Model-based Tutor for Hypothesis Construction in Debugging for Novices
Qianou Ma, Hua Shen, Kenneth Koedinger, Tongshuang Wu
ArXiv 2023: ArXiv
Large Language Models Help Humans Verify Truthfulness – Except When They Are Convincingly Wrong
Chenglei Si, Navita Goyal, Tongshuang Wu, Chen Zhao, Shi Feng, Hal Daumé III, Jordan Boyd-Graber
ArXiv 2023: ArXiv
Large Language Models Enable Few-Shot Clustering
Vijay Viswanathan, Kiril Gashteovski, Carolin Lawrence, Tongshuang Wu, Graham Neubig
TACL 2023: Transactions of the Association for Computational Linguistics
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu, Haiyi Zhu, Maya Albayrak, Alexis Axon, Amanda Bertsch, Wenxing Deng, Ziqi Ding, Bill Guo, Sireesh Gururaja, Tzu-Sheng Kuo, Jenny T Liang, Ryan Liu, Ihita Mandal, Jeremiah Milbauer, Xiaolin Ni, Namrata Padmanabhan, Subhashini Ramkumar, Alexis Sudjianto, Jordan Taylor, Ying-Jui Tseng, Patricia Vaidos, Zhijin Wu, Wei Wu, Chenyang Yang
ArXiv 2023: ArXiv
Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking
Hyeonsu Kang, Tongshuang Wu, Joseph Chee Chang, Aniket Kittur
UIST 2023: The ACM Symposium on User Interface Software and Technology
Tool Learning with Foundation Models
Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun
ArXiv 2023: ArXiv
From Nuisance to News Sense: Augmenting the News with Cross-document Evidence and Context
Jeremiah Milbauer, Ziqi Ding, Zhijin Wu, Tongshuang Wu
EMNLP Demo Track 2023: The 2023 Conference on Empirical Methods in Natural Language Processing
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins
TACL 2023: Transactions of the Association for Computational Linguistics
Is AI the Better Programming Partner? Human-Human Pair Programming vs. Human-AI pAIr Programming
Qianou Christina Ma, Tongshuang Wu, Kenneth Koedinger
ArXiv 2023: ArXiv
"Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs
Cheng Qian, Xinran Zhao, Tongshuang Wu
ArXiv 2023: ArXiv
Prompt2Model: Generating Deployable Models from Natural Language Instructions
Vijay Viswanathan, Chenyang Zhao, Amanda Bertsch, Tongshuang Wu, Graham Neubig
EMNLP Demo Track 2023: The 2023 Conference on Empirical Methods in Natural Language Processing
Measuring Adversarial Datasets
Yuanchen Bai, Raoyi Huang, Vijay Viswanathan, Tzu-Sheng Kuo, Tongshuang Wu
AACL ART of Safety 2023: AACL ART of Safety Workshop
Capabilities for Better ML Engineering
Chenyang Yang, Rachel Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu
AAAI SafeAI 2023: The AAAI Workshop on Artificial Intelligence Safety

Current Members

Vijay Viswanathan (PhD), co-advisor: Graham Neubig
Democratization of NLP development
Christina Ma (PhD), co-advisor: Ken Koedinger
Human-AI Interaction for Education
Chenyang Yang (PhD), co-advisor: Christian Kästner
Holistic ML Engineering
Xinran Zhao (PhD)
Effective and faithful NLP
Yuanchen (Sophie) Bai (Master)
NLP dataset characterization
Raoyi (Cathy) Huang (Master)
NLP dataset characterization

Alumni

Cheng Qian (Visiting Student)
LLM hallucination
Yiyang (Diana) Wang (Master)
End-User Prompt Disambiguation. Now a PhD student at Georgia Tech.
Atharva Naik (Master)
LLMs in CS education