ChatGPT Generated Conversation
Research on ChatGPT-generated conversations explores how this large language model (LLM) performs in various interactive contexts, focusing on its capabilities, limitations, and potential biases. Current studies investigate its application in diverse fields, including education (e.g., essay scoring, literature review assistance), healthcare (e.g., medication management, robot interaction), and software development (e.g., code generation, library recommendation), often comparing its performance to human experts or other AI models. This work is significant because it helps assess the reliability and ethical implications of LLMs in real-world applications, informing the development of responsible AI and highlighting potential risks and benefits across numerous sectors.
Papers
Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT -- A Text-to-SQL Parsing Comparison
Shuo Sun, Yuchen Zhang, Jiahuan Yan, Yuze Gao, Donovan Ong, Bin Chen, Jian Su
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT
Xiaoshuai Song, Keqing He, Pei Wang, Guanting Dong, Yutao Mou, Jingang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu
Empirical Study of Zero-Shot NER with ChatGPT
Tingyu Xie, Qi Li, Jian Zhang, Yan Zhang, Zuozhu Liu, Hongwei Wang
Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving
Karen D. Wang, Eric Burkholder, Carl Wieman, Shima Salehi, Nick Haber
Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams
Ethan Callanan, Amarachi Mbakwe, Antony Papadimitriou, Yulong Pei, Mathieu Sibue, Xiaodan Zhu, Zhiqiang Ma, Xiaomo Liu, Sameena Shah
Measuring reasoning capabilities of ChatGPT
Adrian Groza
Are Emily and Greg Still More Employable than Lakisha and Jamal? Investigating Algorithmic Hiring Bias in the Era of ChatGPT
Akshaj Kumar Veldanda, Fabian Grob, Shailja Thakur, Hammond Pearce, Benjamin Tan, Ramesh Karri, Siddharth Garg
FakeGPT: Fake News Generation, Explanation and Detection of Large Language Models
Yue Huang, Lichao Sun
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
Jen-tse Huang, Wenxuan Wang, Eric John Li, Man Ho Lam, Shujie Ren, Youliang Yuan, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu
On the Generalization of Training-based ChatGPT Detection Methods
Han Xu, Jie Ren, Pengfei He, Shenglai Zeng, Yingqian Cui, Amy Liu, Hui Liu, Jiliang Tang