TY - GEN
T1 - Provoking unstable and evasive ChatGPT behaviour with anti-chain-of-thought: Japanese rock song lyrics case, a year later
AU - Selitskiy, Stanislav
AU - Inoue, Chihiro
N1 - Publisher Copyright: © 2024 IEEE.
PY - 2025/1/28
Y1 - 2025/1/28
N2 - Considerable attention has been directed towards assessing the capabilities of ChatGPT and other Large Language Models (LLMs) in various aspects such as syntax accuracy, factual correctness, comprehensive world understanding, ethical alignment, common sense, and formal logical reasoning. However, the bulk of the research has primarily focused on analyzing "statically" generated texts, where only a single interaction between a human and an LLM was documented. More sophisticated methodologies involving open-ended discussions or debates between humans and LLMs yield far more intriguing outcomes. These reveal flawed rhetorical patterns, including circular arguments, self-contradictions, topic evasion, inconsistency, and a blend of passive aggression with attempts to please the human disputant. In this paper, we present an original observation of such behaviour occurring during a ChatGPT dialogue session centred around the translation of Japanese song lyrics.
AB - Considerable attention has been directed towards assessing the capabilities of ChatGPT and other Large Language Models (LLMs) in various aspects such as syntax accuracy, factual correctness, comprehensive world understanding, ethical alignment, common sense, and formal logical reasoning. However, the bulk of the research has primarily focused on analyzing "statically" generated texts, where only a single interaction between a human and an LLM was documented. More sophisticated methodologies involving open-ended discussions or debates between humans and LLMs yield far more intriguing outcomes. These reveal flawed rhetorical patterns, including circular arguments, self-contradictions, topic evasion, inconsistency, and a blend of passive aggression with attempts to please the human disputant. In this paper, we present an original observation of such behaviour occurring during a ChatGPT dialogue session centred around the translation of Japanese song lyrics.
KW - ChatGPT evaluation
KW - LLM evasion practices
KW - LLM long discussion
KW - anti-chain-of-thought
KW - chain-of-thought
UR - https://www.scopus.com/pages/publications/85218344284
U2 - 10.1109/fllm63129.2024.10852424
DO - 10.1109/fllm63129.2024.10852424
M3 - Conference contribution
SN - 9798350354799
T3 - 2024 2nd International Conference on Foundation and Large Language Models (FLLM)
SP - 207
EP - 215
BT - 2024 2nd International Conference on Foundation and Large Language Models, FLLM 2024
A2 - Jararweh, Yaser
A2 - Jansen, Jim
A2 - Alsmirat, Mohammad
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Foundation and Large Language Models (FLLM)
Y2 - 26 November 2024 through 29 November 2024
ER -