GPT-4o 辅助游戏重构与功能生成:一项无尽跑酷游戏案例研究
阅读原文· arxiv.org一项基于 GPT-4o 在 Python/Pygame 无尽跑酷游戏中的探索性案例研究,评估了六项开发任务:三项本地化重构和三项游戏玩法功能生成。结果显示,GPT-4o 成功完成了所有重构任务,但仅正确集成了一项新功能。研究表明,在此场景下,GPT-4o 处理局部代码转换比实现跨系统的新交互更可靠,为 LLM 辅助游戏开发提供了透明案例参考。
Large language models (LLMs) are increasingly used to support software development, but their practical usefulness in applied game-development settings remains underexplored, especially when generated code must be integrated into an existing game software system. This paper presents an exploratory empirical case study of GPT-4o in a custom Python/Pygame endless runner. The study examines six selected development tasks: three localized refactoring tasks and three tasks involving gameplay feature generation. The resulting implementations were evaluated using software metrics, unit tests, and manual gameplay assessments. In this case study, all three selected refactoring tasks were completed successfully in functional terms, whereas only one of the three selected gameplay feature generation tasks resulted in a correctly integrated feature. The findings suggest that, in this setting, GPT-4o handled localized transformations more reliably than tasks requiring new gameplay interactions across multiple existing systems. Given the exploratory single-case design, these results are best interpreted as indicative observations rather than as generalizable evidence of category-level model performance. Overall, the paper contributes a transparent case-based account of the opportunities and limitations of LLM-assisted refactoring and gameplay feature generation in an existing game software system.