Soul Player C64: a real transformer running on a 1 MHz Commodore 64

2026-04-21 13:16 · 72 days ago · adunk

AI summary

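The developer has released Soul Player C64, a complete transformer model implementation that runs on the Commodore 64, an 8-bit home computer clocked at 1 MHz. The project ports the core architecture of modern generative AI to a classic hardware platform released in 1982, working within the tight limits of 64 KB of memory and a 1 MHz processor. The code is open source on GitHub and received 101 upvotes on Hacker News.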

Original text · untranslated

Soul Player C64

Soul Player C64 is an AI chatbot. Outputs are generated by a transformer language model, not a human.

A real transformer running on a 1 MHz Commodore 64.

And apparently on the Amstrad CPC, too! -> https://github.com/G1D30N/soulplayer-cpc

.-------.
| O O |
| V |
|..|---|..|
# SOUL PLAYER C64
25K PARAMETERS. 2 LAYERS.
REAL TRANSFORMER.
LOADED OFF A FLOPPY DISK.

YOU> hey
C64> HELLO! RE SOUNDS ME. MEFUL!

A 2-layer decoder-only transformer - the same architecture behind ChatGPT, Claude, and Gemini - implemented in hand-written 6502/6510 assembly and running on an unmodified Commodore 64. ~25,000 int8 parameters. Real multi-head causal self-attention, real softmax, real RMSNorm. About 60 seconds per token. The whole thing fits on a floppy disk with room to spare.
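For readers who want to see what those building blocks compute, here is a minimal floating-point sketch of RMSNorm and single-head causal self-attention in Python. It is a reference illustration only, not the project's 6502 code: the function names, the `eps` value, and the use of floats instead of int8 arithmetic are all assumptions made for readability.

```python
import math

def rmsnorm(x, gain, eps=1e-6):
    """Scale x so its root-mean-square is about 1, then apply a per-channel gain."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [g * v / rms for g, v in zip(gain, x)]

def softmax(scores):
    """Numerically stable softmax: subtract the maximum before exponentiating."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def causal_attention(q, k, v):
    """Single-head causal attention; q, k, v are lists of per-position vectors."""
    d = len(q[0])
    out = []
    for t in range(len(q)):
        # Position t may only attend to positions 0..t (the causal mask).
        scores = [
            sum(a * b for a, b in zip(q[t], k[s])) / math.sqrt(d)
            for s in range(t + 1)
        ]
        w = softmax(scores)
        out.append([
            sum(w[s] * v[s][i] for s in range(t + 1))
            for i in range(len(v[0]))
        ])
    return out

# Tiny usage example with made-up 2-dimensional vectors.
q = k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
print(causal_attention(q, k, v))
print(rmsnorm([3.0, 4.0], [1.0, 1.0]))
```

A multi-head version would run `causal_attention` once per head on its own slice of the embedding and concatenate the results.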

Architecture

2 layers, 4 attention heads × 8 dims, 32-dimensional embeddings, 64 FFN hidden units. ~25,000 parameters quantized to int8 with per-tensor shift scaling. The key breakthrough was fixing the softmax score normalization - shifting attention scores by 14 bits instead of 17 gives the 128-entry exp lookup table enough dynamic range to produce meaningful attention weights. Without this fix, the integer attention was essentially uniform across all positions, making the model blind regardless of architecture or training.
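The effect of the shift can be illustrated with a small Python sketch of a table-driven integer softmax. Everything concrete here is an assumption for illustration: the table contents (`255 * exp(-i / 16)`), the indexing by distance below the maximum score, the fixed-point normalization, and the made-up raw scores. The project's actual table and scaling may differ.

```python
import math

# Hypothetical 128-entry table: EXP_LUT[i] ~ 255 * exp(-i / 16), as 8-bit values.
# The real table's scale and step size are not given in the text.
EXP_LUT = [round(255 * math.exp(-i / 16)) for i in range(128)]

def int_softmax(scores, shift):
    """Integer softmax over raw accumulator scores using a table lookup."""
    top = max(scores)
    # Distance below the maximum, reduced to a table index by a right shift.
    idx = [min((top - s) >> shift, 127) for s in scores]
    weights = [EXP_LUT[i] for i in idx]
    total = sum(weights)
    # Normalise to fixed point with 8 fractional bits (256 represents 1.0).
    return [(w << 8) // total for w in weights]

# Made-up raw scores whose spread is smaller than 2**17.
raw = [120_000, 60_000, 10_000, 90_000]
print("shift 17:", int_softmax(raw, 17))
print("shift 14:", int_softmax(raw, 14))
```

With these sample scores, the 17-bit shift maps every position to table index 0, so all four weights come out equal. The 14-bit shift keeps three more bits of each difference, so the positions land on different table entries and the highest score receives the largest weight.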

Quick start - run the pre-built soul

Grab disk/soulplayer.d64 and load it in any C64 emulator (VICE recommended):

LOAD"SOULPLAYER",8,1
RUN

Type a short message in lowercase, press RETURN, wait. The border flashes while it thinks. Each token gets a SID blip. A full response takes a few minutes. Type q to quit.
