Domain Adaptation of Base Models + ShadowdarkQA Bench
Investigating the effects of continued pre-training for learning precise mechanical rules of TTRPGs.
Investigating the effects of continued pre-training for learning precise mechanical rules of TTRPGs.
Can an LLM run a satisfying game of Dungeons & Dragons? The Gygax Test explores whether LLMs can create consistent characters, craft emerging narratives, and run tactical combat across a full campaign.