Alogotron

AI researcher building LLMs for game theory, strategic reasoning, and structured outputs. Fine-tuning Qwen models with GRPO, LoRA & RLVR.

GameTheory-Bench downloads

—

the first RLVR-ready game theory benchmark for LLMs

🤗 HuggingFace Profile 🎲 Live Demo

Models

Datasets

Spaces

2,913

Bench Problems

GRPO

Training Method

🏆Flagship Project — Game Theory LLM Suite

A full research stack: benchmark dataset → fine-tuned solver → GRPO-trained reasoner → formulator → live demos

GameTheory-Bench

The first comprehensive, RLVR-ready game theory dataset for LLM training and evaluation. 2,913 computationally verified problems spanning 10 categories and 35+ subcategories — each with a natural-language statement, step-by-step solution, concise answer, and machine-checkable verification object.

2,913

Problems

🧠Models

Fine-tuned and GRPO-trained language models on HuggingFace Hub

💡GRPO · RLVR

GameTheory-Reasoner

Reinforcement-learning fine-tuned (GRPO) Qwen2.5-7B for game theory reasoning. Trained on GameTheory-Bench to solve Nash equilibria, dominant strategies, and multi-step strategic problems.

game-theorygrporeinforcement-learningqwen2.5LoRA

📥 24 downloads🦙 Qwen2.5-7B base

🎲SFT · QLoRA

GameTheory-Solver

Supervised fine-tuned Qwen2.5-7B for solving game theory problems including Nash equilibria and economic games. Based on the GameTheory-Bench dataset.

game-theorynash-equilibriumeconomicsmathqlora

📥 38 downloads🦙 Qwen2.5-7B base

📐SFT · QLoRA

GameTheory-Formulator

Specialised model for converting natural-language strategic scenarios into formal game-theoretic representations — the formulation step in the full pipeline.

formulationstrategic-reasoningdecision-theoryqwen2.5

📥 29 downloads🦙 Qwen2.5-7B base

🔧PEFT · LoRA

Qwen2.5-3B JSON Output

LoRA adapter on Qwen2.5-3B-Instruct fine-tuned for reliable structured JSON output generation — useful for tool-use and agentic pipelines.

jsonstructured-outputsfttrl

📥 5 downloads🦙 Qwen2.5-3B base

🧮Tiny · Tool-Use

TinyToolUse Calculator

Ultra-compact Qwen2-0.5B fine-tuned for function-calling and calculator tool use. Explores how small models can reliably perform structured tool invocations.

tool-usefunction-callingcalculatortiny

📥 4 downloads🦙 Qwen2-0.5B base

📦Datasets

Open datasets published for the research community

🎲Active

GameTheory-Bench

2,913 computationally verified game theory problems spanning 10 categories and 35+ subcategories. The first RLVR-ready game theory benchmark for LLMs, with machine-checkable verification objects.

game-theoryrlvrbenchmarkmathreasoning

📥 41 downloads🔖 MIT

📊Superseded

Game Theory Business Strategy

Original business-strategy scenario dataset. Superseded by the broader GameTheory-Bench collection, which covers 6 domains including 220+ business scenarios with full formulation steps.

business-strategygame-theorydeprecated

📥 41 downloads🔖 MIT

🚀Spaces & Apps

Interactive demos and live applications

🎮Live · Gradio

GameTheory Solver Demo

Interactive demo for the GameTheory-Solver model. Enter a game theory problem and watch the model find Nash equilibria and optimal strategies step by step.

game-theorynash-equilibriumgradiodemo

💬Live · Gradio

GameTheory Chat

Conversational interface for the GameTheory models. Ask strategic questions, explore dominant strategies, and reason through multi-player games interactively.

chatgame-theoryzerogpugradio

🔬Research Tool

NeuroScope

Activation visualisation and interpretability tool for transformer models. Explore attention patterns, hidden states, and mechanistic interpretability of Qwen models.

interpretabilityattentionhidden-statesmechanistic-interpretability

📝Static · Blog

Game Theory LLM Blog

Research blog documenting the methodology, experiments, and findings from training LLMs on game theory — covering GRPO, RLVR, and fine-tuning insights.

grporlvrfine-tuningblog

🩺Live · Gradio

Medical Reasoning AI

AI medical reasoning application exploring LLM capabilities in clinical and diagnostic reasoning tasks.

medicalreasoninggradio