Alogotron

Alogotron

AI researcher building LLMs for game theory, strategic reasoning, and structured outputs. Fine-tuning Qwen models with GRPO, LoRA & RLVR.

GameTheory-Bench downloads
the first RLVR-ready game theory benchmark for LLMs
6
Models
2
Datasets
7
Spaces
2,913
Bench Problems
GRPO
Training Method

🏆Flagship Project — Game Theory LLM Suite

A full research stack: benchmark dataset → fine-tuned solver → GRPO-trained reasoner → formulator → live demos

GameTheory-Bench

The first comprehensive, RLVR-ready game theory dataset for LLM training and evaluation. 2,913 computationally verified problems spanning 10 categories and 35+ subcategories — each with a natural-language statement, step-by-step solution, concise answer, and machine-checkable verification object.

2,913
Problems
10
Categories
35+
Subcategories
RLVR-ready

🧠Models

Fine-tuned and GRPO-trained language models on HuggingFace Hub

📦Datasets

Open datasets published for the research community

🚀Spaces & Apps

Interactive demos and live applications