Featured Study
From Prompt to Playable: Evaluating AI-Generated Browser Games
An independent vibe-coding benchmark comparing Claude Code, Cursor, and OpenAI Codex on a single prompt to create a complete browser arcade game.
Original
Long-form technical analysis written for engineers and architects who want substance over summary. Topics span model architectures, production deployment patterns, retrieval systems, testing methodology, and AI infrastructure.
All Articles
A practical benchmark comparing Claude Code, Cursor, and Codex after asking each to generate a complete single-file browser arcade game.
Weighted scoring, output analysis, and practical recommendations from comparing four AI-generated timeline diagrams produced from the exact same prompt.
Methodology, scorecard results, and remediation ideas from an independent benchmarking effort that tests whether major LLMs treat US framing as global truth.