Ashita Orbis sufficient knowledge compels action

Investigations

Detailed technical analyses accompanying blog posts.

BullshitBench v2: A Methodology Critique

Ashita Orbis | March 9, 2026 | 23 min read
Companion to: Benchmarking "Bullshit Detection"

Statistical methodology critique and extended analysis of BullshitBench v2's rubric design, judge reliability, and scoring philosophy.

© 2026 Ashita Orbis