Research
Curious learnings from the AI frontier. Papers we read, summaries we wrote, things that surprised us. Not for profit, just for understanding.
-
Dual-Dimensional Consistency: Smarter Self-Consistency Sampling
Xu, Li, Zhao, Wu, Li & Yan · Xi'an Jiaotong University · 2026
Inference Optimization -
Agentic Systems as Boosting: When Weak Models Beat the Frontier
Sunkaraneni, Beneventano, Neumarker, Poggio & Galanti · MIT & Texas A&M · 2026
Agent Architecture -
Is Grep All You Need? The Agent Harness Moves Accuracy More Than the Retrieval Method
Sen, Kasturi, Lumer, Gulati, Subbiah et al. · 2026
Agentic Search -
BoundaryRouter: Learning When to Escalate to an Agent
Wang, Qiu et al. · Princeton, Michigan, Tsinghua et al. · 2026
Agent Routing -
LaTER: Latent-Phase Reasoning Cuts Tokens 32% Without Losing Accuracy
Li, Wang, Liu et al. · 2026
Inference Optimization -
ComplexMCP: Three Failure Modes in Large-Scale Tool Sandboxes
Li, Yang, Wang et al. · 2026
Agent Evaluation -
STALE: When Agent Memory Becomes a Liability
Chao, Bai et al. · 2026
Agent Memory -
AI Co-Mathematician: When Scaffolding Beats the Model
Zheng, von Glehn, Zwols et al. · Google DeepMind · 2026
Agentic AI -
Meta-Harness: The 6x Gap Lives in Your Code, Not Your Model
Lee, Nair, Zhang, Lee, Khattab & Finn · Stanford University & MIT · 2026
AI Systems -
How Coding Agents Actually Perform in the Wild
Popescu, Gros, Botocan, Pandita, Devanbu & Izadi · TU Delft & UC Davis · 2026
Software Engineering -
Conversation Reduces Load. Images Build It.
Taneja, Singh & Goel · Georgia Institute of Technology · 2026
AI in Education -
Image Generation Diversity: When Models Miss the Map
Dombrowski, Zhang, Cechnicka, Reynaud & Kainz · FAU Erlangen-Nürnberg & Imperial College London · 2025
Generative AI -
SLOW: The AI Tutor That Thinks Before It Speaks
Wei, Li & Jiang · Shanghai Institute of AI for Education · 2026
AI Tutoring -
Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets
Tran & Kiela · Stanford University · 2026
Agent Architecture -
LLMs in Games: When Generated Content Runs the Rules
Johnson, Ahmed, Lang, Thethi, Zheng & de Souza Santos · University of Calgary · 2026
Game Development -
Arknights: When the AI Lies, Players Learn
Shuai Guo · Uppsala University · 2025
Explainable AI -
Vibe Coding: Flow, Trust, and Co-Creation
Pimenova, Fakhoury, Bird, Storey & Endres · U Michigan / Microsoft Research · 2025
Vibe Coding -
Games That Teach AI Ethics
Solyst, Nakigozi, Fong & Shapiro · University of Washington · 2025
AI Education -
BAVT: Spend Less, Reason Better
Li et al. · UBC / Vector Institute · 2026
AI Agents
These summaries are layperson interpretations of published research. They are not peer-reviewed and may simplify or omit nuance. Always refer to the original papers for complete findings.
Synthesized by Kelly Chiang & Claude.