🛠️ Development

Everything on the development side: agents, testing, models, tooling, and how teams build with AI. Sorted here by topic, not by the Slack channel it came from.

Blog • Ably • Amber Dawson

Is AI making your teams better, or just busier?

Argues AI usage metrics don't capture whether teams actually get better, and proposes outcome-based KPIs.

  • 88% use AI but only 39% report EBIT impact — usage ≠ value.
  • Two KPIs: new outcomes unlocked, and how embedded AI is in the workflow.
  • Needs structural support (scorecards, skill repos), not just tool access.

added by Adam Tomat • 1st Jul 2026

Blog • OpenAI

Previewing GPT-5.6 Sol

OpenAI's official preview of the GPT-5.6 Sol / Terra / Luna models.

  • A next-generation model family preview from OpenAI.
  • Positioned as a step up in the GPT-5.x line.
  • See the post for the full capability and benchmark specifics.

added by Seb Kay • 26th Jun 2026

Blog • Decoding AI • Alejandro Aboy

How Evaluation-Driven Development (EDD) Works for AI Agents

A pre-merge evaluation gate for AI agent changes, using simulated inputs run through the real agent.

  • An offline gate answering "does it work / did anything regress".
  • Simulate inputs (drawn from real traces), not outputs.
  • Rejects always-on prod eval as too costly; runs targeted branch experiments with calibrated binary judges.

added by Adam Tomat • 23rd Jun 2026
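
For readers who want a feel for the gate described in this entry, here is a minimal sketch of the idea, not the article's implementation. It assumes an agent is any callable from input text to output text and a judge is any callable returning pass or fail. Every name here (`Case`, `pass_rate`, `gate`, the toy agents, the `min_pass_rate` threshold) is hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Case:
    """One evaluation case. Only the input is simulated (e.g. derived from a real trace)."""
    name: str
    simulated_input: str


Agent = Callable[[str], str]        # hypothetical: the real agent under test
Judge = Callable[[str, str], bool]  # hypothetical: binary judge over (input, output)


def pass_rate(agent: Agent, cases: Sequence[Case], judge: Judge) -> float:
    """Run every simulated input through the agent and return the fraction judged as passing."""
    if not cases:
        return 0.0
    verdicts = [judge(c.simulated_input, agent(c.simulated_input)) for c in cases]
    return sum(verdicts) / len(verdicts)


def gate(baseline: Agent, candidate: Agent, cases: Sequence[Case],
         judge: Judge, min_pass_rate: float = 0.9) -> bool:
    """Pre-merge check: the candidate must work (meet the threshold)
    and must not regress (score at least as well as the baseline)."""
    base = pass_rate(baseline, cases, judge)
    cand = pass_rate(candidate, cases, judge)
    return cand >= min_pass_rate and cand >= base


# Toy stand-ins so the sketch runs end to end (assumptions, not real agents).
CASES = [Case("greeting", "hello"), Case("padded", " refund please ")]


def baseline_agent(text: str) -> str:
    return text.upper()


def candidate_agent(text: str) -> str:
    return text.strip().upper()


def regressed_agent(text: str) -> str:
    return ""


def exact_judge(inp: str, out: str) -> bool:
    # Binary verdict: output must be the trimmed, upper-cased input.
    return out == inp.strip().upper()


if __name__ == "__main__":
    print("candidate passes gate:", gate(baseline_agent, candidate_agent, CASES, exact_judge))
    print("regression passes gate:", gate(baseline_agent, regressed_agent, CASES, exact_judge))
```

In a real setup the toy judge would likely be replaced by a calibrated model-based judge and the cases by inputs sampled from production traces; the comparison against a baseline is one simple way to answer "did anything regress" on a branch before merging.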

Curated from the AI Chinwag Slack community.