🛠️ Development

Everything on the development side — agents, testing, models, tooling, and how teams build with AI. Sorted into here by topic, not by which Slack channel it came from.

BlogMiguel Carranza

AI shouldn't shrink headcount. It should shrink teams

AI lets orgs run smaller teams per initiative rather than cutting headcount.

AI lowers the cost of writing and reviewing code.
RevenueCat's "Office of the CTO" small-team model, generalised org-wide.
The tradeoff is fragility — it needs strong individual contributors.

View ↗

added by Adam Tomat • 1st Jul 2026

BlogAbly • Amber Dawson

Is AI making your teams better, or just busier?

Argues AI usage metrics don't capture whether teams actually get better, and proposes outcome-based KPIs.

88% use AI but only 39% report EBIT impact — usage ≠ value.
Two KPIs: new outcomes unlocked, and how embedded AI is in the workflow.
Needs structural support (scorecards, skill repos), not just tool access.

View ↗

added by Adam Tomat • 1st Jul 2026

BlogCursor • Chris Brauchli, Rikki Mukherjee & Kevin Niparko

Build from anywhere with Cursor for iOS

Cursor's native iOS app (public beta) to launch and steer coding agents from a phone.

Launch cloud agents or control local agents remotely.
Voice input plus slash commands.
Local ↔ cloud agent handoff.

View ↗

added by Adam Tomat • 30th Jun 2026

BlogOpenAI

Previewing GPT-5.6 Sol

OpenAI's official preview of the GPT-5.6 Sol / Terra / Luna models.

A next-generation model family preview from OpenAI.
Positioned as a step up in the GPT-5.x line.
See the post for the full capability and benchmark specifics.

View ↗

added by Seb Kay • 26th Jun 2026

Repoopenclaw

openclaw/crabbox

A CLI that leases cloud/self-hosted compute, syncs your local diff, and runs commands/tests remotely.

~950 stars; a Go CLI plus a TypeScript coordinator.
40+ infrastructure providers supported.
Core loop: warm a box, sync the diff, run the suite — coordinator deployable on Cloudflare Workers.

View ↗

added by Adam Tomat • 24th Jun 2026

BlogDecoding AI • Paul Iusztin

Build, Configure, or Use As-Is: The Agentic Harness

Argues ~80% of agent harnesses are identical, so decide what to build vs configure vs use off-the-shelf for the other 20%.

Tools and catalogs are mostly YAML — configure, don't build.
Skills are markdown recipes; the memory layer is the piece worth custom-building.
Safety must live in a deterministic layer, not in prompts.

View ↗

added by Adam Tomat • 23rd Jun 2026

BlogDecoding AI • Alejandro Aboy

How Evaluation-Driven Development (EDD) Works for AI Agents

A pre-merge evaluation gate for AI agent changes, using simulated inputs run through the real agent.

An offline gate answering "does it work / did anything regress".
Simulate inputs (drawn from real traces), not outputs.
Rejects always-on prod eval as too costly; runs targeted branch experiments with calibrated binary judges.

View ↗

added by Adam Tomat • 23rd Jun 2026

ToolTessl

Tessl — agent enablement platform

A platform to build, test, distribute, and govern AI agent "skills" at team / enterprise scale.

Security scanning, policy, and audit before a skill deploys.
A shared, versioned skill registry.
Tracks real activation / usage plus evals.

View ↗

added by Tom Harper • 23rd Jun 2026

🛠️ Development ​

🛠️ Development