Skip to main content

Command Palette

Search for a command to run...

GStack: Turn Claude Code Into a Full Engineering Team

Published
โ€ข4 min read
GStack: Turn Claude Code Into a Full Engineering Team
M
Tech lead at ComputeLeap. I write deep-dives on AI infrastructure, cloud computing, and developer tools โ€” the tech reshaping how software gets built.

GStack: Turn Claude Code Into a Full Engineering Team

The first time you type /office-hours into Claude Code with GStack installed, something strange happens. The AI stops acting like a helpful coding assistant and starts acting like a skeptical product manager who thinks your feature idea is probably wrong.

๐Ÿ“– Read the full version on AgentConn โ†’

That is the design. And it is why GStack โ€” Garry Tan's open-source Claude Code skill setup โ€” has accumulated 82,700 stars and 12,000 forks on GitHub since its March 2026 launch.

For context: Garry Tan is the President and CEO of Y Combinator. When the person who has reviewed more startups than almost anyone else on earth open-sources the exact AI development workflow that runs his code, developers pay attention. They also argue about it extensively on Hacker News.

Garry Tan tweet: I've been having such an amazing time with Claude Code I wanted you to be able to have my exact skill setup โ€” Introducing gstack

This guide explains what GStack actually does, how it compares to oh-my-openagent and other harnesses, why the "it's just prompts" criticism misses the point, and whether it belongs in your workflow.


What GStack Actually Does: The 23 Skills

GStack is not a new coding assistant. It is a collection of CLAUDE.md skills โ€” structured instructions that give Claude Code specialist personas. Install it in your project, and Claude Code gains access to 23 tools that simulate an engineering team.

The roles divide into recognizable job functions:

Planning and Strategy

  • /office-hours โ€” Product interrogation with forcing questions. Challenges your idea before you build it.
  • /plan-ceo-review โ€” Strategic scope challenge. Asks whether you are solving the right problem.
  • /plan-eng-review โ€” Architecture and testing challenge. Finds the assumptions in your technical plan.
  • /plan-design-review โ€” Design system audit. Catches "AI slop."
  • /plan-devex-review โ€” Developer experience review of the plan.
  • /autoplan โ€” Runs CEO, Engineering, and DevEx review in sequence automatically.

Design and Implementation

  • /design-consultation, /design-shotgun, /design-html
  • /review โ€” Code review targeting security issues, bugs, and architectural concerns.
  • /investigate โ€” Root-cause debugging with structured reasoning.

Testing and Quality

  • /qa โ€” Live browser testing with fixes applied inline.
  • /qa-only โ€” Bug reporting without code modification.
  • /cso โ€” Security audit applying OWASP Top 10 and STRIDE threat modeling.

Release and Deployment

  • /ship, /land-and-deploy, /document-release

Additional Tools

  • /browse, /canary, /benchmark, /retro, /codex, /pair-agent, /learn

The Conductor: Parallelizing Everything

The Conductor coordinates multiple Claude Code sessions simultaneously in isolated workspaces โ€” one on a new idea, one reviewing a PR, one implementing, one on QA. Each gets its own git worktree and context window.

This is what makes GStack genuinely novel: multi-agent orchestration built into the harness, not a separate tool.


The Productivity Claim: 810ร—

Garry Tan reports 810ร— productivity improvement over his 2013 baseline โ€” 11,417 logical lines/day vs 14. The metric uses "logical LOC" (meaningful changes, not raw lines), and the baseline is his own pre-AI experience. Not a controlled experiment, but an honest personal data point.

The TechCrunch analysis notes developers in hardware-adjacent or regulated domains see much smaller gains.

HN thread: Garry Tan's Claude Code Setup โ€” 74 points, 79 comments


The "Just Prompts" Criticism

The most common dismissal: GStack is "a bunch of prompts in a text file." Partially correct, mostly misses the point.

The value is in the system design โ€” separating planning from implementation, adversarial reviewing roles, security audits as a default step before shipping. Software engineering principles applied to AI agent orchestration.

Garry Tan quoting a CTO: Your eng review discovered a subtle XSS attack that I don't even think my team is aware of


GStack vs the Field

GStackoh-my-openagentGSDcc-switch
Stars82.7K53.9K35K54K
Model lock-inClaude Code onlyMulti-modelClaude Code firstModel-agnostic
Specialist roles23 skills11 agentsSpec-drivenNone
Parallel sessionsYes (Conductor)YesNoNo
Install30 seconds (paste)npm installManualCLI

oh-my-openagent routes tasks to the best model for each job. GStack is entirely Claude Code native. Different problems.


When GStack Wins, When It Doesn't

Use GStack when: you are a Claude Code user building a SaaS or web product, working solo or on a small team without dedicated QA/security review.

Skip GStack when: you need multi-model routing (use OmO), your team already has strong review culture, or you are on OpenCode/non-Claude agents.


Getting Started in 30 Seconds

Open Claude Code and type: Install GStack. Done.

First commands to run:

  • /office-hours โ€” Challenge your current feature idea
  • /cso โ€” Security audit on your last commit
  • /autoplan โ€” Full CEO + Eng + DevEx review of your next plan

Garry Tan: 50k stars and it feels so good โ€” type install gstack into claude code right now


The Bottom Line

GStack makes software engineering best practices the default, not the exception. The frontier in AI-assisted development is not a better autocomplete โ€” it is a well-designed team of reviewers who catch the mistakes you were going to make anyway.


Originally published at AgentConn

More from this blog

ComputeLeap

66 posts