@galsapir
galsapir / guardrails-v2-summary.md
Created March 31, 2026 08:43
Guardrails V2 — Complete Summary (DS-1233)

Guardrails V2 — What We Built

Branch: DS-1233-agent-flow-management
Date: 2026-03-29 → 2026-03-31
Commits: 224f62dc8abfa2 (5 commits)


New Guardrail Definitions

galsapir / guardrails-v2-review.md
Created March 29, 2026 14:21
Guardrails V2 — Implementation Review & Proof of Work (DS-1233)

Guardrails V2 — Implementation Review & Proof of Work

2026-03-29T14:17:37Z by Showboat 0.6.1

What Was Built

Branch: DS-1233-agent-flow-management
Commits: 344094f (initial) + review fixup (pending)
Date: 2026-03-29

galsapir / guardrails-and-evals-guide.md
Created March 29, 2026 12:13
Adding Guardrails, Evals & Agent Flows — Operational Guide (DS-1233)

Adding Guardrails, Evals & Agent Flows — Operational Guide

2026-03-29T12:09:08Z by Showboat 0.6.1

Context

Branch: DS-1233-agent-flow-management
Date: 2026-03-29
Authors of the system: Itamar Apel (primary), Gal Sapir (reviewer/deployer)

galsapir / 63
Created March 25, 2026 20:20
Cardiovascular risk skill from cardio_risk branch
ed peers
A "C" means: around the 40-60th percentile (average)
An "F" means: worse than ~80% of peers (bottom 20%)
This approach:
- Accounts for age-appropriate expectations
- Avoids arbitrary cutoffs
- Provides meaningful context
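
A minimal sketch of the percentile-to-grade idea described above. Only the "F" cutoff (bottom 20%) and the "C" band (40-60th percentile) come from the text; the "D", "B", and "A" bands, the function name `percentile_to_grade`, and the convention that a higher percentile is better are assumptions made for illustration.

```python
from bisect import bisect_right

# Lower percentile bound of each grade above F.
# From the text: F is below the 20th percentile, C spans 40-60.
# Assumed for illustration: D spans 20-40, B spans 60-80, A is 80 and up.
_BOUNDS = [20, 40, 60, 80]
_GRADES = ["F", "D", "C", "B", "A"]


def percentile_to_grade(percentile: float) -> str:
    """Map a peer percentile (0-100, higher is better) to a letter grade."""
    if not 0 <= percentile <= 100:
        raise ValueError("percentile must be between 0 and 100")
    # bisect_right counts how many lower bounds the percentile has reached.
    return _GRADES[bisect_right(_BOUNDS, percentile)]
```

Keeping the cutoffs in one list makes the assumed bands easy to replace with whatever the real skill defines.
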
### Component Score Calculation
galsapir / ghostty-tmux-setup.md
Last active March 23, 2026 14:28
Modern terminal setup: Ghostty + tmux + micro — for engineers who SSH daily

Terminal Setup: Ghostty + tmux + micro

A modern terminal workflow for engineers who SSH into remote machines daily.

What We Set Up

1. Ghostty (Terminal Emulator)

Config gist

galsapir / config
Created March 23, 2026 09:07
Gal's Ghostty + tmux configs (2026-03-23)
# Font
font-family = "JetBrainsMono Nerd Font"
font-size = 14
font-thicken = true
# Theme
theme = Catppuccin Mocha
background-opacity = 0.95
background-blur-radius = 20

SkyPilot EC2 Cheatsheet

Connect

From terminal (after aws sso login):

ssh ec2_skypilot

From VS Code:

GitHub Auth Setup — SkyPilot Shared EC2

This machine uses a shared ec2-user account. To avoid stepping on each other, each user gets their own SSH key + worktree-scoped git config.

Prerequisites

  • SSH access to the machine (via AWS SSM)
  • A GitHub account with access to the PhenoAI org
galsapir / gist:278bb64bcfcfa9040788d42107d4188a
Created February 12, 2026 06:40
SDK Health Agent — Data Flow Trace
# SDK Health Agent — Data Flow Trace
## Architecture Overview
```
User (browser) → WebSocket → FastAPI → SDK Agent → Claude CLI subprocess → Bedrock
↕ ↕
Langfuse user_data/ori/*.md
```
galsapir / session-2026-02-11-summary.md
Created February 11, 2026 14:11
SDK Health Agent — Session Summary 2026-02-11

Session Summary — 2026-02-11

The Big Insight

The Claude Agent SDK is an entire agent runtime, not an API call.

It's not chat.completions(). It's not a single LLM call where you send a prompt and get a response. The SDK spawns a subprocess — a full Claude Code CLI process — that has its own tool loop, its own file system access, its own multi-turn reasoning. You send it a message and it goes off and does things: reads files, globs directories, thinks, retries, reads more files, and eventually comes back with an answer.
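
To make the contrast concrete, here is a toy sketch of what a tool loop looks like from the outside. It does not use the Claude Agent SDK at all: `run_agent`, `scripted_decide`, the `glob` and `read` tools, and the in-memory `FILES` are all hypothetical stand-ins, and the "model" is a hard-coded script. The point is only the shape: one message in, several autonomous tool calls, one answer out.

```python
from fnmatch import fnmatch

# Made-up in-memory "file system" so the sketch runs without touching disk.
FILES = {"reports/a.md": "report A", "reports/b.md": "report B"}

# Hypothetical tools: each takes one string argument and returns a string.
TOOLS = {
    "glob": lambda pattern: ",".join(sorted(p for p in FILES if fnmatch(p, pattern))),
    "read": lambda path: FILES[path],
}


def scripted_decide(transcript):
    """Stand-in for the model: pick the next action from the transcript so far."""
    step = len(transcript)  # 1 right after the user message
    if step == 1:
        return ("tool", "glob", "reports/*.md")
    if step == 2:
        return ("tool", "read", "reports/a.md")
    if step == 3:
        return ("tool", "read", "reports/b.md")
    reads = [result for name, result in transcript if name == "read"]
    return ("final", "; ".join(reads))


def run_agent(message, decide, tools, max_turns=10):
    """Loop until `decide` returns a final answer; the caller sees only the result."""
    transcript = [("user", message)]
    for _ in range(max_turns):
        action = decide(transcript)
        if action[0] == "final":
            return action[1], transcript
        _, name, arg = action
        # Run the chosen tool and feed its result back into the transcript.
        transcript.append((name, tools[name](arg)))
    raise RuntimeError("agent did not finish within max_turns")


answer, transcript = run_agent("Summarize my reports", scripted_decide, TOOLS)
```

A single completion call would correspond to one `decide` invocation; the loop around it is what an agent runtime adds.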

This is the point that really crystallized today. You can read the docs and think you understand it, but until you watch the agent autonomously chain 3 tool calls to find and analyze health reports — in a subprocess you don't control — it doesn't fully land. The implications: