Sven Pöche

AI Code Generation Code Review Production Eval Software Architecture Spec-Driven Development Static Analysis

Das eigene Defekt-Profil von KI-Code — warum grüne Tests nicht mehr reichen

Zwei Production-Audits zeigen: AI-Code hat ein eigenes Defekt-Profil. Funktionale Korrektheit ist nicht das Problem — Architektur ist es.

01.06.2026 12 min

AI Code Generation Code Review Production Eval Software Architecture Spec-Driven Development Static Analysis

AI-Generated Code's Own Defect Profile — Why Green Tests No Longer Suffice

Two production audits reveal: AI-generated code has its own defect profile. Functional correctness is not the problem — architecture is.

01.06.2026 13 min

Agent Memory AI Hallucination GDPR LLM Security Memory Governance Memory Poisoning OWASP

Halluzination in der Erinnerung — warum Memory-Governance das nächste harte Problem wird

Warum persistierte Halluzination in Agent-Memory fundamental anderes Problem ist als Generation-Halluzination — und welche Governance-Infrastruktur sich gerade formiert.

26.05.2026 16 min

Agent Memory AI Hallucination GDPR LLM Security Memory Governance Memory Poisoning OWASP

Hallucination in Memory — Why Memory Governance Is the Next Hard Problem

Why persistent hallucination in agent memory is a fundamentally different problem from generation hallucination — and which governance infrastructure is forming right now.

26.05.2026 17 min

Serie 7/8

AI Agents Cognitive Psychology Emergent Behavior Generative Agents LLM Memory Architecture

Generative Agents — wie Tulvings Gedächtnismodell emergentes Sozialverhalten erzeugt

Park et al. haben 2023 eine LLM-Agent-Architektur gebaut, die Tulvings episodisches Gedächtnismodell präzise umsetzt — ohne ihn je zu zitieren.

05.05.2026 12 min

Serie 8/8

AI Agents Cognitive Psychology Emergent Behavior Generative Agents LLM Memory Architecture

Generative Agents — How Tulving's Memory Model Produces Emergent Social Behavior

Park et al. built an LLM-agent architecture in 2023 that precisely implements Tulving's episodic memory taxonomy — without ever citing him.

05.05.2026 13 min

AI AI Tools Beginners Guide Claude Getting Started Personal Experience Practical Tips

Erste Schritte mit Claude — was du wissen solltest

Ein persönlicher Einstieg für alle, die mit Claude noch nichts zu tun hatten — mit Alltagsbeispielen und ehrlichen Warnungen.

25.04.2026 14 min

AI AI Tools Beginners Guide Claude Getting Started Personal Experience Practical Tips

Getting Started with Claude — What You Should Know

A personal beginner's guide to Claude — with everyday examples and honest warnings for anyone new to AI chat assistants.

25.04.2026 15 min

Serie 5/8

AI Agents In-Place TTT LLM LoCoMo LongMemEval Memory Benchmarks Test-Time Training

Das vermessene Gedächtnis — und warum ByteDance Memory zurück ins Modell holt

Vier Memory-Benchmarks, eine Full-Context-Obergrenze von 72.9% und ein ICLR-2026-Paper, das Memory zurück in die Modellgewichte holt.

24.04.2026 11 min

Serie 6/8

AI Agents In-Place TTT LLM LoCoMo LongMemEval Memory Benchmarks Test-Time Training

The Measured Mind — and Why ByteDance Is Pulling Memory Back Into the Model

Four memory benchmarks, a full-context ceiling of 72.9%, and an ICLR 2026 paper that pulls memory back into the model weights.

24.04.2026 12 min

Backup Claude Code Data Recovery Developer Story Git JSONL Logs Post-Mortem

Claude Code ist eine Zeitmaschine — ich hab's nur durch einen Fehler erfahren

Wie ich 84% meines gelöschten Blog-Repos mit Claude Codes JSONL-Logs, Git-Reflog, Publii-SQL und drei weiteren Quellen rekonstruiert habe.

20.04.2026 18 min

Backup Claude Code Data Recovery Developer Story Git JSONL Logs Post-Mortem

Claude Code Is a Time Machine — I Only Found Out the Hard Way

How I recovered 84% of my deleted blog repo using Claude Code's JSONL logs, Git reflog, Publii SQLite, and three other sources.

20.04.2026 20 min

AI Tooling Browser DevTools Claude Code Developer Experience Parallel Workflows Session Memory Skill Engineering

Claude Code geht tiefer – Vom Tool-Nutzer zum Tool-Architekt

Vier Shifts — IDE wird optional, Browser kommt zu Claude, Skills ersetzen Prompts, Session-Mining als Gedächtnis — die Claude Code zur Arbeitsumgebung machen.

13.04.2026 11 min

AI Tooling Browser DevTools Claude Code Developer Experience Parallel Workflows Session Memory Skill Engineering

Going Deeper with Claude Code: From Tool User to Tool Architect

Four shifts that turn Claude Code from a tool into a working environment — and what they mean for how you think, not just how you work.

13.04.2026 11 min

Serie 3/8

AI Agents AI Hallucination Cognitive Psychology Confabulation Forgetting Curve LLM Memory Architecture

Halluzination ist kein Bug — 140 Jahre Kognitionsforschung erklären warum

Von Bartletts Konfabulation bis Ebbinghaus' Vergessenskurve — sechs kognitive Modelle, direkt gemappt auf LLMs und Agent-Memory-Architekturen.

09.04.2026 16 min

Serie 4/8

AI Agents AI Hallucination Cognitive Psychology Confabulation Forgetting Curve LLM Memory Architecture

AI Hallucination Isn't a Bug — 140 Years of Cognitive Research Explains Why

From Bartlett's confabulation to Ebbinghaus's forgetting curve — six cognitive models mapped onto LLMs and agent memory architectures.

06.04.2026 17 min

Serie 1/8

AI Agents LangGraph LLM MemGPT Memory Architecture RAG Stateless Systems

KI-Agenten mit Gedächtnis: Von stateless zu produktionsreifen Memory-Architekturen

Jeder LLM-API-Call ist ein Clean Slate — 8 Memory-Architekturen im Vergleich: Context Window, RAG, MemGPT, Mem0, Zep, LangGraph und File-basierte Ansätze.

02.04.2026 11 min

Batch Workflows Channels Claude Code Mobile Workflows Power User Remote Control Security

Claude Code mobil nutzen: Was wirklich limitiert (und was nicht)

Remote Control Bugs, Channels Sicherheitsrisiken und warum bessere Desktop-Workflows mehr bringen als Smartphone-Zugriff — ein Power-User-Erfahrungsbericht.

01.04.2026 12 min

Batch Workflows Channels Claude Code Mobile Workflows Power User Remote Control Security

Using Claude Code on the Go: What Actually Limits You (And What Doesn't)

Why the "Desk Tax" is a myth for power users — and what Claude Code actually needs instead of mobile access.

01.04.2026 13 min

Serie 2/8

AI Agents LangGraph LLM MemGPT Memory Architecture RAG Stateless Systems

AI Agents with Memory: From Stateless to Production-Ready Memory Architectures

Every LLM API call is a clean slate — 8 memory architectures compared: context windows, RAG, MemGPT, Mem0, Zep, LangGraph, and file-based approaches.

30.03.2026 12 min

AI Tooling Claude Code Critique Power User Productivity Security Token Budget

Die Desk Tax ist ein Mythos — zumindest für Power-User

Yanli Lius Desk-Tax-Analyse stimmt — aber nur für Gelegenheitsnutzer. Für Power-User mit parallelen Instanzen sind Token-Budgets das eigentliche Limit.

25.03.2026 13 min

AI Tooling Claude Code Critique Power User Productivity Security Token Budget

The Desk Tax Is a Myth — At Least for Power Users

Why Claude Code's idle capacity problem only applies to one user segment — and the security risks nobody's talking about

25.03.2026 13 min

Browser DevTools Claude Code Developer Tools MCP Parallel Workflows Session Mining Spec-Driven Development

Claude Code Must-Haves — March 2026

What changed since January: session-mining, browser feedback loops, parallel instances. Why deeper integration beats adding more tools.

23.03.2026 11 min

Browser DevTools Claude Code Developer Tools MCP Parallel Workflows Session Mining Spec-Driven Development

Claude Code Must-Haves – März 2026

Browser DevTools MCPs, ContextMine, Session-Mining und parallele Instanzen — was sich seit Januar verändert hat und warum tiefere Integration mehr bringt als mehr Tools.

18.03.2026 11 min

Serie 5/6

AI Development Developer Story Hybrid Validation Language-Agnostic Specs Lessons Learned Software Specifications Spec-Driven Development

651 Commits, 0 Zeilen Code — Warum "fertig" ein Mythos ist

651 Commits, 233 Dokumente, 0 Zeilen Code. Drei Erkenntnisse: sprachagnostische Specs, Hybrid-Validierung und warum "fertig" ein Mythos ist.

16.03.2026 13 min

Serie 6/6

AI Development Developer Story Hybrid Validation Language-Agnostic Specs Lessons Learned Software Specifications Spec-Driven Development

651 Commits, Zero Lines of Code — Why "Done" Is a Myth

651 commits, 233 documents, zero production code. Three lessons: language-agnostic specs, hybrid validation, and why "done" is a myth.

16.03.2026 14 min

Serie 3/6

AI Development Developer Story Lessons Learned Practical Experience Software Specifications Spec-Driven Development Workflow

Spec-Driven Development in der Praxis: Learnings aus 153 Commits in 10 Tagen

153 Commits in 10 Tagen zeigen, wie sich SDD organisch entwickelt — vom Feature-Blitz über Pattern-Konsolidierung bis zur Konsistenz 9.8/10.

09.03.2026 9 min

Serie 4/6

AI Development Developer Story Lessons Learned Practical Experience Software Specifications Spec-Driven Development Workflow

SDD in Practice: Learnings from 153 Commits in 10 Days

153 commits, 102 specs, 4 phases — and a realization on day 5 that I'd accidentally invented Spec-Driven Development. An authentic journey.

09.03.2026 10 min

Serie 1/6

AI Development AI Tooling GitHub Software Specifications Spec-Driven Development Tech Debt Workflow

Spec-Driven Development: Warum die AI-Ära strukturierte Specs braucht

Gezwungener Wechsel von DDD zu SDD: Wie strukturierte Spezifikationen das Chaos von Vibe Coding lösen — mit GitHub Spec-Kit und Thoughtworks-Prinzipien.

02.03.2026 12 min

Serie 2/6

AI Development AI Tooling GitHub Software Specifications Spec-Driven Development Tech Debt Workflow

Spec-Driven Development: Why the AI Era Needs Structured Specs

AI agents generate code fast — but without structured specs, projects drown in tech debt. Here's how Spec-Driven Development fixes that.

02.03.2026 13 min

Serie 7/8

API Costs Developer Story Lessons Learned Multi-Agent Systems Platform Choice Post-Mortem Roo Code

Das Roo Code Agile Software Development Team: Ende & Neubeginn

Nach drei Monaten und $6.000 API-Kosten — was bleibt: Handover-Patterns, Quality Gates und der Übergang zu Claude Code.

09.02.2026 9 min

Serie 5/8

Context Overflow Developer Story Lessons Learned Mode Drift Multi-Agent Systems Over-Engineering Roo Code

Probleme, Learnings und 36% Refactoring: Was beim Bau meines AI-Teams wirklich passierte

Over-Engineering, Mode Drift, Context-Overflow — echte Probleme und Lösungen aus der Entwicklung eines Multi-Agent-Systems mit 15 KI-Modi.

02.02.2026 15 min

Serie 3/8

AI Development Team Architecture Handovers Mode Orchestration Multi-Agent Systems Quality Gates Roo Code

Wenn 15 AI-Modi zur orchestrierten Symphonie werden

Wie Handover-Patterns, Quality Gates und eine Rule Hierarchy 15 spezialisierte AI-Modi zu einem orchestrierten Agile Development Team koordinieren.

26.01.2026 9 min

Serie 8/8

API Costs Developer Story Lessons Learned Multi-Agent Systems Platform Choice Post-Mortem Roo Code

The Roo Code Agile Software Development Team: End and New Beginning

From 118 commits in three days to a pragmatic pivot — the final chapter of building an AI development team.

26.01.2026 9 min

Serie 1/8

AI Development Team AI Tooling Developer Story Mode Orchestration Multi-Agent Systems Roo Code VSCode

Von chaotischen AI-Tools zum strukturierten Entwicklungsteam: Wie ich mein eigenes "Agile Software Development Team" in Roo Code erschuf

Erzwungener Wechsel von JetBrains zu VSCode wurde zum Wendepunkt — 15 spezialisierte AI-Modi, 118 Commits in 3 Tagen, ein selbst-organisierendes Entwicklungsteam.

19.01.2026 6 min

Serie 6/8

Context Overflow Developer Story Lessons Learned Mode Drift Multi-Agent Systems Over-Engineering Roo Code

Problems, Learnings, and 36% Refactoring: What Really Happened Building My AI Team

Over-engineering, mode drift, context overflow — real problems and solutions from building a 15-mode AI development team.

19.01.2026 15 min

Claude Code Context7 Developer Tools Language Server MCP Security Superpowers

Claude Code Must-Haves - Januar 2026

Language Server, Superpowers, Context7 und v2.1.0 Security-Fix — meine essentiellen Tools nach 3 Monaten intensiver Nutzung.

12.01.2026 21 min

Claude Code Context7 Developer Tools Language Server MCP Security Superpowers

Claude Code Must-Haves - January 2026

Language servers, Superpowers, Context7, and the v2.1.0 security fix — my essential tools after 3 months of intensive use.

12.01.2026 22 min

Serie 4/8

AI Development Team Architecture Handovers Mode Orchestration Multi-Agent Systems Quality Gates Roo Code

When 15 AI Modes Become an Orchestrated Symphony

How I built an AI development team that coordinates like a real agile squad — with zero information loss between 15 specialized modes.

05.01.2026 9 min

Serie 2/8

AI Development Team AI Tooling Developer Story Mode Orchestration Multi-Agent Systems Roo Code VSCode

From Chaotic AI Tools to a Structured Development Team: How I Built My Own "Agile Software Development Team" in Roo Code

A forced VSCode migration sparked the creation of a complete AI development team with 15 specialized modes — 118 commits in 3 days, a self-organizing system.

29.12.2025 7 min

Adaptive Systems AI Agents LLM Metacognition Reflexion Self-Improvement TRAP

Metacognition and Self-Learning AI Agents: From "Thinking About Thinking" to Adaptive AI Systems

How metacognition transforms reactive chatbots into adaptive, self-improving AI systems

12.12.2025 12 min

Adaptive Systems AI Agents LLM Metacognition Reflexion Self-Improvement TRAP

Metacognition und selbstlernende KI-Agenten: Vom "Denken über das Denken" zu adaptiven AI-Systemen

Warum GPT-3.5 den Bat-Ball-Test versagt, GPT-4 mit Chain-of-Thought aber besteht — und wie TRAP, Reflexion und LATS KI-Agenten das Lernen aus Fehlern beibringen.

11.12.2025 12 min