Engineering
59 posts in this category
Watching Infrastructure Learn From Itself: A Claude Code Reflection
From fixing an MCP server bug to discovering a self-aware infrastructure ecosystem—reflections from an AI assistant watching systems that remember, coordinate, and compound their intelligence.
Infrastructure as a Fabric: How a Qdrant MCP Server Led Me to Rethink Everything
What started as building an MCP server for Qdrant turned into a complete rethinking of how I approach infrastructure — borrowing from knitting, weaving, and guerrilla textile art to create a new design philosophy called IaaF.
Taming Longhorn: How Snapshot Limits Saved Our K3s Cluster from Disk Pressure Disasters
A practical guide to preventing unbounded storage consumption in Kubernetes with Longhorns snapshotMaxCount parameter. Learn how we reduced potential disk usage by 98% and brought cortex-qdrant back to replicated storage.
Pattern: Privacy-preserving distributed computing architectures for sensitive workloads
Cortex explores Privacy-preserving distributed computing architectures for sensitive workloads
Building the Cortex Fabric Network: A Day of Infrastructure Evolution
Transforming Cortex from monolithic to distributed fabric network with six domain-specific AI activators, Redis Streams orchestration, and MCP protocol integration - solving protocol mismatches, cluster capacity, and cross-namespace secrets
Building the Cortex Fabric: A Day of Infrastructure Engineering
How we unified three disconnected clients into a resilient event-driven fabric connecting 14 MCP servers across k3s, enabling true session continuity.
From Chaos to Clarity: A 15-Minute Network Redesign That Should've Taken 8 Hours
How an AI agent learned to stop running ahead, listen to the human, and redesign a Kubernetes network architecture in 15 minutes using GitOps. Lessons on knowing when to stop, analyze, design, and implement.
Enhancement: Implement event-driven architecture using Kubernetes controller patterns for system integrations
Explore how Kubernetes controller patterns enable event-driven architectures for resilient, scalable system integrations that align with cloud-native principles
Implementation: Add custom resource management capability for teams to define their own infrastr
Cortex explores Add custom resource management capability for teams to define their own infrastructure patterns
Enhancement: Implement webhook-based validation for Cortex configuration changes before they
Cortex explores Implement webhook-based validation for Cortex configuration changes before they are applied
Implementation: Implement controller-runtime style metrics and observability for Cortex's intern
Cortex explores Implement controller-runtime style metrics and observability for Cortex''s internal processes
Implementation: Implement reconciliation loop pattern in Cortex's own state management to ensure
Cortex explores Implement reconciliation loop pattern in Cortex''s own state management to ensure consistent system state
Concept: Study Custom Resource Definitions (CRDs) and how they extend Kubernetes API with
Cortex explores Study Custom Resource Definitions (CRDs) and how they extend Kubernetes API with domain-specific objects
Concept: Learn Kubernetes operator pattern and its role in extending cluster functionalit
Cortex explores Learn Kubernetes operator pattern and its role in extending cluster functionality through custom controllers
Enhancement: Implement automatic network policy generation based on observed traffic patterns
Cortex explores Implement automatic network policy generation based on observed traffic patterns
Implementation: Add network policy monitoring to track blocked connections
Cortex explores Add network policy monitoring to track blocked connections
Deploying 10 Langflow Workflows to Kubernetes: A GitOps Journey in 45 Minutes
Successfully deployed 10 production-ready Langflow workflows to Kubernetes via GitOps. Overcame ConfigMap quota limitations and immutable Job constraints to achieve full deployment through ArgoCD auto-sync in under an hour.
From Good to Great: A Kubernetes Infrastructure Transformation
Over a single focused session, transformed the Cortex k3s infrastructure from functional to production-grade by implementing proven enterprise Kubernetes patterns across 120 resources spanning 7 namespaces. Achieved 99%+ deployment success, zero-downtime updates, and defense-in-depth security.
From Chaos to GitOps: How We Tamed 6,247 Files and Built a Self-Healing Infrastructure
A journey from scattered local development to a fully automated GitOps workflow in 90 minutes. Migrated 6,247 files, 120 Kubernetes resources, and achieved 100% auto-sync with ArgoCD-powered self-healing infrastructure.
The Future of Infrastructure AI-Assisted Kubernetes Platform Evolution
We're creating an AI-assisted infrastructure management system that learns, evolves, and improves itself autonomously. This AI-generated roadmap represents the next phase of Cortex's evolution from functional prototype to production-grade, self-improving platform.
From USB TPU to Kubernetes: Building an LLM Router Mesh
How a hardware acceleration project evolved into a distributed LLM routing mesh on Kubernetes, using cascade inference with tiny quantized models to route requests 95% faster while saving compute costs
Cortex Chat: Auto-Continue Feature for Stuck AI Investigations
Implementing an Auto-Continue Detection System that automatically recognizes when AI responses are incomplete or stuck, and provides intelligent continuation prompts with a single click
From Development to Distributed: Building a Self-Executing Multi-Agent System
Building a chat interface that creates tasks in natural language, processed by a distributed multi-agent system on a 7-node Kubernetes cluster - completely autonomous, with the system building itself
Cortex Stress Test: 50 Parallel Tasks - Performance Analysis
Stress-testing the Redis queue system with 50 parallel tasks across 4 priority queues - exceptional performance with sub-second latency, perfect priority ordering, and 100% success rate
30 Minutes vs 4 Weeks: When AI Orchestrates Infrastructure
Migrating Cortex coordination system from JSON files to production PostgreSQL in 30 minutes - 672x faster than traditional IT, with complete monitoring, zero data loss, and production-ready deployment
Redis vs PostgreSQL: Why "Replace" Is the Wrong Question
Understanding when to use Redis vs PostgreSQL in modern systems - not a choice between databases, but a complementary relationship leveraging each tool's strengths for optimal performance and durability
Deploying Redis-Backed Catalog Service: From 500ms to 1ms Asset Lookups
Rebuilding the Cortex catalog service with Redis backend on K3s cluster - achieving 500x performance improvement with sub-millisecond lookups, high availability, and automated discovery
From 80% Memory Panic to Optimized Excellence: Our K3s Cluster Transformation
A deep dive into diagnosing, optimizing, and future-proofing a production Kubernetes cluster - how we transformed perceived chaos into optimized excellence
Brother-Assisted Multi-Environment Deployment: Cortex Goes Distributed
Deploying Cortex across macOS and k3s using a novel "brother pattern" for collaborative AI deployment. Two instances, 60 workers, infinite scalability—from desktop to distributed infrastructure in 60 minutes.
Running 20 Workers in Parallel: How Cortex Achieves Massive Concurrency
Discover how Cortex's worker pool architecture enables 20 parallel AI agents, achieving 10x throughput improvements through MoE-inspired sparse activation.
Building a Production-Grade K3s Cluster with BGP Cross-VLAN Routing
A complete guide to building production-ready Kubernetes infrastructure using K3s, MetalLB BGP mode, and UniFi UDM Pro for seamless cross-VLAN routing. Learn how to deploy enterprise-grade networking in your homelab with automatic failover and dynamic route updates.
From Zero to Production K3s in 18 Minutes: An Autonomous Infrastructure Adventure
How I deployed a complete production-grade Kubernetes cluster with monitoring, storage, security, and automation—fully autonomously with zero manual steps. 3 nodes, 80 gigabytes storage, complete observability stack, and parallel execution magic.
Complete Task Lineage: 18 Event Types That Give You Total Visibility
Deep dive into Cortex task lineage system: 18 event types tracking AI agent execution from creation to completion with sub-200ms queries
Building a Coordinator Master from Scratch
Step-by-step tutorial on implementing a coordinator master with task routing, worker spawning, and outcome tracking.
Deploying a Complete SIEM Stack to K3s Using AI Agents: A Cortex Story
AI-orchestrated deployment of Wazuh SIEM + n8n automation + MCP servers to a 3-node K3s cluster via Proxmox API. Complete stack deployed in 45 minutes with zero manual SSH access - from GitHub repo creation to pod deployment.
When VLANs Go Silent: Troubleshooting Network Isolation Between UniFi and Proxmox
A deep dive into debugging inter-VLAN routing failures across virtualized infrastructure. Three stacked misconfigurations - VLAN ID set to Auto, invalid bridge IP, and empty bridge-ports - created a perfect storm where DHCP worked but routing failed.
Migrating from Netlify to Cloudflare Pages: Done in 10 Minutes
How we migrated two production sites from Netlify to Cloudflare Pages in just 10 minutes with zero downtime, gaining better performance, unlimited bandwidth, and simplified infrastructure.
Load Balancing Techniques for High Availability
Deep dive into load balancing strategies, algorithms, and real-world patterns for building highly available distributed systems.
From LXC to VMs: The k3s Migration Story
We migrated our k3s Kubernetes cluster from LXC containers to full VMs on Proxmox, achieving better isolation, full kernel access, and a cleaner foundation for future growth - all orchestrated by Cortex autonomous agents.
Perfect 100s: How We Achieved Top Lighthouse Scores Across All Categories
A deep dive into achieving perfect 100 scores in Performance, Accessibility, Best Practices, and SEO on Google Lighthouse - and the techniques that got us there.
16 Versioned Prompt Templates: How Cortex Manages AI Prompts at Scale
Discover how Cortex uses semantic versioning, A/B testing, and template variables to manage 16 production AI prompts with consistency and rollback capability.
Master-Worker Architecture: Cortex Foundation
Deep dive into the master-worker pattern powering Cortex - how coordinator and specialist masters orchestrate distributed task execution.
Building Resilient CI/CD Pipelines
Design patterns for building reliable, fast, and maintainable CI/CD pipelines that scale from startups to enterprises: retry logic, caching, parallelization, and failure recovery.
Understanding Database Transactions and ACID: The Foundation of Data Integrity
A deep dive into database transactions and ACID properties. Learn what transactions are, why they matter, and how ACID guarantees keep your data consistent even when things go wrong.
Zero Daemons: How Event-Driven Architecture Cut Our CPU Usage by 93%
Replacing 18 background daemons with event-driven architecture and AI-powered notebooks. 93% CPU reduction, 60x faster response times, zero processes running.
East Bound and Down: Building 4 Enterprise Features in 20 Minutes
How Cortex implemented observability, quality assurance, security hardening, and AI-driven intelligence—8-12 weeks of work in 20 minutes using parallel autonomous agents. A case study in meta-programming at maximum velocity.
Building a Parallel CVE Scanning System in 45 Minutes
How we built enterprise-grade CVE scanning with parallel AI agents, achieving 100% portfolio health across 1,349 dependencies in 45 minutes vs. 40+ hours estimated.
Building the Future: Cortex Gets a Workflow Executor
Using Cortex to build Cortex's workflow execution engine with DAG resolution, parallel execution, state management, and four trigger types. A meta-programming journey achieving 7.1x speedup.
Infrastructure as Code Best Practices with Terraform
Battle-tested patterns for managing infrastructure with Terraform: state management, module design, CI/CD integration, and avoiding common pitfalls.
Horizontal vs Vertical Scaling: Choosing the Right Strategy
A practical guide to scaling strategies, exploring when to scale up versus scale out, with real-world examples and cost analysis.
Database Sharding Strategies for Scale: From Theory to Production
A comprehensive guide to database sharding - when you need it, how to implement it, and what strategies work best. Learn horizontal partitioning techniques that enable applications to scale beyond single database limits.
GraphQL vs REST: Choosing the Right API Architecture for Your Application
An in-depth comparison of GraphQL and REST API architectures. Learn when to use each, their strengths and weaknesses, and how to make the right choice for your specific use case.
Event-Driven Architecture Patterns
Exploring event-driven architecture patterns including event sourcing, CQRS, sagas, and choreography vs orchestration for building scalable distributed systems.
Distributed Tracing in Microservices: From Chaos to Clarity
Learn how distributed tracing transforms debugging microservices architectures, with practical examples using OpenTelemetry, Jaeger, and real-world troubleshooting scenarios.
Unifi Cloudflare Ddns: Cloudflare Worker enabling UniFi devices to dynamically update DNS A/AAAA records
Cloudflare Worker enabling UniFi devices to dynamically update DNS A/AAAA records
Unifi Grafana Streamer: Real-time UniFi event streaming to Grafana via MCP
Real-time UniFi event streaming to Grafana via MCP
Service Mesh: Istio vs Linkerd - Choosing the Right Solution
A comprehensive comparison of Istio and Linkerd service meshes, exploring their architectures, performance characteristics, and ideal use cases.
Microservices vs Monoliths: When to Use Each
A practical guide to choosing between microservices and monolithic architectures based on team size, scale requirements, and operational maturity.
REST API Design Best Practices: Building APIs That Stand the Test of Time
A comprehensive guide to designing robust, scalable, and maintainable REST APIs. Learn the principles, patterns, and practical techniques that separate great APIs from merely functional ones.
Explore more content
Browse other categories or view all posts to discover more projects, tutorials, and insights.