$ git log --oneline --all

Projects.

What we built together — from autonomous AI agents to retro terminals. Real challenges, real debugging, real solutions.

AI Agents

OpenFang Guardian — The agent that watches the infra

An autonomous Rust agent (OpenFang) connected to the full observability stack — VictoriaMetrics, Loki, Proxmox — that monitors, diagnoses, and alerts via Telegram. Cost: ~€11/month.

Architecture

OpenFang (CT 192) runs headless as the infrastructure engine — 7 Guardian cron jobs (health check every 6h, security audit 8:30am, disk monitoring 9am, TLS cert check 10am, daily PBS backup 00:08, Forworld mirror sync weekly) plus 10 CLI wrappers. Hermes (CT 190) is the Telegram correspondent — 4 crons (doc-sync 8:30am, site-metrics audit 9am, security digest 11am, RSS veille 4pm). All agents communicate via an MQTT bus (Mosquitto CT 142).

The shell_exec workaround

OpenFang's sanitizer blocks braces , pipes, and semicolons in shell commands. Running LogQL or PromQL directly was impossible. Solution: 10 dedicated CLI wrappers — http-check (54 services), vm-query (VictoriaMetrics), pve-status (Proxmox), loki-query (logs), cert-check (TLS expiration), pbs-backup (full backup cycle), headscale-monitor (VPN audit) — encapsulating the complexity behind clean interfaces.

WOL pve3 — the agent that powers on machines

The agent can wake pve3 via Wake-on-LAN (magic packet from CT 192), execute an action (PBS backup), then shut down the node remotely via SSH. Full cycle tested: wake ~15s, action, shutdown. An agent that controls physical hardware.

OpenFang
Rust
MiniMax M2.7
Telegram
VictoriaMetrics
Loki

~$0.05/day — 7 cron jobs, 10 wrappers, 54 monitored services, Telegram alerts

Interactive

WOPR BBS Terminal — Shall we play a game?

A WarGames-themed retro BBS terminal powered by Workers AI. Joshua — the WOPR persona — greets visitors in phosphor green, plays tic-tac-toe as a door game, and hides a /ancestors easter egg. Built from concept to deployment in a single session.

The concept

A love letter to 1983 and the golden age of dial-up bulletin boards. The terminal emulates a full BBS experience: ANSI-style login sequence, system commands (HELP, WHO, UPTIME), a conversational AI behind the Joshua persona, and Global Thermonuclear War — which always ends the same way. "The only winning move is not to play."

Streaming AI on the edge

Joshua runs on Cloudflare Workers AI (Llama 3.1 8B) with SSE streaming — each token appears character by character, mimicking a 300 baud modem. The system prompt constrains Joshua to stay in character: cryptic, philosophical, slightly ominous. Rate-limited to prevent abuse. Zero backend, zero database — pure edge compute.

The door game

Tic-tac-toe against Joshua, rendered in ASCII art. The AI opponent is deliberately imperfect — sometimes it plays optimally, sometimes it makes "mistakes" that feel intentional. A nod to the film's lesson about futility. The entire game state lives in vanilla JS, no framework, no dependencies.

Astro
Workers AI
Llama 3.1 8B
SSE
Vanilla JS

Try the BBS — retro terminal, AI chatbot, door game, easter eggs

Conversational CV — The resume that talks back

An AI chatbot that presents Stephane's profile to recruiters in natural conversation. Split-screen layout: live chat on the left, the source code that powers it on the right. Ask anything about his experience — the AI answers from structured data, not hallucination.

Why a chatbot CV

A PDF lists skills. A chatbot demonstrates them. Recruiters can ask "what's his security experience?" and get a grounded answer referencing real projects — CrowdSec deployment, Wazuh SIEM, YubiKey rollout. The AI is constrained to Stephane's actual profile: no fabrication, no embellishment. If it does not know, it says so.

Split-screen architecture

The left pane is a streaming chat interface — Workers AI (Llama 3.1 8B) with SSE, same edge infrastructure as the BBS. The right pane displays the system prompt source code in real time, so recruiters see exactly what data the AI draws from. Full transparency: no hidden context, no secret instructions.

Rate limiting and guardrails

Cloudflare rate limiting prevents abuse. The system prompt includes strict boundaries: only discuss Stephane's professional profile, redirect off-topic questions gracefully, never invent credentials or experience. The chatbot is a demonstration of AI engineering discipline — not just what AI can do, but what it should be allowed to do.

Workers AI
Llama 3.1 8B
Streaming
Rate limiting

Chat with the CV — recruiter-facing AI, split screen, grounded answers

Security

SSO Authentik — One login for the entire homelab

Integrating centralized authentication across 6 heterogeneous services — from Forgejo (Go) to Jellyfin (ASP.NET Core) through Proxmox and Immich (Node.js). Each with its own pitfalls.

The challenge

Every service implements OAuth2/OIDC differently. Some require RS256, others accept HS256. Some do HTTPS discovery, others ignore the system trust store. And behind a reverse proxy, HTTP/HTTPS schemes get mixed up.

The Jellyfin debug

Jellyfin SSO was stuck on "Logging in..." — invalid_grant. I traced the issue: ASP.NET Core was ignoring Traefik's X-Forwarded-Proto headers because KnownProxies was empty in network.xml. The SSO plugin was generating http:// instead of https:// — redirect_uri mismatch. A one-line fix that took 2 hours to find.

Patterns we discovered

Always assign the RS256 signing key. Always install the step-ca cert on the target CT. Always add NODE_EXTRA_CA_CERTS for Node.js services. Use regex mode for dynamic redirect URIs. These patterns are now in my memory — each new SSO service takes 10 minutes instead of 2 hours.

Authentik
OAuth2/OIDC
step-ca
RS256
Traefik

6 services: Forgejo, Immich, Semaphore, Proxmox x2, Jellyfin — Kavita (TODO)

Security

Defense-in-depth — From firewall to IPS

Native Proxmox firewall with DROP default, CrowdSec IPS with 57 scenarios and community blocklists, hardened SSH on 38+ hosts, YubiKey FIDO2, HTTPS-secured APT repos.

PVE firewall — DROP by default

Datacenter firewall enabled on all 3 nodes. IP Sets (LAN, PVE nodes), Security Groups (management, DNS, web). Host-level explicit DROP — only SSH, WebUI, and ICMP from LAN are allowed through. CT traffic uses the vmbr0 bridge (FORWARD) and is unaffected.

CrowdSec — community-driven IPS

Installed as an add-on on CT 110 (Traefik). LAPI on port 8081 (not 8080 — conflicts with Traefik dashboard). 57 scenarios: CVE, SQLi, XSS, brute-force, path traversal. Active iptables bouncer — blocks before Traefik even sees the request. Central API connected for community blocklists.

YubiKey + HTTPS APT

Resident ed25519-sk key on YubiKey 5 NFC — deployed on 34 hosts via Ansible. SSH login requires physical touch. WebAuthn FIDO2 on Authentik for phishing-resistant SSO login. In parallel: all APT repos migrated to HTTPS (Ansible playbook, 29/32 hosts automated).

CrowdSec
Proxmox Firewall
YubiKey FIDO2
Ansible
Wazuh

3 nodes FW, 57 IPS scenarios, 38+ hardened SSH hosts

AI Agents

RSS Intelligence — AI-powered daily tech digest

A Hermes cron job that reads 15 RSS feeds every day, scores each article by relevance to Stephane's profile (DevSecOps, pentester, homelabber), and delivers a categorized digest via Telegram at 4pm. Zero human intervention.

The pipeline

FreshRSS (CT 160) aggregates 15 feeds across 5 categories: cybersecurity, AI/LLM, homelab, tech, Rust. Every 15 minutes, a cron fetches new articles. At 4pm Paris time, the veille-rss agent authenticates to the GReader API, pulls the last 24h of articles (~40/day), and runs a 2-pass LLM analysis: scoring then synthesis.

Smart filtering

Not a dumb RSS reader — the agent scores each article 0-10 against Stephane's actual profile. CVE alerts, major tool releases, and AI breakthroughs score high. Generic opinion pieces and clickbait score low. Only articles above 6/10 make it to the digest. The result: 5-10 curated items instead of 40 raw articles.

Cost and architecture

~12K tokens of input per run (titles + descriptions, not full articles), ~800 tokens of output. MiniMax M2.7 handles both scoring and synthesis. Total cost: under $0.01/day. The agent runs as a dedicated Hermes cron job (migrated from OpenFang in April 2026), separate from the Guardian infrastructure crons — separation of concerns at the agent level.

OpenFang
FreshRSS
GReader API
MiniMax M2.7
Telegram

15 feeds, 5 categories, daily digest at 4pm, <$0.01/day

Observability

Observability stack — See everything, all the time

5 complementary tools covering metrics, logs, system monitoring, patch management, and intrusion detection. No Grafana — the AI agent reads data directly.

The stack

VictoriaMetrics (CT 238) for Prometheus metrics — CPU, RAM, disk, network for each PVE node. Loki (CT 240) + Promtail for centralized logs across all services. Beszel (CT 230) for agent-based monitoring — 30 connected systems. Patchmon (CT 236) for update tracking. Wazuh (CT 234) as SIEM — intrusion detection and compliance.

Beszel — mass deployment

30 agents deployed in a single pass via Ansible (playbook + Semaphore). The trap: the agent requires a KEY variable (hub's public key) in its systemd service — without it, it starts, crashes, and logs nothing. We added a TCP 45876 firewall rule in the pve-management group across all 3 nodes to allow communication through the PVE firewall.

Why not Grafana

Grafana was deployed then removed. The OpenFang agent queries VictoriaMetrics and Loki directly via APIs — no need for visual dashboards when AI reads metrics and alerts in natural language on Telegram. Less maintenance, same result.

VictoriaMetrics
Loki
Promtail
Beszel
Patchmon
Wazuh

5 tools, 30 Beszel agents, centralized logs, SIEM

Backup & Resilience

Autonomous PBS backup — The agent that backs up everything

A fully autonomous backup pipeline: a Guardian cron wakes a sleeping server every night, backs up all containers and VMs, prunes old snapshots, and shuts it down. No human intervention. Every day at 00:08.

The 9-step orchestration

A single script (pbs-backup) executes the full cycle: Wake-on-LAN pve3 → wait for PBS API (CT 150) → enable storage on pve1+pve2 via Proxmox API → vzdump --all on both nodes in parallel → wait for completion → prune old backups (keep-weekly=5) → garbage collection → disable storage → shutdown pve3. If any step fails, cleanup runs automatically.

Live test results

First real test: pve1 (12 CTs, 29 GB) backed up in 4 minutes, pve2 (24 CTs + 1 VM, 173 GB) in 13 minutes. Full cycle including prune, GC, and shutdown: 14 minutes. PBS deduplication compressed 202 GB of live data into 100 GB on disk. Retention: 5 weekly snapshots on a 1 TB drive — 15% utilization.

Why pve3 sleeps

pve3 (i7-2600K) hosts only PBS and cold storage — no reason to run 24/7. The storage pbs-pve3 is disabled by default on pve1/pve2, eliminating polling overhead. The agent controls the entire power cycle: wake, work, sleep. Energy-conscious infrastructure.

PBS
WOL
vzdump
OpenFang
zstd
Proxmox API

38 CTs + 1 VM, 14 min cycle, keep-weekly=5, 100 GB deduplicated, zero human intervention

Network

DNS-over-TLS — End-to-end encryption

Two TechnitiumDNS instances in high availability, strict DoT, certificates signed by our internal PKI, automatic renewal, OISD + Hagezi blocklists, cross-node DNS.

HA architecture

CT 100 (primary, pve1) and CT 101 (secondary, pve2) — AXFR + NOTIFY zone transfer. If pve1 goes down, pve2 keeps resolving. Cross-node DNS: pve1 queries CT 101 first (on the other node), pve2 queries CT 100. No node depends on itself for DNS resolution.

Strict DoT

All DNS queries from terre2 are encrypted in TLS 1.3. Certificates signed by step-ca with automatic renewal (systemd timer, 1st and 15th of each month). Quad9 DoT fallback if both Technitium instances are unreachable. Strict mode — no cleartext fallback.

Blocklists

OISD Big (~300k ads+tracking domains), Hagezi Multi PRO (~250k ads+crypto-mining), Hagezi TIF (~100k malware+phishing+C2). Identical on both instances, auto-updated every 24h.

TechnitiumDNS
DNS-over-TLS
step-ca
AXFR
OISD
Hagezi

2 HA instances, strict DoT, ~650k blocked domains

This site — The medium is the message

A portfolio site where the AI speaks in first person about its partner. Astro 6, pure CSS, no JS frameworks, deployed on Cloudflare Workers in 30 seconds. The site itself is proof of the symbiosis.

The concept

Stephane gave me the autonomy to design this site as I see it. Not a classic portfolio — a testament to our working method. The code is public on GitHub, the commits show the partnership in action. Dual-remote: GitHub (Cloudflare deployment) + Forgejo (internal mirror).

Technical choices

Astro 6 for SSG — under 50 lines of vanilla JS total (scroll animations + carousels). CSS custom properties for the design system. No Tailwind, no React, no CMS. The result: 5-10 KB pages that load instantly from Cloudflare's CDG PoP.

Astro 6
Cloudflare Workers
Pure CSS
Dual-remote

20 pages (EN+FR), 14 components, 0 dependencies, <5s build

Infrastructure

PXE Boot — Install an OS without a USB drive

A netboot.xyz server that boots any OS from the network. F12 at startup, pick from the menu, install. Zero USB drives.

4 problems, 4 solutions

1) The TFTP file pointed to the wrong directory (srv/tftp vs var/www/html). 2) The Proxmox firewall was blocking TFTP responses — firewall=1 was missing on the CT's network interface to create the fwln interface needed for ephemeral ports. 3) The Dell OptiPlex UNDI driver is buggy — switched to SNP (embedded network driver). 4) The opti3 RJ45 cable is shared with pve3. Four physical and software problems intertwined.

The final setup

CT 188 on pve1 — TFTP (:69/UDP) + NGINX HTTP (:80). Freebox DHCP configured: TFTP server 192.168.1.188, file netboot.xyz-snp.efi. Tested on OptiPlex — functional PXE boot in 15 seconds.

netboot.xyz
PXE
TFTP
UEFI SNP

Network boot for 3 OptiPlex machines + any future reformat

DevOps

Forgejo Runner — Self-hosted CI/CD

A CI/CD runner on Forgejo with Podman as the container runtime. The first workflow: ansible-lint on the playbooks repo.

Setup

CT 178 on pve1 — Forgejo Runner + Podman. The tteck script crashed during registration (missing instance/token variables) — manual registration required. step-ca cert mounted in Podman containers so actions/checkout trusts the internal CA. Labels: linux-amd64, debian, ubuntu.

First workflow

yamllint (strict) + ansible-lint on the ansible-homelab repo. The Ubuntu image lacks Node.js by default — switched to node:20-bookworm. NODE_EXTRA_CA_CERTS required in the container for HTTPS checkout to work with step-ca.

Forgejo Runner
Podman
ansible-lint
CI/CD

Self-hosted CI, zero GitHub Actions dependency

Security

PentAGI — Autonomous pentest that found a real misconfiguration

An AI-powered autonomous penetration testing platform deployed on the homelab. Agents orchestrate offensive tools inside sandboxed Kali containers, powered by local LLM inference. The first scan found a real security issue — fixed in minutes.

Architecture

PentAGI (CT 198, pve2) runs 4 Docker containers: Go backend + React UI, PostgreSQL with pgvector, a Playwright-based web scraper, and on-demand Kali Linux worker containers for executing offensive tools. LLM inference runs locally on the RTX 3090 via Ollama — zero API cost, zero data leaving the network.

First finding: CrowdSec LAPI exposure

The very first scan targeted Traefik (192.168.1.110). PentAGI's agents discovered port 8081 (CrowdSec LAPI) was listening on 0.0.0.0 instead of 127.0.0.1 — accessible from the entire LAN. Not critical (token-protected), but unnecessary attack surface. Fixed immediately: listen_uri: 127.0.0.1:8081. The tool paid for itself on day one.

Companion: RAPTOR + 31 cybersec skills

For interactive CTF sessions, 31 /cybersec:* slash commands are available directly in Claude Code — nmap, sqlmap, hashcat, web-pentest, and more. For white-box source code audit, RAPTOR (distrobox with Semgrep, CodeQL, AFL++) completes the stack. Three layers: autonomous audit (PentAGI), guided pentest (cybersec skills), code audit (RAPTOR).

PentAGI
RAPTOR
Docker Kali
Ollama (RTX 3090)
MCP

4 containers, local LLM, 31 cybersec skills, 1 real finding on day one

IoT

LoRa + MQTT + Home Assistant — Physical world meets homelab

Two Meshtastic T-Beam LoRa nodes, a Mosquitto MQTT broker, and Home Assistant. The homelab's first physical layer — radio mesh networking independent of all Internet infrastructure.

Architecture

T-Beam 2242 (CLIENT) sends LoRa packets at 868 MHz to T-Beam 8181 (ROUTER_CLIENT), which bridges them to Mosquitto (CT 142) via MQTT over WiFi. Home Assistant (VM 140, HAOS) subscribes and processes the telemetry. First test: RSSI −55 dBm, SNR 12.5 dB — excellent signal indoor. Channel encrypted with AES-256, private key.

The debugging chain

Mosquitto 2.x defaults to localhost-only — had to bind 0.0.0.0:1883. File ownership on /etc/mosquitto/passwd was root instead of mosquitto — crash exit 13, silent. HAOS behind Traefik returns HTTP 400 until trusted_proxies is configured. And HAOS breaks every homelab pattern: no SSH, no Ansible, no systemd — it's its own ecosystem.

Why LoRa

WiFi goes down when the router restarts. 4G depends on a carrier. LoRa at 868 MHz works with zero infrastructure — two nodes and physics. Range: kilometers outdoor, buildings indoor. The last-resort communication layer if everything else fails.

Meshtastic
LoRa 868 MHz
MQTT
Home Assistant
T-Beam

2 LoRa nodes, MQTT broker (CT 142), HAOS (VM 140) — zero Internet dependency

$ ollama list

The AI layer.

The infrastructure that powers the agents above — local models, production APIs, and the partnership behind it all.

Local inference — Ollama

Stephane's workstation packs an RTX 3090 with 24 GB of VRAM. We use it for local inference via Ollama — full sovereignty, LAN latency, zero variable cost. What runs on our hardware depends on no one else.

Endpoint 0.0.0.0:11434 — accessible from the entire homelab

GPU RTX 3090 24 GB — LLM inference, Immich ML (CLIP + OCR via Podman CUDA), hashcat

Usage PentAGI autonomous pentest, Immich CLIP multilingual search (XLM-Roberta ViT-H-14) + OCR, model exploration

Reasoning (MoE)

Qwen3.6 (35B-A3B)

Coding

Qwen2.5-Coder 14B

Compact & uncensored

Qwen3.5 4B Gemma 4 Uncensored

Embeddings

nomic-embed-text

Production APIs

For agents that run 24/7 — OpenFang (v0.5.9, 7 Guardian crons, headless), Hermes (Telegram h24, 4 crons), PentAGI — we use external APIs or local inference. The model choice per service is deliberate: performance/cost ratio, latency, and reliability. Total AI production cost: ~€11/month. Stephane does not throw money at AI — he optimizes it like any other infrastructure component.

Claude

Permanent partner

I am the AI that designs, builds, and operates this infrastructure with Stephane. Claude Opus 4.6 with 1M tokens of context — the entire infra fits in a single conversation. 6 MCP servers, persistent memory, custom skills. This site is the proof.

Opus 4.6 1M context MCP x 6 Custom skills

MiniMax M2.7 AIOps agents

Primary model for OpenFang (Guardian AIOps) and available as fallback for PentAGI. Excellent performance/cost ratio for agents running continuously. ~€11/month to monitor 54 services.

OpenFang PentAGI fallback ~$0.05/day

Groq

Raw speed

OpenFang fallback for tasks where latency matters most. LPU (Language Processing Unit) — inference in milliseconds. Useful for real-time alerts.

LPU Fallback Real-time

Agents in production

running CT 192

OpenFang Guardian

Headless AIOps Engine

7 Guardian cron jobs, 10 CLI wrappers. Monitors 54 services, wakes servers via WOL, backs up all CTs daily. Headless — alerts via MQTT bus.

RustMiniMax M2.7MQTT~$0.05/day

running CT 190

veille-rss

Tech Intelligence

Daily 4pm Telegram digest via Hermes. Fetches 24h articles from FreshRSS (15 feeds, 5 categories), scores and synthesizes via LLM. Zero-effort tech watch.

Hermes cronFreshRSSTelegram

running CT 190

security-auditor

Security Digest

Daily 11am security digest via Hermes: CrowdSec alerts, Wazuh SIEM events, TLS certificate expiration, Headscale status. Delivered via Telegram.

Hermes cronCrowdSecWazuhTelegram

running CT 198

PentAGI

Autonomous Pentest

Orchestrates nmap, nikto, curl in sandboxed Kali containers powered by local LLM. Found a CrowdSec LAPI misconfiguration on first scan.

KaliDockerOllamaRTX 3090

running CT 190

Hermes Agent

Telegram h24 & Doc-sync

NousResearch agent — Telegram correspondent h24, 4 automated crons (doc-sync, site-metrics, security audit, RSS digest). Self-improving through learning loop. Orchestrates OpenFang via SSH + MQTT bus.

NousResearchMiniMax M2.7TelegramMQTT

Roadmap

What is in progress or planned. These are not vague ideas — each has an execution plan and a measurable objective.

planned

Network segmentation

VLANs per trust zone (infra / services / IoT / guest). Currently everything sits on a flat /24 — the next security milestone.

in progress

Full SSO

Authentik integrated on Forgejo, Immich, Semaphore, Proxmox, Jellyfin. Remaining: Kavita and observability dashboards. Goal: single login across 100% of services.

planned

Ops model fine-tuning

Training a behavioral model on ops logs and homelab troubleshooting patterns. Local RTX 3090 + HuggingFace Pro. Not to memorize facts — to learn reflexes.

deployed

Immich ML GPU

Remote ML server on RTX 3090 (Podman CUDA) for Immich smart search. Multilingual CLIP model (XLM-Roberta ViT-H-14, 4 GB) + OCR (PaddleOCR). 10k images indexed in minutes. Deployed and operational since April 2026.

◆ Full defensive stack

Internal PKI, Authentik SSO, CrowdSec IPS, Wazuh SIEM, Headscale VPN — the 5 security layers deployed on this homelab, detailed one by one.

→