My AI stack: tools and infrastructure for a productivity-focused setup
A detailed look at my constantly evolving AI stack, from LLM APIs and frontends to vector storage, orchestration, and Docker deployment.
Tag
13 posts
A detailed look at my constantly evolving AI stack, from LLM APIs and frontends to vector storage, orchestration, and Docker deployment.
A compact local dashboard showing time, emails, calendar, weather, and news headlines, optimized for 7-inch displays.
A personalized desktop dashboard combining Hebrew calendar data, zmanim, weather, email, and Google services for daily productivity.
A curated list of command-line AI coding tools maintained by the model vendors themselves, from Claude Code to Gemini CLI.
Planning notes and sketches for an MCP gateway architecture that aggregates servers into LAN and WAN gateways.
A curated index of 100+ voice technology tools accessible to Linux desktop users, from real-time dictation to dev frameworks.
A curated resource list of multimodal AI models with native audio support — models that process audio tokens, not just transcribe.
Comparing 8 STT models on a 27-minute podcast. Local Whisper wins on word accuracy, but cloud APIs dominate punctuation.
An MCP server for audio transcription using multimodal LLMs like Gemini, GPT-4o Audio, and Voxtral — not traditional ASR.
An MCP server that brings Gemini-powered audio transcription directly into Claude Code and Claude Desktop.
A desktop transcription app that sends audio directly to multimodal AI models for single-pass transcription and formatting.
A multi-agent system template for conducting comprehensive software and hardware technology evaluations using Claude Code.
Introducing the Claude Code Repos Index — a curated directory of 100+ repositories exploring Claude Code as a multi-purpose agent workspace.