LocalAI enables running 36+ AI models locally without a GPU, supporting multi-user API access and built-in AI agents with OpenAI-compatible APIs. Here’s how it works and why it matters.
mem0 enhances AI agent memory with a new single-pass ADD-only extraction algorithm and multi-signal retrieval, significantly improving benchmark scores while simplifying memory management.
PhotoPrism offers AI-driven photo management with facial recognition and tagging in a self-hosted, privacy-focused platform. Explore its architecture, tech, and tradeoffs.
Symfony AI unifies multiple AI platforms into a single PHP interface, enabling flexible AI-powered Symfony apps without vendor lock-in. It includes AI agents, chat context, and data indexing components.
TrendRadar is a self-hosted AI-driven tool for multi-platform trend monitoring, using an MCP-based architecture for advanced language analysis and smart push notifications across popular messaging platforms.
Part 2 of 4: a benchmark journal across nixpkgs llama.cpp, upstream master, and ik_llama.cpp on Qwen3.6-27B. Six hours, four backends, all converging at 66 tok/s — and the physical reason why.
Part 3 of 4: a deep-dive into why speculative decoding silently breaks (or runs anti-economically) on hybrid attention+SSM architectures like Qwen3.6, Mamba-2, and RWKV — and what would need to change upstream to fix it.
Part 4 of 4: the actual NixOS module, llama-pull helper, claude-code-router wiring, and one-line workflow for switching models. Five Nix files for a complete, isolated, rollback-able local LLM service.
Part 1 of 4: motivation, hardware, and stack choices for serving Qwen3.6-27B locally on a 32 GB consumer GPU with NixOS, before any benchmarks or trade-offs kick in.
CC Switch is a Rust-based cross-platform desktop app that centralizes management of AI coding CLIs like Claude Code and Codex. It streamlines switching providers, prompt templating, and session persistence.
Kong Gateway extends traditional API management with universal LLM API routing, semantic security, and AI-specific features, enabling multi-vendor AI traffic governance in cloud-native environments.
OpenHands offers a modular Python platform to build and deploy AI agents with SDK, CLI, GUI, and cloud options. It supports multiple LLMs and self-hosting for enterprises.
Hugging Face Transformers offers a unified Python API to access over 1 million pretrained AI models for text, vision, and audio, simplifying complex pipelines with its Pipeline API.
The openai/skills repo offers a catalog of modular ‘Agent Skills’ for OpenAI Codex agents, enabling reusable AI functionalities with a standardized installation system.
Cloudflare Agents offers a TypeScript framework for stateful AI agents on Durable Objects with real-time communication, AI integration, and sandboxed code execution.
OpenAI Codex CLI brings AI coding assistance to your local terminal, integrating with ChatGPT plans for powerful hybrid AI workflows. Here’s how it works and who it’s for.