llmstxt_architect automates generating and updating llms.txt files that communicate website content to LLMs. It supports multiple LLM providers and preserves file structure during updates.
macai is a native macOS AI chat client unifying access to major LLM providers with iCloud Sync and local inference support, offering a minimalist cross-device AI chat experience.
MarkPDFDown is a Python CLI tool that converts PDFs and images into Markdown by using vision-capable large language models for visual recognition-based parsing, handling complex layouts and formulas.
qxresearch-event-1 is a collection of 50+ minimalist Python apps, each about 10 lines long, that showcase core AI patterns such as fine-tuning, vector databases, and Whisper. It serves as a practical learning resource.
This curated, frequently updated bibliography accompanies the IJCAI 2024 survey paper on LLM-based multi-agent systems, organizing research into five key categories and highlighting emerging trends.
The LLM Engineer Handbook catalogs the full lifecycle of large language model engineering, from pretraining to prompt management, guiding engineers beyond demos to production-ready LLM apps.
NomAI combines a Flutter app with a FastAPI backend using a multi-step LLM pipeline and web-grounded reasoning for nutrition analysis and meal tracking.
npcpy offers a unique NPC Context-Agent-Tool data layer to enforce AI compliance via software architecture, supporting multimodal LLM apps and multi-agent systems with local and cloud providers.
obsidian-llm-wiki-local generates interlinked Obsidian markdown wikis using local LLMs. Its standout feature is a rejection feedback loop that refines article quality via user input.
Open Computer Use uses a modular three-stage LLM pipeline to control a cloud Linux desktop, combining grounding, vision, and action models for flexible AI-driven automation.
OpenAgents hosts three specialized LLM agents—Data, Plugins, Web—via a Flask API and Next.js UI, integrating sandboxed code execution, plugin selection, and browser automation.
OpenAnt uses a two-stage LLM pipeline to detect and validate code vulnerabilities across multiple languages, reducing false positives by verifying exploits automatically.
OpenChronicle captures macOS accessibility events to build structured local memory for LLM agents. Its async pipeline produces persistent Markdown memory and an SQLite index.
OptiLLM is an OpenAI-compatible inference proxy that boosts LLM reasoning with 20+ techniques such as Mixture of Agents and MCTS, requiring no model retraining. A simple prefix selects the technique, with claimed accuracy improvements of 2-10x.
Paper2Any uses chained LLM calls with structured output to convert academic papers into editable scientific figures, slides, and diagrams via a FastAPI backend and React frontend.
ReasoningBank introduces memory-aware test-time scaling for AI agents by storing reasoning traces from both successes and failures, enabling self-evolution through experience.
SuperClaude transforms Claude Code into a structured AI development platform using behavioral instruction injection, 30 slash commands, 20 specialized agents, and 8 MCP server integrations for faster, token-efficient workflows.
SupoClip is an open-source self-hosted AI video clipper using AssemblyAI transcription and multiple LLM backends including local Ollama. It runs on Docker Compose with FastAPI and Next.js.
Swark is a VS Code extension that creates Mermaid.js architecture diagrams from any code using GitHub Copilot’s free tier via the VS Code Language Model API—no API keys needed.
vLLM Compressor applies advanced quantization and compression techniques to large language models, enabling optimized inference without requiring full model definitions.