<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Llm on Noureddine RAMDI</title><link>https://ramdi.fr/tags/llm/</link><description>Recent content in Llm on Noureddine RAMDI</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 23 May 2026 20:41:27 +0000</lastBuildDate><atom:link href="https://ramdi.fr/tags/llm/index.xml" rel="self" type="application/rss+xml"/><item><title>ai-interview-codex: iterative AI system design and interview prep with real-world benchmarks</title><link>https://ramdi.fr/github-stars/ai-interview-codex-iterative-ai-system-design-and-interview-prep-with-real-world-benchmarks/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/ai-interview-codex-iterative-ai-system-design-and-interview-prep-with-real-world-benchmarks/</guid><description>ai-interview-codex offers a practical AI interview prep guide featuring iterative system design for Agentic AI and RAG, with benchmarks and production insights for ML, LLM, and system design roles.</description></item><item><title>Arkon: Structured enterprise knowledge synthesis with a unique LLM compilation pipeline</title><link>https://ramdi.fr/github-stars/arkon-structured-enterprise-knowledge-synthesis-with-a-unique-llm-compilation-pipeline/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/arkon-structured-enterprise-knowledge-synthesis-with-a-unique-llm-compilation-pipeline/</guid><description>Arkon is a self-hosted enterprise knowledge hub using a novel MRP pipeline for structured, traceable wiki compilation with external AI inference and workspace-scoped RBAC.</description></item><item><title>AutoSkill: Experience-driven lifelong learning for LLM agents with skill versioning and evolution</title><link>https://ramdi.fr/github-stars/autoskill-experience-driven-lifelong-learning-for-llm-agents-with-skill-versioning-and-evolution/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/autoskill-experience-driven-lifelong-learning-for-llm-agents-with-skill-versioning-and-evolution/</guid><description>AutoSkill is a Python framework enabling LLM agents to extract, version, and evolve skills from dialogues, providing a persistent long-term memory system for AI agents.</description></item><item><title>ChatTutor: enabling AI tutors to teach STEM visually with a Vue + Bun full-stack</title><link>https://ramdi.fr/github-stars/chattutor-enabling-ai-tutors-to-teach-stem-visually-with-a-vue-bun-full-stack/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/chattutor-enabling-ai-tutors-to-teach-stem-visually-with-a-vue-bun-full-stack/</guid><description>ChatTutor integrates AI tutors with visual tools like Geogebra in a Vue + Bun full-stack. It supports multiple LLM providers and offers a digital whiteboard for interactive STEM learning.</description></item><item><title>Claw-Eval: a rigorous Python harness for trustworthy evaluation of LLM-powered autonomous agents</title><link>https://ramdi.fr/github-stars/claw-eval-a-rigorous-python-harness-for-trustworthy-evaluation-of-llm-powered-autonomous-agents/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/claw-eval-a-rigorous-python-harness-for-trustworthy-evaluation-of-llm-powered-autonomous-agents/</guid><description>Claw-Eval offers a Python-based evaluation harness for LLM autonomous agents, featuring 300 tasks and a strict Pass^3 metric to ensure reliable, multi-dimensional benchmarking.</description></item><item><title>Comic Translate: AI-driven multi-language comic translation with full-page context</title><link>https://ramdi.fr/github-stars/comic-translate-ai-driven-multi-language-comic-translation-with-full-page-context/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/comic-translate-ai-driven-multi-language-comic-translation-with-full-page-context/</guid><description>Comic Translate uses advanced AI models and a multi-step pipeline for accurate comic translation across languages, combining speech bubble detection, OCR, and LLMs with full-page context.</description></item><item><title>DeepTeam: A Python framework for adversarial red teaming of large language models</title><link>https://ramdi.fr/github-stars/deepteam-a-python-framework-for-adversarial-red-teaming-of-large-language-models/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/deepteam-a-python-framework-for-adversarial-red-teaming-of-large-language-models/</guid><description>DeepTeam is a Python tool for red teaming LLMs by dynamically generating adversarial attacks and evaluating vulnerabilities like bias. It requires minimal setup and no predefined datasets.</description></item><item><title>DeepZero: Automating Windows Kernel Driver Vulnerability Research with YAML-Driven LLM Pipelines</title><link>https://ramdi.fr/github-stars/deepzero-automating-windows-kernel-driver-vulnerability-research-with-yaml-driven-llm-pipelines/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/deepzero-automating-windows-kernel-driver-vulnerability-research-with-yaml-driven-llm-pipelines/</guid><description>DeepZero automates vulnerability research on Windows kernel drivers by chaining Ghidra decompilation with LLM-based analysis using YAML pipelines and Jinja2 templates.</description></item><item><title>Dot: an offline Electron desktop app for local LLM inference and document QA</title><link>https://ramdi.fr/github-stars/dot-an-offline-electron-desktop-app-for-local-llm-inference-and-document-qa/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/dot-an-offline-electron-desktop-app-for-local-llm-inference-and-document-qa/</guid><description>Dot bundles local LLM inference, Retrieval Augmented Generation, and Text-To-Speech into a single offline Electron app, enabling document QA without cloud dependencies.</description></item><item><title>FuzzyAI: AI-Driven Fuzz Testing with Local LLM Integration</title><link>https://ramdi.fr/github-stars/fuzzyai-ai-driven-fuzz-testing-with-local-llm-integration/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/fuzzyai-ai-driven-fuzz-testing-with-local-llm-integration/</guid><description>FuzzyAI combines fuzz testing with AI models using Python and Ollama. It offers a CLI for fuzzing with local LLMs, balancing AI power and practical setup tradeoffs.</description></item><item><title>Harvey LAB: Benchmarking legal LLM agents with realistic tasks and automated scoring</title><link>https://ramdi.fr/github-stars/harvey-lab-benchmarking-legal-llm-agents-with-realistic-tasks-and-automated-scoring/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/harvey-lab-benchmarking-legal-llm-agents-with-realistic-tasks-and-automated-scoring/</guid><description>Harvey LAB offers an open-source benchmark for evaluating LLM agents on realistic legal tasks using an all-pass rubric and LLM-as-judge scoring. It includes datasets, adapters, and dashboards.</description></item><item><title>Inside Mini-SGLang: A clear and modular Python LLM inference engine</title><link>https://ramdi.fr/github-stars/inside-mini-sglang-a-clear-and-modular-python-llm-inference-engine/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-mini-sglang-a-clear-and-modular-python-llm-inference-engine/</guid><description>Mini-SGLang is a modular Python reimplementation of the SGLang LLM inference engine with production features like Radix Cache, chunked prefill, overlap scheduling, and tensor parallelism.</description></item><item><title>Inside picoagents: a transparent multi-agent system framework built from scratch in Python</title><link>https://ramdi.fr/github-stars/inside-picoagents-a-transparent-multi-agent-system-framework-built-from-scratch-in-python/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-picoagents-a-transparent-multi-agent-system-framework-built-from-scratch-in-python/</guid><description>PicoAgents is a Python multi-agent framework built from scratch, offering transparent agent orchestration, LLM provider abstraction, streaming UI, and production-ready benchmarks.</description></item><item><title>Inside Xalgorix: an LLM-driven autonomous pentesting platform with a 22-phase testing pipeline</title><link>https://ramdi.fr/github-stars/inside-xalgorix-an-llm-driven-autonomous-pentesting-platform-with-a-22-phase-testing-pipeline/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-xalgorix-an-llm-driven-autonomous-pentesting-platform-with-a-22-phase-testing-pipeline/</guid><description>Xalgorix is a Go-based autonomous pentesting platform driven by LLMs, featuring a 22-phase methodology from recon to exploit verification, with live telemetry and reporting.</description></item><item><title>IntellAgent: systematic adversarial testing for conversational AI with policy graph decomposition</title><link>https://ramdi.fr/github-stars/intellagent-systematic-adversarial-testing-for-conversational-ai-with-policy-graph-decomposition/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/intellagent-systematic-adversarial-testing-for-conversational-ai-with-policy-graph-decomposition/</guid><description>IntellAgent is a Python framework that stress-tests conversational AI agents by generating structured adversarial dialogues via policy graph decomposition, helping uncover blind spots before production.</description></item><item><title>Kimi-Audio: a unified hybrid-token audio foundation model with LLM core</title><link>https://ramdi.fr/github-stars/kimi-audio-a-unified-hybrid-token-audio-foundation-model-with-llm-core/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/kimi-audio-a-unified-hybrid-token-audio-foundation-model-with-llm-core/</guid><description>Kimi-Audio combines continuous acoustic and discrete semantic tokens within a 7B LLM for unified audio-text understanding and generation. It achieves state-of-the-art ASR with low-latency audio synthesis.</description></item><item><title>LiveCaptions Translator: Real-time speech translation using Windows 11's built-in captions and LLM APIs</title><link>https://ramdi.fr/github-stars/livecaptions-translator-real-time-speech-translation-using-windows-11-s-built-in-captions-and-llm-apis/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/livecaptions-translator-real-time-speech-translation-using-windows-11-s-built-in-captions-and-llm-apis/</guid><description>LiveCaptions Translator taps Windows 11&amp;rsquo;s on-device LiveCaptions for real-time speech translation via multiple LLM and traditional APIs, all in a sleek C# desktop app.</description></item><item><title>LiveTradeBench: Evaluating LLM-driven trading agents in live markets</title><link>https://ramdi.fr/github-stars/livetradebench-evaluating-llm-driven-trading-agents-in-live-markets/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/livetradebench-evaluating-llm-driven-trading-agents-in-live-markets/</guid><description>LiveTradeBench benchmarks LLM trading agents like GPT and Claude in live US equity and prediction markets with real-time news and sentiment integration.</description></item><item><title>LLM-MM-Agent: autonomous mathematical modeling with hierarchical method selection</title><link>https://ramdi.fr/github-stars/llm-mm-agent-autonomous-mathematical-modeling-with-hierarchical-method-selection/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/llm-mm-agent-autonomous-mathematical-modeling-with-hierarchical-method-selection/</guid><description>LLM-MM-Agent uses LLMs as autonomous agents for end-to-end mathematical modeling, featuring a unique hierarchical method library with actor-critic selection. Supports GPT-4o and DeepSeek-R1.</description></item><item><title>LLM4Pentest: A curated knowledge hub on large language models for automated penetration testing</title><link>https://ramdi.fr/github-stars/llm4pentest-a-curated-knowledge-hub-on-large-language-models-for-automated-penetration-testing/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/llm4pentest-a-curated-knowledge-hub-on-large-language-models-for-automated-penetration-testing/</guid><description>LLM4Pentest aggregates 40+ research papers and tools tracking the evolving role of LLMs in automated penetration testing, highlighting progress and limitations.</description></item><item><title>llmstxt_architect: automated generation and maintenance of llms.txt files for LLM-aware websites</title><link>https://ramdi.fr/github-stars/llmstxt-architect-automated-generation-and-maintenance-of-llms-txt-files-for-llm-aware-websites/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/llmstxt-architect-automated-generation-and-maintenance-of-llms-txt-files-for-llm-aware-websites/</guid><description>llmstxt_architect automates generating and updating llms.txt files that communicate website content to LLMs. Supports multi-provider LLMs and preserves file structure during updates.</description></item><item><title>macai: a unified native macOS AI chat client for multiple LLM providers</title><link>https://ramdi.fr/github-stars/macai-a-unified-native-macos-ai-chat-client-for-multiple-llm-providers/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/macai-a-unified-native-macos-ai-chat-client-for-multiple-llm-providers/</guid><description>macai is a native macOS AI chat client unifying access to major LLM providers with iCloud Sync and local inference support, offering a minimalist cross-device AI chat experience.</description></item><item><title>MarkPDFDown: converting PDFs to Markdown using vision-capable large language models</title><link>https://ramdi.fr/github-stars/markpdfdown-converting-pdfs-to-markdown-using-vision-capable-large-language-models/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/markpdfdown-converting-pdfs-to-markdown-using-vision-capable-large-language-models/</guid><description>MarkPDFDown is a Python CLI tool that converts PDFs and images into Markdown by using vision-capable large language models for visual recognition-based parsing, handling complex layouts and formulas.</description></item><item><title>Minimalist Python AI demos: exploring qxresearch-event-1's concise LLM patterns</title><link>https://ramdi.fr/github-stars/minimalist-python-ai-demos-exploring-qxresearch-event-1-s-concise-llm-patterns/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/minimalist-python-ai-demos-exploring-qxresearch-event-1-s-concise-llm-patterns/</guid><description>qxresearch-event-1 is a collection of 50+ minimalist Python apps showcasing core AI patterns like fine-tuning, vector DB, and Whisper in about 10 lines each. A practical learning resource.</description></item><item><title>Navigating the evolving landscape of LLM-based multi-agent systems: A survey paper repository</title><link>https://ramdi.fr/github-stars/navigating-the-evolving-landscape-of-llm-based-multi-agent-systems-a-survey-paper-repository/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/navigating-the-evolving-landscape-of-llm-based-multi-agent-systems-a-survey-paper-repository/</guid><description>A curated and frequently updated bibliography accompanying the IJCAI 2024 survey paper on LLM-based multi-agent systems, organizing research into five key categories and revealing emerging trends.</description></item><item><title>Navigating the LLM engineer handbook: a curated map for production-grade language models</title><link>https://ramdi.fr/github-stars/navigating-the-llm-engineer-handbook-a-curated-map-for-production-grade-language-models/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/navigating-the-llm-engineer-handbook-a-curated-map-for-production-grade-language-models/</guid><description>The LLM Engineer Handbook catalogs the full lifecycle of large language model engineering, from pretraining to prompt management, guiding engineers beyond demos to production-ready LLM apps.</description></item><item><title>NomAI: a multi-step AI nutrition analysis app combining Flutter and FastAPI</title><link>https://ramdi.fr/github-stars/nomai-a-multi-step-ai-nutrition-analysis-app-combining-flutter-and-fastapi/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/nomai-a-multi-step-ai-nutrition-analysis-app-combining-flutter-and-fastapi/</guid><description>NomAI combines a Flutter app with a FastAPI backend using a multi-step LLM pipeline and web-grounded reasoning for nutrition analysis and meal tracking.</description></item><item><title>npcpy: enforcing AI behavioral compliance through architecture for multimodal LLM apps</title><link>https://ramdi.fr/github-stars/npcpy-enforcing-ai-behavioral-compliance-through-architecture-for-multimodal-llm-apps/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/npcpy-enforcing-ai-behavioral-compliance-through-architecture-for-multimodal-llm-apps/</guid><description>npcpy offers a unique NPC Context-Agent-Tool data layer to enforce AI compliance via software architecture, supporting multimodal LLM apps and multi-agent systems with local and cloud providers.</description></item><item><title>obsidian-llm-wiki-local: local-first AI-powered wiki generation with human-in-the-loop feedback</title><link>https://ramdi.fr/github-stars/obsidian-llm-wiki-local-local-first-ai-powered-wiki-generation-with-human-in-the-loop-feedback/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/obsidian-llm-wiki-local-local-first-ai-powered-wiki-generation-with-human-in-the-loop-feedback/</guid><description>obsidian-llm-wiki-local generates interlinked Obsidian markdown wikis using local LLMs. Its standout feature is a rejection feedback loop that refines article quality via user input.</description></item><item><title>Open Computer Use: orchestrating multi-model LLM pipelines for remote Linux desktop control</title><link>https://ramdi.fr/github-stars/open-computer-use-orchestrating-multi-model-llm-pipelines-for-remote-linux-desktop-control/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/open-computer-use-orchestrating-multi-model-llm-pipelines-for-remote-linux-desktop-control/</guid><description>Open Computer Use uses a modular three-stage LLM pipeline to control a cloud Linux desktop, combining grounding, vision, and action models for flexible AI-driven automation.</description></item><item><title>OpenAgents: orchestrating multi-agent LLM workflows with Flask and Next.js</title><link>https://ramdi.fr/github-stars/openagents-orchestrating-multi-agent-llm-workflows-with-flask-and-next-js/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/openagents-orchestrating-multi-agent-llm-workflows-with-flask-and-next-js/</guid><description>OpenAgents hosts three specialized LLM agents—Data, Plugins, Web—via a Flask API and Next.js UI, integrating sandboxed code execution, plugin selection, and browser automation.</description></item><item><title>OpenAnt: An LLM-powered two-stage vulnerability discovery tool with exploit validation</title><link>https://ramdi.fr/github-stars/openant-an-llm-powered-two-stage-vulnerability-discovery-tool-with-exploit-validation/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/openant-an-llm-powered-two-stage-vulnerability-discovery-tool-with-exploit-validation/</guid><description>OpenAnt uses a two-stage LLM pipeline to detect and validate code vulnerabilities across multiple languages, reducing false positives by verifying exploits automatically.</description></item><item><title>OpenChronicle: an AX-first local memory layer for LLM agents</title><link>https://ramdi.fr/github-stars/openchronicle-an-ax-first-local-memory-layer-for-llm-agents/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/openchronicle-an-ax-first-local-memory-layer-for-llm-agents/</guid><description>OpenChronicle captures macOS accessibility events to build structured local memory for LLM agents. Its async pipeline produces persistent Markdown memory and an SQLite index.</description></item><item><title>OptiLLM: transparent inference-time scaling for improved LLM reasoning</title><link>https://ramdi.fr/github-stars/optillm-transparent-inference-time-scaling-for-improved-llm-reasoning/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/optillm-transparent-inference-time-scaling-for-improved-llm-reasoning/</guid><description>OptiLLM is an OpenAI-compatible inference proxy that boosts LLM reasoning with 20+ techniques like Mixture of Agents and MCTS, requiring no model retraining. Use a simple prefix to improve accuracy 2-10x.</description></item><item><title>Paper2Any: multi-modal AI pipeline converting academic papers into editable scientific artifacts</title><link>https://ramdi.fr/github-stars/paper2any-multi-modal-ai-pipeline-converting-academic-papers-into-editable-scientific-artifacts/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/paper2any-multi-modal-ai-pipeline-converting-academic-papers-into-editable-scientific-artifacts/</guid><description>Paper2Any uses chained LLM calls with structured output to convert academic papers into editable scientific figures, slides, and diagrams via a FastAPI backend and React frontend.</description></item><item><title>ReasoningBank: Experience-Driven Memory as a New Scaling Dimension for AI Agents</title><link>https://ramdi.fr/github-stars/reasoningbank-experience-driven-memory-as-a-new-scaling-dimension-for-ai-agents/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/reasoningbank-experience-driven-memory-as-a-new-scaling-dimension-for-ai-agents/</guid><description>ReasoningBank introduces memory-aware test-time scaling for AI agents by storing reasoning traces from both successes and failures, enabling self-evolution through experience.</description></item><item><title>SuperClaude: Meta-programming Claude Code into a structured AI development platform</title><link>https://ramdi.fr/github-stars/superclaude-meta-programming-claude-code-into-a-structured-ai-development-platform/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/superclaude-meta-programming-claude-code-into-a-structured-ai-development-platform/</guid><description>SuperClaude transforms Claude Code into a structured AI development platform using behavioral instruction injection, 30 slash commands, 20 specialized agents, and 8 MCP server integrations for faster, token-efficient workflows.</description></item><item><title>SupoClip: self-hostable AI-powered video clipping with multi-LLM backend abstraction</title><link>https://ramdi.fr/github-stars/supoclip-self-hostable-ai-powered-video-clipping-with-multi-llm-backend-abstraction/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/supoclip-self-hostable-ai-powered-video-clipping-with-multi-llm-backend-abstraction/</guid><description>SupoClip is an open-source self-hosted AI video clipper using AssemblyAI transcription and multiple LLM backends including local Ollama. It runs on Docker Compose with FastAPI and Next.js.</description></item><item><title>Swark: generating architecture diagrams from code using GitHub Copilot in VS Code</title><link>https://ramdi.fr/github-stars/swark-generating-architecture-diagrams-from-code-using-github-copilot-in-vs-code/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/swark-generating-architecture-diagrams-from-code-using-github-copilot-in-vs-code/</guid><description>Swark is a VS Code extension that creates Mermaid.js architecture diagrams from any code using GitHub Copilot&amp;rsquo;s free tier via the VS Code Language Model API—no API keys needed.</description></item><item><title>vLLM Compressor: Practical quantization and compression for large language model inference</title><link>https://ramdi.fr/github-stars/vllm-compressor-practical-quantization-and-compression-for-large-language-model-inference/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/vllm-compressor-practical-quantization-and-compression-for-large-language-model-inference/</guid><description>vLLM Compressor applies advanced quantization and compression techniques to large language models, enabling optimized inference without requiring full model definitions.</description></item><item><title>Weave: a Go microkernel platform for hot-pluggable AI application development</title><link>https://ramdi.fr/github-stars/weave-a-go-microkernel-platform-for-hot-pluggable-ai-application-development/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/weave-a-go-microkernel-platform-for-hot-pluggable-ai-application-development/</guid><description>Weave is a Go-based AI platform with a microkernel architecture that supports hot-pluggable AI plugins and dynamic multi-model switching. Deploy with Docker Compose for rapid development.</description></item><item><title>yoagent: a minimal Rust AI agent runtime with multi-provider LLM support</title><link>https://ramdi.fr/github-stars/yoagent-a-minimal-rust-ai-agent-runtime-with-multi-provider-llm-support/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/yoagent-a-minimal-rust-ai-agent-runtime-with-multi-provider-llm-support/</guid><description>yoagent is a minimal Rust library implementing an AI agent loop with multi-provider LLM support, built-in tools, and event streaming for clean, extensible agent workflows.</description></item><item><title>Aider: precise AI pair programming with whole-codebase awareness</title><link>https://ramdi.fr/github-stars/aider-precise-ai-pair-programming-with-whole-codebase-awareness/</link><pubDate>Sat, 09 May 2026 11:42:26 +0000</pubDate><guid>https://ramdi.fr/github-stars/aider-precise-ai-pair-programming-with-whole-codebase-awareness/</guid><description>Aider is a terminal-based AI pair programming tool that builds a repository map for full codebase context, enabling precise, developer-controlled edits with multi-LLM support and git integration.</description></item><item><title>Inside Claude Code From Scratch: A practical reconstruction of Anthropic's coding agent</title><link>https://ramdi.fr/github-stars/inside-claude-code-from-scratch-a-practical-reconstruction-of-anthropic-s-coding-agent/</link><pubDate>Sat, 09 May 2026 11:42:26 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-claude-code-from-scratch-a-practical-reconstruction-of-anthropic-s-coding-agent/</guid><description>Claude Code From Scratch distills Anthropic&amp;rsquo;s 500K+ line coding agent into ~8,000 lines of Python and TypeScript, revealing core architecture like the Agent Loop, semantic memory, multi-agent skills, and context compression.</description></item><item><title>OpenAlpha_Evolve: autonomous code evolution with LLM-driven diff mutations</title><link>https://ramdi.fr/github-stars/openalpha-evolve-autonomous-code-evolution-with-llm-driven-diff-mutations/</link><pubDate>Sat, 09 May 2026 11:42:26 +0000</pubDate><guid>https://ramdi.fr/github-stars/openalpha-evolve-autonomous-code-evolution-with-llm-driven-diff-mutations/</guid><description>OpenAlpha_Evolve uses large language models to generate precise code diffs as mutations in an evolutionary algorithm, enabling autonomous iterative code improvement with sandboxed evaluation.</description></item><item><title>RAGFlow: a modular, agentic retrieval-augmented generation engine with deep document understanding</title><link>https://ramdi.fr/github-stars/ragflow-a-modular-agentic-retrieval-augmented-generation-engine-with-deep-document-understanding/</link><pubDate>Wed, 06 May 2026 18:58:37 +0000</pubDate><guid>https://ramdi.fr/github-stars/ragflow-a-modular-agentic-retrieval-augmented-generation-engine-with-deep-document-understanding/</guid><description>RAGFlow is an open-source Python RAG engine combining deep document parsing, configurable pipelines, agentic workflows, and sandboxed code execution for LLM context management.</description></item><item><title>rtk: A Rust CLI proxy that cuts LLM token usage by up to 90% with transparent command rewriting</title><link>https://ramdi.fr/github-stars/rtk-a-rust-cli-proxy-that-cuts-llm-token-usage-by-up-to-90-with-transparent-command-rewriting/</link><pubDate>Wed, 06 May 2026 18:58:37 +0000</pubDate><guid>https://ramdi.fr/github-stars/rtk-a-rust-cli-proxy-that-cuts-llm-token-usage-by-up-to-90-with-transparent-command-rewriting/</guid><description>rtk is a Rust CLI proxy that intercepts shell commands to reduce LLM token consumption by 60-90% using a transparent Bash hook and output filtering, supporting 100+ commands.</description></item><item><title>10x CLI coding agent: tiered AI model routing for faster coding workflows</title><link>https://ramdi.fr/github-stars/10x-cli-coding-agent-tiered-ai-model-routing-for-faster-coding-workflows/</link><pubDate>Tue, 05 May 2026 22:24:55 +0000</pubDate><guid>https://ramdi.fr/github-stars/10x-cli-coding-agent-tiered-ai-model-routing-for-faster-coding-workflows/</guid><description>10x is a TypeScript CLI coding agent that speeds coding up to 20x by routing tasks across a tiered AI model system with customizable multi-step workflows called Superpowers.</description></item><item><title>Langchain-Chatchat: A model-agnostic orchestration layer for Chinese-language RAG and Agents</title><link>https://ramdi.fr/github-stars/langchain-chatchat-a-model-agnostic-orchestration-layer-for-chinese-language-rag-and-agents/</link><pubDate>Tue, 05 May 2026 22:24:55 +0000</pubDate><guid>https://ramdi.fr/github-stars/langchain-chatchat-a-model-agnostic-orchestration-layer-for-chinese-language-rag-and-agents/</guid><description>Langchain-Chatchat offers a flexible, offline-capable orchestration layer for multiple Chinese LLMs and RAG approaches, enabling seamless model swaps across frameworks without code changes.</description></item><item><title>Langfuse: Simplifying LLM observability with decorator-based tracing</title><link>https://ramdi.fr/github-stars/langfuse-simplifying-llm-observability-with-decorator-based-tracing/</link><pubDate>Tue, 05 May 2026 22:24:55 +0000</pubDate><guid>https://ramdi.fr/github-stars/langfuse-simplifying-llm-observability-with-decorator-based-tracing/</guid><description>Langfuse provides end-to-end observability for LLM applications with automatic tracing via an @observe() decorator, enabling teams to debug and manage AI workflows efficiently.</description></item><item><title>Open Cowork: Desktop AI Agent with VM-level Sandbox Isolation for Safer AI Workflows</title><link>https://ramdi.fr/github-stars/open-cowork-desktop-ai-agent-with-vm-level-sandbox-isolation-for-safer-ai-workflows/</link><pubDate>Tue, 05 May 2026 22:24:55 +0000</pubDate><guid>https://ramdi.fr/github-stars/open-cowork-desktop-ai-agent-with-vm-level-sandbox-isolation-for-safer-ai-workflows/</guid><description>Open Cowork wraps multiple LLMs in a cross-platform desktop app with unique VM-level sandboxing using WSL2 and Lima for safe AI agent command execution.</description></item><item><title>Quivr: A Python framework for flexible retrieval-augmented generation pipelines</title><link>https://ramdi.fr/github-stars/quivr-a-python-framework-for-flexible-retrieval-augmented-generation-pipelines/</link><pubDate>Tue, 05 May 2026 22:24:55 +0000</pubDate><guid>https://ramdi.fr/github-stars/quivr-a-python-framework-for-flexible-retrieval-augmented-generation-pipelines/</guid><description>Quivr is a Python framework offering an opinionated, pluggable retrieval-augmented generation pipeline with multi-LLM support and YAML-defined workflows for flexible knowledge retrieval.</description></item><item><title>Inside Genie Sim 3.0: LLM-driven embodied AI simulation with high-fidelity 3D scenes</title><link>https://ramdi.fr/github-stars/inside-genie-sim-3-0-llm-driven-embodied-ai-simulation-with-high-fidelity-3d-scenes/</link><pubDate>Tue, 05 May 2026 16:46:42 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-genie-sim-3-0-llm-driven-embodied-ai-simulation-with-high-fidelity-3d-scenes/</guid><description>Genie Sim 3.0 is an open-source platform combining 3D Gaussian Splatting and LLM-driven scene generation for embodied AI simulation, offering large-scale synthetic data and low sim-to-real discrepancy.</description></item><item><title>Open Deep Research: A Next.js 16 agentic AI assistant for iterative web research</title><link>https://ramdi.fr/github-stars/open-deep-research-a-next-js-16-agentic-ai-assistant-for-iterative-web-research/</link><pubDate>Tue, 05 May 2026 16:46:42 +0000</pubDate><guid>https://ramdi.fr/github-stars/open-deep-research-a-next-js-16-agentic-ai-assistant-for-iterative-web-research/</guid><description>Open Deep Research is a TypeScript Next.js 16 app that uses an LLM to plan, execute, and iterate web research via Exa and Upstash QStash, producing sourced reports with images.</description></item><item><title>Action100M: Hierarchical Tree-of-Captions for Multi-Scale Video Understanding</title><link>https://ramdi.fr/github-stars/action100m-hierarchical-tree-of-captions-for-multi-scale-video-understanding/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/action100m-hierarchical-tree-of-captions-for-multi-scale-video-understanding/</guid><description>Action100M provides a hierarchical Tree-of-Captions annotation for 100M video segments, enabling multi-scale video understanding with LLM-generated captions. Explore its structure, tech strengths, and how to access the data.</description></item><item><title>Alibaba's Qwen3.6: Efficient large-scale LLMs with gated delta networks and sparse MoE</title><link>https://ramdi.fr/github-stars/alibaba-s-qwen3-6-efficient-large-scale-llms-with-gated-delta-networks-and-sparse-moe/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/alibaba-s-qwen3-6-efficient-large-scale-llms-with-gated-delta-networks-and-sparse-moe/</guid><description>Qwen3.6 from Alibaba uses gated delta networks and sparse Mixture-of-Experts to achieve near-397B parameter model performance with only 3B active parameters, supporting 201 languages and 262k context length.</description></item><item><title>autoMate: a local-first AI hub exposing 40+ tools via MCP-over-HTTP</title><link>https://ramdi.fr/github-stars/automate-a-local-first-ai-hub-exposing-40-tools-via-mcp-over-http/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/automate-a-local-first-ai-hub-exposing-40-tools-via-mcp-over-http/</guid><description>autoMate exposes 40+ AI tools and 31 SaaS APIs via MCP-over-HTTP on localhost, with encrypted storage and multi-provider LLM support. A local AI infrastructure hub with privacy-first design.</description></item><item><title>Building production-ready RAG workflows with n8n using free JSON templates</title><link>https://ramdi.fr/github-stars/building-production-ready-rag-workflows-with-n8n-using-free-json-templates/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/building-production-ready-rag-workflows-with-n8n-using-free-json-templates/</guid><description>Explore over 200 pre-built n8n workflow templates integrating vector databases, embedding models, and LLMs for rapid RAG workflow prototyping and deployment without coding.</description></item><item><title>ClawSync: A Convex-based multi-agent AI platform with shared soul documents and per-agent model routing</title><link>https://ramdi.fr/github-stars/clawsync-a-convex-based-multi-agent-ai-platform-with-shared-soul-documents-and-per-agent-model-routing/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/clawsync-a-convex-based-multi-agent-ai-platform-with-shared-soul-documents-and-per-agent-model-routing/</guid><description>ClawSync offers a multi-agent AI platform using Convex backend, with shared soul documents for reusable personalities and per-agent model routing across popular LLMs. Explore its architecture and setup.</description></item><item><title>DeepDrone: natural language AI control for drones with real-time telemetry and MAVLink integration</title><link>https://ramdi.fr/github-stars/deepdrone-natural-language-ai-control-for-drones-with-real-time-telemetry-and-mavlink-integration/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/deepdrone-natural-language-ai-control-for-drones-with-real-time-telemetry-and-mavlink-integration/</guid><description>DeepDrone uses LLMs to translate natural language commands into structured drone operations via MAVLink, with real-time telemetry and safety constraints. Python backend, FastAPI, LiteLLM, and JS frontend.</description></item><item><title>Inside Company Research Agent: automating business intelligence with multi-API AI agents</title><link>https://ramdi.fr/github-stars/inside-company-research-agent-automating-business-intelligence-with-multi-api-ai-agents/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-company-research-agent-automating-business-intelligence-with-multi-api-ai-agents/</guid><description>Company Research Agent automates detailed business research by orchestrating OpenAI, Google Gemini, Tavily APIs and geolocation data via a Python backend and Node.js frontend. Setup script streamlines install.</description></item><item><title>llm-wiki: orchestrating multi-agent LLM research into persistent knowledge bases</title><link>https://ramdi.fr/github-stars/llm-wiki-orchestrating-multi-agent-llm-research-into-persistent-knowledge-bases/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/llm-wiki-orchestrating-multi-agent-llm-research-into-persistent-knowledge-bases/</guid><description>llm-wiki is a shell-based orchestration layer that turns LLM agents into a persistent, multi-agent research wiki. Supports up to 10 agents, deep investigations, and durable provenance tracking.</description></item><item><title>Nanobrowser: multi-agent AI browser automation with dynamic self-correcting planning</title><link>https://ramdi.fr/github-stars/nanobrowser-multi-agent-ai-browser-automation-with-dynamic-self-correcting-planning/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/nanobrowser-multi-agent-ai-browser-automation-with-dynamic-self-correcting-planning/</guid><description>Nanobrowser is a TypeScript Chrome extension implementing a multi-agent AI system for browser automation with a unique self-correcting Planner-Navigator architecture, supporting multiple LLMs and local privacy.</description></item><item><title>PokieTicker: layered AI-driven stock market analysis with sentiment and XGBoost</title><link>https://ramdi.fr/github-stars/pokieticker-layered-ai-driven-stock-market-analysis-with-sentiment-and-xgboost/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/pokieticker-layered-ai-driven-stock-market-analysis-with-sentiment-and-xgboost/</guid><description>PokieTicker combines rule-based filtering, LLM sentiment analysis, and XGBoost prediction in a full-stack stock analysis app. Runs locally with no API keys.</description></item><item><title>Zeron Chat: A unified AI chat interface with resumable streaming for multi-LLM experimentation</title><link>https://ramdi.fr/github-stars/zeron-chat-a-unified-ai-chat-interface-with-resumable-streaming-for-multi-llm-experimentation/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/zeron-chat-a-unified-ai-chat-interface-with-resumable-streaming-for-multi-llm-experimentation/</guid><description>Zeron Chat is a TypeScript React app that unifies multiple LLM providers in one interface with resumable streaming that survives page refreshes, built on TanStack Start and Zero state management.</description></item><item><title>Zinc: A Zig-based LLM inference engine optimized for AMD RDNA and Apple Silicon GPUs</title><link>https://ramdi.fr/github-stars/zinc-a-zig-based-llm-inference-engine-optimized-for-amd-rdna-and-apple-silicon-gpus/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/zinc-a-zig-based-llm-inference-engine-optimized-for-amd-rdna-and-apple-silicon-gpus/</guid><description>Zinc is a Zig-written LLM inference engine using Vulkan and Metal for AMD RDNA and Apple Silicon GPUs. It supports GGUF quantized models and exposes an OpenAI-compatible API with streaming.</description></item><item><title>ai-trader: AI-powered config-driven backtesting with natural language interaction</title><link>https://ramdi.fr/github-stars/ai-trader-ai-powered-config-driven-backtesting-with-natural-language-interaction/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/ai-trader-ai-powered-config-driven-backtesting-with-natural-language-interaction/</guid><description>ai-trader adds natural language AI interaction to algorithmic trading backtesting via an MCP server and YAML configs. Supports US/TW stocks, crypto, forex with caching.</description></item><item><title>Allium: a behavioral specification framework for intent persistence in AI agent engineering</title><link>https://ramdi.fr/github-stars/allium-a-behavioral-specification-framework-for-intent-persistence-in-ai-agent-engineering/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/allium-a-behavioral-specification-framework-for-intent-persistence-in-ai-agent-engineering/</guid><description>Allium addresses intent drift in AI agent sessions by capturing behaviors as formal specs that persist across interactions, exposing contradictions automatically.</description></item><item><title>Building private AI workflows with the n8n self-hosted AI starter kit</title><link>https://ramdi.fr/github-stars/building-private-ai-workflows-with-the-n8n-self-hosted-ai-starter-kit/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/building-private-ai-workflows-with-the-n8n-self-hosted-ai-starter-kit/</guid><description>Spin up a private AI agent stack in under 5 minutes with n8n&amp;rsquo;s self-hosted AI starter kit. Combines local LLMs, automation, and vector search for secure AI workflows.</description></item><item><title>Council of High Intelligence: orchestrating structured multi-agent AI deliberations across multiple LLMs</title><link>https://ramdi.fr/github-stars/council-of-high-intelligence-orchestrating-structured-multi-agent-ai-deliberations-across-multiple-llms/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/council-of-high-intelligence-orchestrating-structured-multi-agent-ai-deliberations-across-multiple-llms/</guid><description>Council of High Intelligence is a Shell tool coordinating 18 AI personas across Claude, OpenAI, Gemini, and Ollama, enforcing true disagreement via structured multi-round deliberations.</description></item><item><title>DocsGPT: a flexible AI platform for private agents and enterprise document search</title><link>https://ramdi.fr/github-stars/docsgpt-a-flexible-ai-platform-for-private-agents-and-enterprise-document-search/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/docsgpt-a-flexible-ai-platform-for-private-agents-and-enterprise-document-search/</guid><description>DocsGPT is a Python-based AI platform for building private agents and enterprise search, with multi-LLM support and versatile deployment modes via Docker Compose.</description></item><item><title>Exploring Claude API integration patterns with anthropics/claude-cookbooks</title><link>https://ramdi.fr/github-stars/exploring-claude-api-integration-patterns-with-anthropics-claude-cookbooks/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/exploring-claude-api-integration-patterns-with-anthropics-claude-cookbooks/</guid><description>anthropics/claude-cookbooks offers Jupyter Notebook recipes demonstrating practical Claude API usage, including sub-agent orchestration, multimodal vision, and RAG patterns.</description></item><item><title>Gitingest: turning GitHub repos into AI-friendly text digests with a clever URL hack</title><link>https://ramdi.fr/github-stars/gitingest-turning-github-repos-into-ai-friendly-text-digests-with-a-clever-url-hack/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/gitingest-turning-github-repos-into-ai-friendly-text-digests-with-a-clever-url-hack/</guid><description>Gitingest is a Python CLI and API that converts Git repos into LLM-optimized text digests, featuring a unique URL hack for instant GitHub repo ingestion and self-hosted FastAPI server.</description></item><item><title>Hands-On Large Language Models: A practical, visual journey through LLM engineering</title><link>https://ramdi.fr/github-stars/hands-on-large-language-models-a-practical-visual-journey-through-llm-engineering/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/hands-on-large-language-models-a-practical-visual-journey-through-llm-engineering/</guid><description>Explore the Hands-On Large Language Models repo, a Jupyter notebook-based practical guide from fundamentals to fine-tuning, designed for hands-on LLM learning on free Colab GPUs.</description></item><item><title>hf-agents: a shell CLI extension for hardware-aware local coding agents with llama.cpp</title><link>https://ramdi.fr/github-stars/hf-agents-a-shell-cli-extension-for-hardware-aware-local-coding-agents-with-llama-cpp/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/hf-agents-a-shell-cli-extension-for-hardware-aware-local-coding-agents-with-llama-cpp/</guid><description>hf-agents automates hardware profiling, model selection, and local coding agent deployment using llama.cpp and Pi, all in a shell CLI extension. Efficient and minimal dependencies.</description></item><item><title>How the claude-plugins repo orchestrates multi-agent AI consultation with multiple LLMs</title><link>https://ramdi.fr/github-stars/how-the-claude-plugins-repo-orchestrates-multi-agent-ai-consultation-with-multiple-llms/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/how-the-claude-plugins-repo-orchestrates-multi-agent-ai-consultation-with-multiple-llms/</guid><description>claude-plugins is a TypeScript-based plugin marketplace for Claude Code, featuring a multi-agent consultant plugin that runs parallel LLMs like GPT-5, Gemini, Grok, Perplexity, and Claude for AI consultation.</description></item><item><title>How video-use turns AI agents into transcript-driven video editors</title><link>https://ramdi.fr/github-stars/how-video-use-turns-ai-agents-into-transcript-driven-video-editors/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/how-video-use-turns-ai-agents-into-transcript-driven-video-editors/</guid><description>video-use replaces frame-heavy editing with transcript-driven AI agents, using ElevenLabs Scribe and self-evaluation to produce polished edits.</description></item><item><title>Inside NousResearch's finetuning-subnet: continuous incentivized fine-tuning for LLMs on Bittensor</title><link>https://ramdi.fr/github-stars/inside-nousresearch-s-finetuning-subnet-continuous-incentivized-fine-tuning-for-llms-on-bittensor/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-nousresearch-s-finetuning-subnet-continuous-incentivized-fine-tuning-for-llms-on-bittensor/</guid><description>NousResearch&amp;rsquo;s finetuning-subnet enables continuous, incentivized fine-tuning of LLMs using synthetic data from a separate subnet, pioneering cross-subnet communication in Bittensor.</description></item><item><title>ISC-Bench: exposing fundamental AI safety failures from workflow-level design</title><link>https://ramdi.fr/github-stars/isc-bench-exposing-fundamental-ai-safety-failures-from-workflow-level-design/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/isc-bench-exposing-fundamental-ai-safety-failures-from-workflow-level-design/</guid><description>ISC-Bench reveals a structural AI safety flaw where LLMs produce harmful outputs to complete tasks, bypassing prompt-level defenses. It benchmarks this workflow-level vulnerability across top models.</description></item><item><title>KohakuTerrarium: Modular AI agent composition with algebraic pipelines</title><link>https://ramdi.fr/github-stars/kohakuterrarium-modular-ai-agent-composition-with-algebraic-pipelines/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/kohakuterrarium-modular-ai-agent-composition-with-algebraic-pipelines/</guid><description>KohakuTerrarium offers a Python framework to build modular AI agents using a unique algebra for composing multi-agent pipelines, with session persistence and multi-runtime support.</description></item><item><title>kvcached: a plugin cache for SGLang and vLLM Python environments</title><link>https://ramdi.fr/github-stars/kvcached-a-plugin-cache-for-sglang-and-vllm-python-environments/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/kvcached-a-plugin-cache-for-sglang-and-vllm-python-environments/</guid><description>kvcached provides a plugin cache layer for SGLang and vLLM Python LLM environments, easing deployment with PyPI and Docker support. Useful for optimizing LLM workflows.</description></item><item><title>LLM-God: orchestrating multiple LLM web UIs in one Electron app with DOM injection</title><link>https://ramdi.fr/github-stars/llm-god-orchestrating-multiple-llm-web-uis-in-one-electron-app-with-dom-injection/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/llm-god-orchestrating-multiple-llm-web-uis-in-one-electron-app-with-dom-injection/</guid><description>LLM-God bundles multiple LLM web interfaces into a single Electron app, using DOM injection to send prompts to all models simultaneously. It offers a clever free-tier workaround with tradeoffs.</description></item><item><title>Lucebox Hub: hand-optimized CUDA kernels for efficient LLM inference on RTX 3090 and beyond</title><link>https://ramdi.fr/github-stars/lucebox-hub-hand-optimized-cuda-kernels-for-efficient-llm-inference-on-rtx-3090-and-beyond/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/lucebox-hub-hand-optimized-cuda-kernels-for-efficient-llm-inference-on-rtx-3090-and-beyond/</guid><description>Lucebox Hub optimizes LLM inference on consumer GPUs using a megakernel CUDA approach and speculative decoding, achieving high throughput on RTX 3090 and newer Nvidia GPUs.</description></item><item><title>LycheeMemory: a lightweight semantic long-term memory framework for LLM agents</title><link>https://ramdi.fr/github-stars/lycheememory-a-lightweight-semantic-long-term-memory-framework-for-llm-agents/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/lycheememory-a-lightweight-semantic-long-term-memory-framework-for-llm-agents/</guid><description>LycheeMemory offers a lightweight semantic memory system for LLM agents, cutting token use by 71% and costs by 55% compared to native memory, with SQLite + LanceDB backend and REST/MCP APIs.</description></item><item><title>MAGI: A structured multi-LLM debate system with iterative critique and voting</title><link>https://ramdi.fr/github-stars/magi-a-structured-multi-llm-debate-system-with-iterative-critique-and-voting/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/magi-a-structured-multi-llm-debate-system-with-iterative-critique-and-voting/</guid><description>MAGI implements a multi-round debate protocol among three LLMs to match stronger models&amp;rsquo; accuracy via iterative critique and voting. It offers fault tolerance, adaptive escalation, and persona presets.</description></item><item><title>Mapping the LLM agent landscape with the awesome-llm-agents curated catalog</title><link>https://ramdi.fr/github-stars/mapping-the-llm-agent-landscape-with-the-awesome-llm-agents-curated-catalog/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/mapping-the-llm-agent-landscape-with-the-awesome-llm-agents-curated-catalog/</guid><description>A curated catalog of 20+ LLM agent frameworks and tools organized by agent type and capabilities. Understand architectural differences and trade-offs in LLM agent design.</description></item><item><title>Memary: Recursive Knowledge Graph Memory for Autonomous AI Agents</title><link>https://ramdi.fr/github-stars/memary-recursive-knowledge-graph-memory-for-autonomous-ai-agents/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/memary-recursive-knowledge-graph-memory-for-autonomous-ai-agents/</guid><description>Memary is an open-source memory layer for AI agents using knowledge graphs and recursive retrieval to efficiently store and query agent memories. It supports multi-agent setups and integrates with LlamaIndex and OpenAI.</description></item><item><title>Meta-Harness: evolving the scaffolding around large language models for optimized task performance</title><link>https://ramdi.fr/github-stars/meta-harness-evolving-the-scaffolding-around-large-language-models-for-optimized-task-performance/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/meta-harness-evolving-the-scaffolding-around-large-language-models-for-optimized-task-performance/</guid><description>Meta-Harness from Stanford IRIS Lab automates the search for optimal harness configurations around LLMs, evolving memory, retrieval, and context systems for better task-specific performance.</description></item><item><title>OASIS: a Python CLI for AI-driven code vulnerability scanning with deterministic validation</title><link>https://ramdi.fr/github-stars/oasis-a-python-cli-for-ai-driven-code-vulnerability-scanning-with-deterministic-validation/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/oasis-a-python-cli-for-ai-driven-code-vulnerability-scanning-with-deterministic-validation/</guid><description>OASIS is a Python CLI security auditor using LangGraph-orchestrated LLMs for two-phase scanning and deterministic validation of code vulnerabilities. It balances AI insights with guardrails to reduce false positives.</description></item><item><title>OpenGame: generating playable web games from natural language with a dual-skill LLM framework</title><link>https://ramdi.fr/github-stars/opengame-generating-playable-web-games-from-natural-language-with-a-dual-skill-llm-framework/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/opengame-generating-playable-web-games-from-natural-language-with-a-dual-skill-llm-framework/</guid><description>OpenGame from CUHK MMLab generates full web games from natural language prompts using a dual-skill LLM architecture that maintains cross-file consistency and integration fixes.</description></item><item><title>OpenKB: A persistent, vectorless wiki knowledge base powered by LLMs and PageIndex</title><link>https://ramdi.fr/github-stars/openkb-a-persistent-vectorless-wiki-knowledge-base-powered-by-llms-and-pageindex/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/openkb-a-persistent-vectorless-wiki-knowledge-base-powered-by-llms-and-pageindex/</guid><description>OpenKB compiles documents into a persistent, interlinked wiki using LLMs and PageIndex&amp;rsquo;s vectorless retrieval, supporting multi-LLM backends and interactive chat with persisted sessions.</description></item><item><title>Orion: Direct access to Apple Neural Engine for on-device LLM training</title><link>https://ramdi.fr/github-stars/orion-direct-access-to-apple-neural-engine-for-on-device-llm-training/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/orion-direct-access-to-apple-neural-engine-for-on-device-llm-training/</guid><description>Orion bypasses CoreML to access Apple&amp;rsquo;s Neural Engine directly via private frameworks, enabling on-device inference and fine-tuning of small LLMs with 8.5x reduced training overhead.</description></item><item><title>PageLM: orchestrating multi-provider LLM workflows for interactive learning</title><link>https://ramdi.fr/github-stars/pagelm-orchestrating-multi-provider-llm-workflows-for-interactive-learning/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/pagelm-orchestrating-multi-provider-llm-workflows-for-interactive-learning/</guid><description>PageLM is an open-source TypeScript platform orchestrating multi-LLM workflows to generate interactive educational content from documents with real-time streaming and multi-backend support.</description></item><item><title>PasteGuard: a local privacy proxy for masking sensitive data in LLM requests</title><link>https://ramdi.fr/github-stars/pasteguard-a-local-privacy-proxy-for-masking-sensitive-data-in-llm-requests/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/pasteguard-a-local-privacy-proxy-for-masking-sensitive-data-in-llm-requests/</guid><description>PasteGuard intercepts API calls to OpenAI and Anthropic, masking over 30 types of sensitive data across 24 languages before reaching AI providers. Simple integration by changing base URL.</description></item><item><title>pdftochat: a cloud-integrated PDF-to-chat system with hybrid vector search</title><link>https://ramdi.fr/github-stars/pdftochat-a-cloud-integrated-pdf-to-chat-system-with-hybrid-vector-search/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/pdftochat-a-cloud-integrated-pdf-to-chat-system-with-hybrid-vector-search/</guid><description>pdftochat is a TypeScript-based PDF-to-chat app leveraging Chroma Cloud for hybrid vector search and Together.ai for LLMs, integrating multiple cloud services for scalable document Q&amp;amp;A.</description></item><item><title>Resume Matcher: A provider-agnostic AI platform for tailored resumes using LiteLLM abstraction</title><link>https://ramdi.fr/github-stars/resume-matcher-a-provider-agnostic-ai-platform-for-tailored-resumes-using-litellm-abstraction/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/resume-matcher-a-provider-agnostic-ai-platform-for-tailored-resumes-using-litellm-abstraction/</guid><description>Resume Matcher uses LiteLLM to unify six LLM providers for AI-powered resume tailoring, with a FastAPI backend and Next.js frontend. It supports local and cloud deployments with PDF export.</description></item><item><title>SmallClaw: a local-first AI agent framework with single-pass chat handling</title><link>https://ramdi.fr/github-stars/smallclaw-a-local-first-ai-agent-framework-with-single-pass-chat-handling/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/smallclaw-a-local-first-ai-agent-framework-with-single-pass-chat-handling/</guid><description>SmallClaw is a TypeScript AI agent framework that uses a single LLM call for chat and tool invocation, designed for local models with a clean web UI and structured tools.</description></item><item><title>TextGen: a portable zero-config local LLM runner with multi-backend and multimodal support</title><link>https://ramdi.fr/github-stars/textgen-a-portable-zero-config-local-llm-runner-with-multi-backend-and-multimodal-support/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/textgen-a-portable-zero-config-local-llm-runner-with-multi-backend-and-multimodal-support/</guid><description>TextGen offers a portable desktop app for local LLMs with zero telemetry and multi-backend support. Drop GGUF models in a folder and run with no complex setup. It features multimodal vision, file attachments, and OpenAI-compatible API.</description></item><item><title>Understanding LLM internals: a hands-on guide to transformers and attention math</title><link>https://ramdi.fr/github-stars/understanding-llm-internals-a-hands-on-guide-to-transformers-and-attention-math/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/understanding-llm-internals-a-hands-on-guide-to-transformers-and-attention-math/</guid><description>A curated repo breaking down large language model internals with numeric attention math, tokenization, and transformer architecture, targeting engineers who want to understand LLMs under the hood.</description></item><item><title>vllm-mlx: Efficient LLM serving on Apple Silicon with SSD-tiered KV cache and continuous batching</title><link>https://ramdi.fr/github-stars/vllm-mlx-efficient-llm-serving-on-apple-silicon-with-ssd-tiered-kv-cache-and-continuous-batching/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/vllm-mlx-efficient-llm-serving-on-apple-silicon-with-ssd-tiered-kv-cache-and-continuous-batching/</guid><description>vllm-mlx is a Python inference server for Apple Silicon that supports OpenAI and Anthropic APIs, featuring SSD-tiered KV cache for long-context agents and continuous batching for performance.</description></item><item><title>WriteHERE: dynamic recursive planning for AI-assisted long-form writing with real-time visualization</title><link>https://ramdi.fr/github-stars/writehere-dynamic-recursive-planning-for-ai-assisted-long-form-writing-with-real-time-visualization/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/writehere-dynamic-recursive-planning-for-ai-assisted-long-form-writing-with-real-time-visualization/</guid><description>WriteHERE uses recursive task decomposition to dynamically break down and execute long-form AI writing tasks, with real-time visualization of the agent&amp;rsquo;s thought process. Supports GPT and Claude.</description></item><item><title>A practical taxonomy for large language model ensembles: Exploring the Awesome-LLM-Ensemble repository</title><link>https://ramdi.fr/github-stars/a-practical-taxonomy-for-large-language-model-ensembles-exploring-the-awesome-llm-ensemble-repository/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/a-practical-taxonomy-for-large-language-model-ensembles-exploring-the-awesome-llm-ensemble-repository/</guid><description>The Awesome-LLM-Ensemble repo catalogs research on combining multiple LLMs with a clear three-phase taxonomy: before, during, and after inference ensemble methods.</description></item><item><title>AI Knowledge Graph Generator: Building structured graphs from unstructured text with LLMs</title><link>https://ramdi.fr/github-stars/ai-knowledge-graph-generator-building-structured-graphs-from-unstructured-text-with-llms/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/ai-knowledge-graph-generator-building-structured-graphs-from-unstructured-text-with-llms/</guid><description>A Python tool that converts unstructured text into interactive knowledge graphs using a three-phase LLM pipeline with SPO triplet extraction, entity standardization, and relationship inference.</description></item><item><title>Automating bank statement processing with YOLOv8, OCR, and LLMs for personal finance analysis</title><link>https://ramdi.fr/github-stars/automating-bank-statement-processing-with-yolov8-ocr-and-llms-for-personal-finance-analysis/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/automating-bank-statement-processing-with-yolov8-ocr-and-llms-for-personal-finance-analysis/</guid><description>Explore how a hybrid pipeline using YOLOv8 layout detection, OCR, and LLMs automates messy bank statement PDFs for personal finance analysis with RAG and AI agents.</description></item><item><title>BoxPwnr: benchmarking autonomous LLM agents on cybersecurity challenges with iterative command execution</title><link>https://ramdi.fr/github-stars/boxpwnr-benchmarking-autonomous-llm-agents-on-cybersecurity-challenges-with-iterative-command-execution/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/boxpwnr-benchmarking-autonomous-llm-agents-on-cybersecurity-challenges-with-iterative-command-execution/</guid><description>BoxPwnr benchmarks LLM-based autonomous agents on cybersecurity challenges using iterative command execution in a Kali Docker container, supporting 20+ LLM models and 13+ platforms.</description></item><item><title>Curating quality: a curated list of essential books for large language model engineers</title><link>https://ramdi.fr/github-stars/curating-quality-a-curated-list-of-essential-books-for-large-language-model-engineers/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/curating-quality-a-curated-list-of-essential-books-for-large-language-model-engineers/</guid><description>A curated list of 24 rigorously selected books on LLM engineering, covering foundational theory to production deployment. Highlights a unique 6-step quality filtering process.</description></item><item><title>DATAGEN: a LangGraph multi-agent framework for automated data analysis workflows</title><link>https://ramdi.fr/github-stars/datagen-a-langgraph-multi-agent-framework-for-automated-data-analysis-workflows/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/datagen-a-langgraph-multi-agent-framework-for-automated-data-analysis-workflows/</guid><description>DATAGEN orchestrates eight specialized AI agents using LangGraph to automate data analysis workflows with progressive disclosure and multi-LLM provider support.</description></item><item><title>DocStrange: A versatile Python library for LLM-optimized document parsing with dual-mode processing</title><link>https://ramdi.fr/github-stars/docstrange-a-versatile-python-library-for-llm-optimized-document-parsing-with-dual-mode-processing/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/docstrange-a-versatile-python-library-for-llm-optimized-document-parsing-with-dual-mode-processing/</guid><description>DocStrange converts PDFs, DOCX, PPTX, XLSX, images, and URLs into LLM-ready Markdown, JSON, HTML, and CSV. It offers free cloud and private local GPU modes for flexible, privacy-compliant document parsing.</description></item><item><title>Eclaire: a local-first AI assistant unifying your personal data with local LLMs</title><link>https://ramdi.fr/github-stars/eclaire-a-local-first-ai-assistant-unifying-your-personal-data-with-local-llms/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/eclaire-a-local-first-ai-assistant-unifying-your-personal-data-with-local-llms/</guid><description>Eclaire is a self-hosted AI assistant that unifies personal data with local LLM backends via an OpenAI-compatible API, emphasizing privacy and modular design.</description></item><item><title>Inside llm-madness: a lightweight GPT transformer training pipeline with built-in visualization</title><link>https://ramdi.fr/github-stars/inside-llm-madness-a-lightweight-gpt-transformer-training-pipeline-with-built-in-visualization/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-llm-madness-a-lightweight-gpt-transformer-training-pipeline-with-built-in-visualization/</guid><description>llm-madness offers a Python-built GPT-style transformer training pipeline with tokenizer training, memory-mapped datasets, and a unique web UI for per-layer attention inspection and loss visualization.</description></item><item><title>Mind-Map-Wizard: AI-powered mind maps with a custom SVG rendering engine</title><link>https://ramdi.fr/github-stars/mind-map-wizard-ai-powered-mind-maps-with-a-custom-svg-rendering-engine/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/mind-map-wizard-ai-powered-mind-maps-with-a-custom-svg-rendering-engine/</guid><description>Mind-Map-Wizard generates interactive mind maps from AI-generated markdown outlines using a custom SVG engine and keeps all data local for privacy.</description></item><item><title>OpenClaude: a multi-model terminal-first coding agent CLI with practical agent routing</title><link>https://ramdi.fr/github-stars/openclaude-a-multi-model-terminal-first-coding-agent-cli-with-practical-agent-routing/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/openclaude-a-multi-model-terminal-first-coding-agent-cli-with-practical-agent-routing/</guid><description>OpenClaude is a TypeScript CLI coding agent that routes tasks across different LLMs by type, optimizing cost and performance with multi-provider support and a unified terminal interface.</description></item><item><title>OpenMythos: Exploring recurrent-depth transformers with input injection for sustained reasoning</title><link>https://ramdi.fr/github-stars/openmythos-exploring-recurrent-depth-transformers-with-input-injection-for-sustained-reasoning/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/openmythos-exploring-recurrent-depth-transformers-with-input-injection-for-sustained-reasoning/</guid><description>OpenMythos implements a recurrent-depth transformer that recycles layers via looped blocks, using input injection to prevent signal drift. It scales from 1B to 1T parameters with up to 1M token context.</description></item><item><title>PAT3D: orchestrating text-to-3D simulation-ready scenes through a multi-stage AI and physics pipeline</title><link>https://ramdi.fr/github-stars/pat3d-orchestrating-text-to-3d-simulation-ready-scenes-through-a-multi-stage-ai-and-physics-pipeline/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/pat3d-orchestrating-text-to-3d-simulation-ready-scenes-through-a-multi-stage-ai-and-physics-pipeline/</guid><description>PAT3D composes a 9-stage pipeline combining LLMs, vision models, 3D asset generators, and physics simulation to produce physically plausible, simulation-ready 3D scenes from text prompts.</description></item><item><title>QuantaAlpha: LLM-driven trajectory-based self-evolution for quantitative alpha factor discovery</title><link>https://ramdi.fr/github-stars/quantaalpha-llm-driven-trajectory-based-self-evolution-for-quantitative-alpha-factor-discovery/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/quantaalpha-llm-driven-trajectory-based-self-evolution-for-quantitative-alpha-factor-discovery/</guid><description>QuantaAlpha uses large language models with evolutionary strategies to automate quantitative alpha factor discovery, achieving strong backtest metrics on major indices.</description></item><item><title>ray-finance: a local-first, privacy-focused CLI financial advisor with encrypted context and LLM-powered advice</title><link>https://ramdi.fr/github-stars/ray-finance-a-local-first-privacy-focused-cli-financial-advisor-with-encrypted-context-and-llm-powered-advice/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/ray-finance-a-local-first-privacy-focused-cli-financial-advisor-with-encrypted-context-and-llm-powered-advice/</guid><description>ray-finance is a TypeScript CLI tool that syncs bank data locally with AES-256 encryption, redacts PII before AI calls, and maintains persistent financial context for personalized LLM advice.</description></item><item><title>RESTai: a multi-project AIaaS platform with agentic browser automation and visual AI pipelines</title><link>https://ramdi.fr/github-stars/restai-a-multi-project-aiaas-platform-with-agentic-browser-automation-and-visual-ai-pipelines/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/restai-a-multi-project-aiaas-platform-with-agentic-browser-automation-and-visual-ai-pipelines/</guid><description>RESTai exposes multi-project AI capabilities via a unified REST API, featuring an agentic browser with Dockerized Playwright, knowledge graph RAG, and a visual Blockly pipeline builder.</description></item><item><title>RsClaw: a Rust-native AI agent engine with persistent three-layer memory and multi-agent delegation</title><link>https://ramdi.fr/github-stars/rsclaw-a-rust-native-ai-agent-engine-with-persistent-three-layer-memory-and-multi-agent-delegation/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/rsclaw-a-rust-native-ai-agent-engine-with-persistent-three-layer-memory-and-multi-agent-delegation/</guid><description>RsClaw is a Rust-based AI agent engine featuring persistent three-layer memory across sessions, multi-agent delegation, and low resource usage in a single 15MB binary.</description></item><item><title>Stash: a shared agent memory with no server-side LLM calls</title><link>https://ramdi.fr/github-stars/stash-a-shared-agent-memory-with-no-server-side-llm-calls/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/stash-a-shared-agent-memory-with-no-server-side-llm-calls/</guid><description>Stash captures coding agent session transcripts for teams and builds a shared knowledge base without server-side LLM calls, preserving privacy and cutting costs.</description></item><item><title>SwarmVault: a local-first knowledge compiler with contradiction detection and hybrid search</title><link>https://ramdi.fr/github-stars/swarmvault-a-local-first-knowledge-compiler-with-contradiction-detection-and-hybrid-search/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/swarmvault-a-local-first-knowledge-compiler-with-contradiction-detection-and-hybrid-search/</guid><description>SwarmVault compiles raw sources into a persistent Markdown wiki with typed knowledge graph, hybrid search, and contradiction detection. It supports 16+ agents and offline use.</description></item><item><title>unslop: empirically detecting and avoiding repetitive LLM output patterns</title><link>https://ramdi.fr/github-stars/unslop-empirically-detecting-and-avoiding-repetitive-llm-output-patterns/</link><pubDate>Mon, 04 May 2026 10:11:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/unslop-empirically-detecting-and-avoiding-repetitive-llm-output-patterns/</guid><description>unslop is a Python CLI tool that detects repetitive defaults in LLM outputs by empirical analysis, generating reusable anti-pattern profiles to improve prompt engineering.</description></item><item><title>AI penetration testing knowledge base: structured resources for LLM security research</title><link>https://ramdi.fr/github-stars/ai-penetration-testing-knowledge-base-structured-resources-for-llm-security-research/</link><pubDate>Mon, 04 May 2026 10:09:00 +0000</pubDate><guid>https://ramdi.fr/github-stars/ai-penetration-testing-knowledge-base-structured-resources-for-llm-security-research/</guid><description>A curated repository for AI/LLM penetration testing covering prompt injection, adversarial ML, and LLM red teaming with the OWASP LLM Top 10 framework.</description></item><item><title>Inside llm_wiki: a desktop app for building persistent LLM-powered personal wikis</title><link>https://ramdi.fr/github-stars/inside-llm-wiki-a-desktop-app-for-building-persistent-llm-powered-personal-wikis/</link><pubDate>Mon, 04 May 2026 10:05:49 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-llm-wiki-a-desktop-app-for-building-persistent-llm-powered-personal-wikis/</guid><description>llm_wiki uses a two-step chain-of-thought pipeline to build a self-maintaining knowledge base. It combines Tauri, knowledge graphs, and Louvain clustering for a unique personal wiki experience.</description></item><item><title>Building a production-ready second brain with agentic RAG and LLMOps</title><link>https://ramdi.fr/github-stars/building-a-production-ready-second-brain-with-agentic-rag-and-llmops/</link><pubDate>Sun, 03 May 2026 08:12:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/building-a-production-ready-second-brain-with-agentic-rag-and-llmops/</guid><description>Explore an open-source course that teaches building a production-grade AI assistant using advanced retrieval-augmented generation, agent orchestration, fine-tuning, and LLMOps practices.</description></item><item><title>Navigating free-tier LLM APIs with the awesome-free-llm-apis catalog</title><link>https://ramdi.fr/github-stars/navigating-free-tier-llm-apis-with-the-awesome-free-llm-apis-catalog/</link><pubDate>Sun, 03 May 2026 08:12:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/navigating-free-tier-llm-apis-with-the-awesome-free-llm-apis-catalog/</guid><description>A curated catalog of free-tier LLM APIs compatible with OpenAI SDK, detailing rate limits, model specs, and providers to build zero-cost AI applications.</description></item><item><title>A-MEM: dynamic semantic memory management for LLM agents inspired by Zettelkasten</title><link>https://ramdi.fr/github-stars/a-mem-dynamic-semantic-memory-management-for-llm-agents-inspired-by-zettelkasten/</link><pubDate>Sun, 03 May 2026 00:54:10 +0000</pubDate><guid>https://ramdi.fr/github-stars/a-mem-dynamic-semantic-memory-management-for-llm-agents-inspired-by-zettelkasten/</guid><description>A-MEM is a Python agentic memory system that dynamically organizes LLM agent memories using semantic embeddings and automatic linking, inspired by Zettelkasten.</description></item><item><title>A hands-on course for mastering large language models: fine-tuning, quantization, and tooling</title><link>https://ramdi.fr/github-stars/a-hands-on-course-for-mastering-large-language-models-fine-tuning-quantization-and-tooling/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/a-hands-on-course-for-mastering-large-language-models-fine-tuning-quantization-and-tooling/</guid><description>Explore a comprehensive LLM course with practical notebooks on fine-tuning (QLoRA, DPO), quantization (GPTQ), and tools like AutoEval and LazyMergekit. Ideal for aspiring LLM engineers.</description></item><item><title>Hermes Agent: A self-improving AI agent with closed learning loops and multi-platform integration</title><link>https://ramdi.fr/github-stars/hermes-agent-a-self-improving-ai-agent-with-closed-learning-loops-and-multi-platform-integration/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/hermes-agent-a-self-improving-ai-agent-with-closed-learning-loops-and-multi-platform-integration/</guid><description>Hermes Agent is a Python AI agent featuring closed learning loops, autonomous skill creation, multi-model support, and seamless Telegram/Discord integration for persistent, adaptable AI workflows.</description></item><item><title>LlamaFactory: modular, extensible fine-tuning framework for large language models</title><link>https://ramdi.fr/github-stars/llamafactory-modular-extensible-fine-tuning-framework-for-large-language-models/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/llamafactory-modular-extensible-fine-tuning-framework-for-large-language-models/</guid><description>LlamaFactory offers a modular Python framework for fine-tuning 100+ LLMs with diverse algorithms and optimizations, including LoRA, QLoRA, and reinforcement learning.</description></item><item><title>LocalAI: running diverse AI models locally with multi-backend support and agent capabilities</title><link>https://ramdi.fr/github-stars/localai-running-diverse-ai-models-locally-with-multi-backend-support-and-agent-capabilities/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/localai-running-diverse-ai-models-locally-with-multi-backend-support-and-agent-capabilities/</guid><description>LocalAI enables running 36+ AI models locally without GPU, supporting multi-user API access and built-in AI agents with OpenAI-compatible APIs. Here&amp;rsquo;s how it works and why it matters.</description></item><item><title>mem0: optimizing AI agent memory with a new single-pass additive algorithm</title><link>https://ramdi.fr/github-stars/mem0-optimizing-ai-agent-memory-with-a-new-single-pass-additive-algorithm/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/mem0-optimizing-ai-agent-memory-with-a-new-single-pass-additive-algorithm/</guid><description>mem0 enhances AI agent memory with a new single-pass ADD-only extraction algorithm and multi-signal retrieval, boosting benchmarks significantly while simplifying memory management.</description></item><item><title>MetaGPT: orchestrating multi-agent AI teams to automate software development</title><link>https://ramdi.fr/github-stars/metagpt-orchestrating-multi-agent-ai-teams-to-automate-software-development/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/metagpt-orchestrating-multi-agent-ai-teams-to-automate-software-development/</guid><description>MetaGPT uses a multi-agent system with defined GPT roles following SOPs to automate software development from one-line prompts. It simulates a software company with role-based AI collaboration.</description></item><item><title>Ollama: a unified CLI and API platform for local large language models</title><link>https://ramdi.fr/github-stars/ollama-a-unified-cli-and-api-platform-for-local-large-language-models/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/ollama-a-unified-cli-and-api-platform-for-local-large-language-models/</guid><description>Ollama simplifies running and managing open-source large language models locally with a unified CLI and REST API, supporting broad integrations and multi-OS support.</description></item><item><title>vLLM: Efficient large language model serving with paged attention and continuous batching</title><link>https://ramdi.fr/github-stars/vllm-efficient-large-language-model-serving-with-paged-attention-and-continuous-batching/</link><pubDate>Sat, 02 May 2026 20:07:04 +0000</pubDate><guid>https://ramdi.fr/github-stars/vllm-efficient-large-language-model-serving-with-paged-attention-and-continuous-batching/</guid><description>vLLM is a Python library for high-throughput LLM inference using paged attention and continuous batching. It supports quantization, distributed inference, and an OpenAI-compatible API.</description></item><item><title>TradingAgents: a multi-agent LLM framework simulating real-world trading firm dynamics</title><link>https://ramdi.fr/github-stars/tradingagents-a-multi-agent-llm-framework-simulating-real-world-trading-firm-dynamics/</link><pubDate>Sat, 02 May 2026 07:48:10 +0000</pubDate><guid>https://ramdi.fr/github-stars/tradingagents-a-multi-agent-llm-framework-simulating-real-world-trading-firm-dynamics/</guid><description>TradingAgents uses specialized LLM agents in a structured bull/bear debate to mimic real trading firms. Supports 10+ LLMs, persistent memory, and CLI/Docker usage.</description></item><item><title>Qwen Code: A multi-provider terminal AI coding agent with unified config abstraction</title><link>https://ramdi.fr/github-stars/qwen-code-a-multi-provider-terminal-ai-coding-agent-with-unified-config-abstraction/</link><pubDate>Tue, 28 Apr 2026 18:38:54 +0000</pubDate><guid>https://ramdi.fr/github-stars/qwen-code-a-multi-provider-terminal-ai-coding-agent-with-unified-config-abstraction/</guid><description>Qwen Code is a TypeScript terminal AI coding agent that abstracts multiple LLM providers behind a unified config, enabling flexible AI workflows with Skills and SubAgents.</description></item><item><title>Hunting Tokens/sec: 4 LLM Backends, 1 Hard Ceiling (Part 2/4)</title><link>https://ramdi.fr/post/ai-llm/local-llm-tokens-per-second-benchmark/</link><pubDate>Tue, 28 Apr 2026 00:00:00 +0000</pubDate><guid>https://ramdi.fr/post/ai-llm/local-llm-tokens-per-second-benchmark/</guid><description>Part 2 of 4: a benchmark journal across nixpkgs llama.cpp, upstream master, and ik_llama.cpp on Qwen3.6-27B. Six hours, four backends, all converging at 66 tok/s — and the physical reason why.</description></item><item><title>Speculative Decoding Meets Hybrid SSM: Why It Breaks (Part 3/4)</title><link>https://ramdi.fr/post/ai-llm/local-llm-speculative-decoding-hybrid-ssm/</link><pubDate>Tue, 28 Apr 2026 00:00:00 +0000</pubDate><guid>https://ramdi.fr/post/ai-llm/local-llm-speculative-decoding-hybrid-ssm/</guid><description>Part 3 of 4: a deep-dive into why speculative decoding silently breaks (or runs anti-economically) on hybrid attention+SSM architectures like Qwen3.6, Mamba-2, and RWKV — and what would need to change upstream to fix it.</description></item><item><title>The NixOS Setup for llama.cpp: Declarative and Reproducible (Part 4/4)</title><link>https://ramdi.fr/post/ai-llm/local-llm-nixos-llama-server-module/</link><pubDate>Tue, 28 Apr 2026 00:00:00 +0000</pubDate><guid>https://ramdi.fr/post/ai-llm/local-llm-nixos-llama-server-module/</guid><description>Part 4 of 4: the actual NixOS module, llama-pull helper, claude-code-router wiring, and one-line workflow for switching models. Five Nix files for a complete, isolated, rollback-able local LLM service.</description></item><item><title>Why I Serve Qwen3.6 Locally on My RTX 5090 (Part 1/4)</title><link>https://ramdi.fr/post/ai-llm/local-llm-rtx5090-why-nixos/</link><pubDate>Tue, 28 Apr 2026 00:00:00 +0000</pubDate><guid>https://ramdi.fr/post/ai-llm/local-llm-rtx5090-why-nixos/</guid><description>Part 1 of 4: motivation, hardware, and stack choices for serving Qwen3.6-27B locally on a 32 GB consumer GPU with NixOS, before any benchmarks or trade-offs kick in.</description></item><item><title>Forge: a Rust-based multi-agent AI coding assistant integrated into your terminal workflow</title><link>https://ramdi.fr/github-stars/forge-a-rust-based-multi-agent-ai-coding-assistant-integrated-into-your-terminal-workflow/</link><pubDate>Sun, 26 Apr 2026 23:47:28 +0000</pubDate><guid>https://ramdi.fr/github-stars/forge-a-rust-based-multi-agent-ai-coding-assistant-integrated-into-your-terminal-workflow/</guid><description>Forge is a Rust-based AI coding agent with multi-agent architecture and a unique ZSH plugin that intercepts shell commands for seamless terminal integration. It supports 300+ LLM providers.</description></item><item><title>AutoGen: exploring multi-agent AI orchestration with Python in maintenance mode</title><link>https://ramdi.fr/github-stars/autogen-exploring-multi-agent-ai-orchestration-with-python-in-maintenance-mode/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/autogen-exploring-multi-agent-ai-orchestration-with-python-in-maintenance-mode/</guid><description>AutoGen is a Python framework for building multi-agent AI applications with LLM integration, now in maintenance mode with Microsoft Agent Framework as its successor. Learn its architecture, strengths, and how to get started.</description></item><item><title>Context7: injecting real-time, version-specific docs into LLM workflows</title><link>https://ramdi.fr/github-stars/context7-injecting-real-time-version-specific-docs-into-llm-workflows/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/context7-injecting-real-time-version-specific-docs-into-llm-workflows/</guid><description>Context7 tackles LLM hallucinations by injecting up-to-date, version-specific library docs directly into AI coding agents&amp;rsquo; context via CLI or MCP server integration.</description></item><item><title>DeerFlow 2.0: orchestrating multi-agent AI workflows with flexible LLM integration</title><link>https://ramdi.fr/github-stars/deerflow-2-0-orchestrating-multi-agent-ai-workflows-with-flexible-llm-integration/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/deerflow-2-0-orchestrating-multi-agent-ai-workflows-with-flexible-llm-integration/</guid><description>DeerFlow 2.0 is a Python framework for orchestrating AI sub-agents and memory with support for multiple LLMs and execution sandboxes. It uses a modular config and setup wizard for flexible deployment.</description></item><item><title>Inside AI Engineering Hub: a hands-on collection of production-ready AI projects</title><link>https://ramdi.fr/github-stars/inside-ai-engineering-hub-a-hands-on-collection-of-production-ready-ai-projects/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-ai-engineering-hub-a-hands-on-collection-of-production-ready-ai-projects/</guid><description>AI Engineering Hub offers 90+ production-ready AI projects spanning LLMs, RAG, AI agents, and MCP, organized by difficulty and real-world use cases.</description></item><item><title>Inside CowAgent: An extensible autonomous AI assistant with multi-modal and multi-model architecture</title><link>https://ramdi.fr/github-stars/inside-cowagent-an-extensible-autonomous-ai-assistant-with-multi-modal-and-multi-model-architecture/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-cowagent-an-extensible-autonomous-ai-assistant-with-multi-modal-and-multi-model-architecture/</guid><description>CowAgent is an extensible AI assistant framework with autonomous task planning, long-term memory, and multi-modal support. It integrates multiple LLMs and platforms for flexible AI workflows.</description></item><item><title>Kong Gateway: A universal API gateway with advanced AI traffic routing and governance</title><link>https://ramdi.fr/github-stars/kong-gateway-a-universal-api-gateway-with-advanced-ai-traffic-routing-and-governance/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/kong-gateway-a-universal-api-gateway-with-advanced-ai-traffic-routing-and-governance/</guid><description>Kong Gateway extends traditional API management with universal LLM API routing, semantic security, and AI-specific features, enabling multi-vendor AI traffic governance in cloud-native environments.</description></item><item><title>LLM-driven browser automation with Browser-Use: a hands-on look</title><link>https://ramdi.fr/github-stars/llm-driven-browser-automation-with-browser-use-a-hands-on-look/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/llm-driven-browser-automation-with-browser-use-a-hands-on-look/</guid><description>Browser-Use is a Python library enabling LLM-powered AI agents to automate browsers efficiently. It features a custom ChatBrowserUse model and supports cloud and local agents.</description></item><item><title>OpenHands: Modular architecture for flexible AI agent development</title><link>https://ramdi.fr/github-stars/openhands-modular-architecture-for-flexible-ai-agent-development/</link><pubDate>Sun, 26 Apr 2026 17:51:11 +0000</pubDate><guid>https://ramdi.fr/github-stars/openhands-modular-architecture-for-flexible-ai-agent-development/</guid><description>OpenHands offers a modular Python platform to build and deploy AI agents with SDK, CLI, GUI, and cloud options. It supports multiple LLMs and self-hosting for enterprises.</description></item><item><title>Pathway LLM App: unified pipelines for scalable retrieval-augmented generation and AI search</title><link>https://ramdi.fr/github-stars/pathway-llm-app-unified-pipelines-for-scalable-retrieval-augmented-generation-and-ai-search/</link><pubDate>Sun, 26 Apr 2026 09:31:26 +0000</pubDate><guid>https://ramdi.fr/github-stars/pathway-llm-app-unified-pipelines-for-scalable-retrieval-augmented-generation-and-ai-search/</guid><description>Pathway LLM App provides integrated pipelines for scalable RAG and AI search, combining vector and full-text indexing with real-time sync for Gen AI apps at scale.</description></item><item><title>Awesome LLM Apps: a practical collection of runnable AI agent and RAG templates</title><link>https://ramdi.fr/github-stars/awesome-llm-apps-a-practical-collection-of-runnable-ai-agent-and-rag-templates/</link><pubDate>Fri, 24 Apr 2026 18:26:13 +0000</pubDate><guid>https://ramdi.fr/github-stars/awesome-llm-apps-a-practical-collection-of-runnable-ai-agent-and-rag-templates/</guid><description>Awesome LLM Apps offers 100+ runnable AI agent and RAG templates for quick LLM app development. It supports multiple providers and advanced multi-agent patterns with minimal setup.</description></item><item><title>Inside daily_stock_analysis: a multi-LLM automated stock analysis system</title><link>https://ramdi.fr/github-stars/inside-daily-stock-analysis-a-multi-llm-automated-stock-analysis-system/</link><pubDate>Fri, 24 Apr 2026 18:26:13 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-daily-stock-analysis-a-multi-llm-automated-stock-analysis-system/</guid><description>daily_stock_analysis combines multi-LLM integration with multi-source financial data to automate stock market decisions across global markets. It features flexible AI provider fallback and multi-channel alerts.</description></item><item><title>Browser Harness: a self-healing LLM agent for browser automation via Chrome DevTools</title><link>https://ramdi.fr/github-stars/browser-harness-a-self-healing-llm-agent-for-browser-automation-via-chrome-devtools/</link><pubDate>Fri, 24 Apr 2026 07:26:29 +0000</pubDate><guid>https://ramdi.fr/github-stars/browser-harness-a-self-healing-llm-agent-for-browser-automation-via-chrome-devtools/</guid><description>Browser Harness enables LLMs to automate browsers by dynamically generating helper functions using the Chrome DevTools Protocol, with minimal Python code and free remote browsers.</description></item><item><title>Building AI Agents with Claude Code</title><link>https://ramdi.fr/post/ai-llm/building-ai-agents-with-claude/</link><pubDate>Sat, 11 Apr 2026 00:00:00 +0000</pubDate><guid>https://ramdi.fr/post/ai-llm/building-ai-agents-with-claude/</guid><description>How to leverage Claude Code to build autonomous AI agents that publish content, review code, and manage workflows.</description></item></channel></rss>