Noureddine RAMDI Dinour

Lead Developer & AI Enthusiast — Software Architecture, AI/LLM, Infrastructure Automation

Organizations

154 results for Llm

Clear filter

unslop: empirically detecting and avoiding repetitive LLM output patterns
unslop is a Python CLI tool that detects repetitive defaults in LLM outputs by empirical analysis, generating reusable anti-pattern profiles to improve prompt engineering.
github-stars python llm cli prompt-engineering Created Mon, 04 May 2026 10:11:02 +0000
AI penetration testing knowledge base: structured resources for LLM security research
A curated repository for AI/LLM penetration testing covering prompt injection, adversarial ML, and LLM red teaming with the OWASP LLM Top 10 framework.
github-stars security ai llm penetration-testing Created Mon, 04 May 2026 10:09:00 +0000
Inside llm_wiki: a desktop app for building persistent LLM-powered personal wikis
llm_wiki uses a two-step chain-of-thought pipeline to build a self-maintaining knowledge base. It combines Tauri, knowledge graphs, and Louvain clustering for a unique personal wiki experience.
github-stars typescript llm knowledge-base tauri Created Mon, 04 May 2026 10:05:49 +0000
Building a production-ready second brain with agentic RAG and LLMOps
Explore an open-source course that teaches building a production-grade AI assistant using advanced retrieval-augmented generation, agent orchestration, fine-tuning, and LLMOps practices.
github-stars machine learning llm rag agentic-ai Created Sun, 03 May 2026 08:12:11 +0000
Navigating free-tier LLM APIs with the awesome-free-llm-apis catalog
A curated catalog of free-tier LLM APIs compatible with OpenAI SDK, detailing rate limits, model specs, and providers to build zero-cost AI applications.
github-stars llm api free-tier openai-sdk Created Sun, 03 May 2026 08:12:11 +0000
A-MEM: dynamic semantic memory management for LLM agents inspired by Zettelkasten
A-MEM is a Python agentic memory system that dynamically organizes LLM agent memories using semantic embeddings and automatic linking, inspired by Zettelkasten.
github-stars python llm agent-memory chroma-db Created Sun, 03 May 2026 00:54:10 +0000
A hands-on course for mastering large language models: fine-tuning, quantization, and tooling
Explore a comprehensive LLM course with practical notebooks on fine-tuning (QLoRA, DPO), quantization (GPTQ), and tools like AutoEval and LazyMergekit. Ideal for aspiring LLM engineers.
github-stars llm fine-tuning quantization python Created Sat, 02 May 2026 20:07:04 +0000
Hermes Agent: A self-improving AI agent with closed learning loops and multi-platform integration
Hermes Agent is a Python AI agent featuring closed learning loops, autonomous skill creation, multi-model support, and seamless Telegram/Discord integration for persistent, adaptable AI workflows.
github-stars ai agent python llm Created Sat, 02 May 2026 20:07:04 +0000
LlamaFactory: modular, extensible fine-tuning framework for large language models
LlamaFactory offers a modular Python framework for fine-tuning 100+ LLMs with diverse algorithms and optimizations, including LoRA, QLoRA, and reinforcement learning.
github-stars python llm fine-tuning machine-learning Created Sat, 02 May 2026 20:07:04 +0000
LocalAI: running diverse AI models locally with multi-backend support and agent capabilities
LocalAI enables running 36+ AI models locally without GPU, supporting multi-user API access and built-in AI agents with OpenAI-compatible APIs. Here’s how it works and why it matters.
github-stars go ai llm local-inference Created Sat, 02 May 2026 20:07:04 +0000
mem0: optimizing AI agent memory with a new single-pass additive algorithm
mem0 enhances AI agent memory with a new single-pass ADD-only extraction algorithm and multi-signal retrieval, boosting benchmarks significantly while simplifying memory management.
github-stars python ai memory-management llm Created Sat, 02 May 2026 20:07:04 +0000
MetaGPT: orchestrating multi-agent AI teams to automate software development
MetaGPT uses a multi-agent system with defined GPT roles following SOPs to automate software development from one-line prompts. It simulates a software company with role-based AI collaboration.
github-stars python llm multi-agent software-automation Created Sat, 02 May 2026 20:07:04 +0000
Ollama: a unified CLI and API platform for local large language models
Ollama simplifies running and managing open-source large language models locally with a unified CLI and REST API, supporting broad integrations and multi-OS support.
github-stars go llm cli rest-api Created Sat, 02 May 2026 20:07:04 +0000
vLLM: Efficient large language model serving with paged attention and continuous batching
vLLM is a Python library for high-throughput LLM inference using paged attention and continuous batching. It supports quantization, distributed inference, and an OpenAI-compatible API.
github-stars python llm inference gpu Created Sat, 02 May 2026 20:07:04 +0000
TradingAgents: a multi-agent LLM framework simulating real-world trading firm dynamics
TradingAgents uses specialized LLM agents in a structured bull/bear debate to mimic real trading firms. Supports 10+ LLMs, persistent memory, and CLI/Docker usage.
github-stars python llm multi-agent trading Created Sat, 02 May 2026 07:48:10 +0000
Qwen Code: A multi-provider terminal AI coding agent with unified config abstraction
Qwen Code is a TypeScript terminal AI coding agent that abstracts multiple LLM providers behind a unified config, enabling flexible AI workflows with Skills and SubAgents.
github-stars typescript ai-agent cli llm Created Tue, 28 Apr 2026 18:38:54 +0000
Hunting Tokens/sec: 4 LLM Backends, 1 Hard Ceiling (Part 2/4)
Part 2 of 4: a benchmark journal across nixpkgs llama.cpp, upstream master, and ik_llama.cpp on Qwen3.6-27B. Six hours, four backends, all converging at 66 tok/s — and the physical reason why.
ai llm benchmark llama-cpp qwen Created Tue, 28 Apr 2026 00:00:00 +0000
Speculative Decoding Meets Hybrid SSM: Why It Breaks (Part 3/4)
Part 3 of 4: a deep-dive into why speculative decoding silently breaks (or runs anti-economically) on hybrid attention+SSM architectures like Qwen3.6, Mamba-2, and RWKV — and what would need to change upstream to fix it.
ai llm speculative-decoding qwen mamba Created Tue, 28 Apr 2026 00:00:00 +0000
The NixOS Setup for llama.cpp: Declarative and Reproducible (Part 4/4)
Part 4 of 4: the actual NixOS module, llama-pull helper, claude-code-router wiring, and one-line workflow for switching models. Five Nix files for a complete, isolated, rollback-able local LLM service.
ai llm nixos llama-cpp systemd Created Tue, 28 Apr 2026 00:00:00 +0000
Why I Serve Qwen3.6 Locally on My RTX 5090 (Part 1/4)
Part 1 of 4: motivation, hardware, and stack choices for serving Qwen3.6-27B locally on a 32 GB consumer GPU with NixOS, before any benchmarks or trade-offs kick in.
ai llm qwen rtx5090 nixos Created Tue, 28 Apr 2026 00:00:00 +0000

Previous Next