Noureddine RAMDI Dinour

Lead Developer & AI Enthusiast — Software Architecture, AI/LLM, Infrastructure Automation

Organizations

3 results for Vllm

vLLM Compressor: Practical quantization and compression for large language model inference
vLLM Compressor applies advanced quantization and compression techniques to large language models, enabling optimized inference without requiring full model definitions.
github-stars python llm quantization compression Created Sat, 23 May 2026 20:41:14 +0000
kvcached: a plugin cache for SGLang and vLLM Python environments
kvcached provides a plugin cache layer for SGLang and vLLM Python LLM environments, easing deployment with PyPI and Docker support. Useful for optimizing LLM workflows.
github-stars python llm cache docker Created Mon, 04 May 2026 10:23:02 +0000
OpenResearcher: An open-source 30B LLM for long-horizon deep research
OpenResearcher is a fully open 30B agentic LLM designed for deep research tasks, featuring a 96K-turn dataset and a self-built retriever over 11B tokens, running on vLLM with 8×A100 GPUs.
github-stars python agentic-llm deep-research vllm Created Mon, 04 May 2026 10:23:01 +0000