Attention on Noureddine RAMDI

Attention on Noureddine RAMDIhttps://ramdi.fr/tags/attention/Recent content in Attention on Noureddine RAMDIHugoenSat, 23 May 2026 20:41:27 +0000Tracing deep learning step-by-step in Excel: a hands-on guide to ai-by-hand-excelhttps://ramdi.fr/github-stars/tracing-deep-learning-step-by-step-in-excel-a-hands-on-guide-to-ai-by-hand-excel/Sat, 23 May 2026 20:41:14 +0000https://ramdi.fr/github-stars/tracing-deep-learning-step-by-step-in-excel-a-hands-on-guide-to-ai-by-hand-excel/Explore how ai-by-hand-excel implements deep learning architectures like Transformers entirely in Excel formulas, exposing the math behind AI step-by-step without code.vLLM: Efficient large language model serving with paged attention and continuous batchinghttps://ramdi.fr/github-stars/vllm-efficient-large-language-model-serving-with-paged-attention-and-continuous-batching/Sat, 02 May 2026 20:07:04 +0000https://ramdi.fr/github-stars/vllm-efficient-large-language-model-serving-with-paged-attention-and-continuous-batching/vLLM is a Python library for high-throughput LLM inference using paged attention and continuous batching. It supports quantization, distributed inference, and an OpenAI-compatible API.