Noureddine RAMDI Dinour

Lead Developer & AI Enthusiast — Software Architecture, AI/LLM, Infrastructure Automation

36 results for Deep-Learning

SegMAN: Combining State Space Models with Neighborhood Attention for Semantic Segmentation
SegMAN blends State Space Models and Neighborhood Attention within a hybrid encoder-decoder for semantic segmentation, balancing long-range context with local detail. It achieves competitive mIoU on ADE20K with models from 6.4M to 92.6M parameters.
github-stars semantic segmentation state space models neighborhood attention mms segmentation Created Mon, 06 Jul 2026 15:15:52 +0000
ALICE: a self-contained YOLO dataset management toolkit with a creative single-file Python builder
ALICE is a Python-based toolkit for managing YOLO training datasets from home camera setups, featuring a single-file builder and Frigate NVR integration.
github-stars python yolo dataset-management deep-learning Created Sat, 23 May 2026 20:41:14 +0000
DeepSpeed: scalable deep learning optimization with extensible hardware support
DeepSpeed is a Python library that optimizes large-scale deep learning training with multi-hardware support and JIT CUDA extensions. Explore its architecture, strengths, and quick installation.
github-stars python deep-learning pytorch cuda Created Sat, 23 May 2026 20:41:14 +0000
DiT4DiT: Vision-Action Modeling with Video Transformers for Real-Time Humanoid Robot Control
DiT4DiT uses a frozen Cosmos-Predict2.5 video transformer backbone combined with flow-matching action heads to model robot actions as video latent transitions, achieving near-perfect success on LIBERO and real-time humanoid control.
github-stars robotics video-transformers vision-action-model flow-matching Created Sat, 23 May 2026 20:41:14 +0000
Hivemind: decentralized peer-to-peer deep learning with PyTorch
Hivemind is a PyTorch library enabling decentralized deep learning over the internet using a peer-to-peer Distributed Hash Table (DHT). It supports fault-tolerant training and decentralized parameter averaging without global sync.
github-stars python pytorch distributed-training decentralized Created Sat, 23 May 2026 20:41:14 +0000
ML-From-Scratch: Exploring Machine Learning Fundamentals with Pure Python and NumPy
ML-From-Scratch offers bare-bones Python implementations of key machine learning algorithms using only NumPy, focusing on transparency over efficiency. Explore how it demystifies ML fundamentals.
github-stars machine learning python numpy education Created Sat, 23 May 2026 20:41:14 +0000
OmniGen2: a unified multimodal generation model with separate decoding paths for text and images
OmniGen2 unifies visual understanding, text-to-image generation, and image editing using distinct decoding pathways for text and images, built on Qwen2.5-VL with CPU offloading for accessibility.
github-stars multimodal deep-learning pytorch image-generation Created Sat, 23 May 2026 20:41:14 +0000
OverlapNet: Siamese networks for loop closure detection in 3D LiDAR SLAM
OverlapNet uses Siamese networks on 2D range images from 3D LiDAR to detect loop closures by predicting overlap and relative yaw angle simultaneously. Practical demos included.
github-stars python lidar slam machine-learning Created Sat, 23 May 2026 20:41:14 +0000
Pixal3D: pixel-aligned 3D asset generation from a single image with projection conditioning
Pixal3D generates high-fidelity 3D assets with PBR textures from a single image using pixel-aligned projection conditioning. It offers a three-stage cascade and low-VRAM mode for consumer GPUs.
github-stars python 3d-generation pbr-texturing deep-learning Created Sat, 23 May 2026 20:41:14 +0000
SAM3-UNet: Adapting Meta's SAM3 for efficient dense prediction with a lightweight U-Net decoder
SAM3-UNet adapts Meta’s SAM3 foundation model for dense prediction tasks using a parameter-efficient adapter and U-Net decoder, enabling training with under 6 GB of GPU memory.
github-stars python computer-vision segmentation deep-learning Created Sat, 23 May 2026 20:41:14 +0000
SVFR: unified video face restoration with task-conditioned stable video diffusion
SVFR combines blind face restoration, colorization, and inpainting in a single stable video diffusion model, enabling efficient multi-task video face enhancement.
github-stars python video-restoration diffusion-models deep-learning Created Sat, 23 May 2026 20:41:14 +0000
Tencent HY-World 2.0: multi-modal pipeline for persistent, editable 3D world generation
Tencent’s HY-World 2.0 generates persistent 3D assets from text, images, or video using a four-stage pipeline. It outputs editable worlds compatible with Blender, Unity, and Unreal Engine.
github-stars 3d generative-ai python deep-learning Created Sat, 23 May 2026 20:41:14 +0000
Tracing deep learning step-by-step in Excel: a hands-on guide to ai-by-hand-excel
Explore how ai-by-hand-excel implements deep learning architectures like Transformers entirely in Excel formulas, exposing the math behind AI step-by-step without code.
github-stars deep learning excel transformer attention Created Sat, 23 May 2026 20:41:14 +0000
CodeFormer: Deep learning-based blind face restoration with fidelity control
CodeFormer uses a codebook transformer architecture for blind face restoration, letting users control the tradeoff between quality and fidelity with a unique fidelity weight parameter.
github-stars python deep-learning face-restoration computer-vision Created Tue, 05 May 2026 13:37:39 +0000
Medical-SAM3: adapting foundation models for prompt-driven medical image segmentation
Medical-SAM3 adapts the SAM3 foundation model for universal prompt-driven medical image segmentation, offering pretrained weights and evaluation tools on diverse medical datasets.
github-stars medical-imaging segmentation foundation-models python Created Tue, 05 May 2026 13:37:39 +0000
A curated 100-day machine learning journey with code and resources
Explore a 100-day machine learning coding challenge combining classical algorithms, deep learning, and curated resources. A practical, day-by-day learning path for self-directed devs.
github-stars machine learning deep learning scikit-learn tensorflow Created Mon, 04 May 2026 10:23:03 +0000
AniGen: GPU-accelerated 3D animation generation with Python and CUDA
AniGen is a Linux-only Python project for 3D animation generation using NVIDIA GPUs and CUDA. It integrates PyTorch, spconv, and pytorch3d, with a setup script that handles the complex dependencies.
github-stars python cuda pytorch 3d-animation Created Mon, 04 May 2026 10:23:02 +0000
Awesome-Deblurring: A comprehensive academic resource on image and video deblurring techniques
Awesome-Deblurring compiles 100+ key papers tracing image and video deblurring from classical optimization to modern deep learning, serving as a go-to bibliography for researchers and developers.
github-stars image-processing computer-vision deep-learning academic-resource Created Mon, 04 May 2026 10:23:02 +0000
DIMO: Distilling Diverse 3D Motion Priors for Arbitrary Object Motion Synthesis
DIMO distills motion priors from text-conditioned and multi-view video models into a shared latent space, enabling diverse 3D motion generation for arbitrary objects using 3D Gaussian splatting and 4D rendering.
github-stars python pytorch 3d-motion 3d-gaussian-splatting Created Mon, 04 May 2026 10:23:02 +0000
Magika: Google's deep learning system for fast, accurate file type detection
Magika replaces magic-byte heuristics with a tiny deep learning model for file type detection, achieving ~99% accuracy across 200+ types with about 5 ms of CPU inference per file.
github-stars python rust deep-learning file-detection Created Mon, 04 May 2026 10:23:02 +0000