Noureddine RAMDI Dinour

Lead Developer & AI Enthusiast — Software Architecture, AI/LLM, Infrastructure Automation

Organizations

12 results for Reinforcement-Learning

Clear filter

Inside InternVL: Open-Source Multimodal Large Language Models with Reinforcement Learning
InternVL offers open-source multimodal large language models combining vision transformers and LLMs, featuring CascadeRL training and competitive benchmarks like GPT-4o.
github-stars multimodal llm reinforcement-learning vision-transformer Created Mon, 06 Jul 2026 15:15:52 +0000
Exploring GMR: real-time cross-embodiment human motion retargeting for humanoid robots
GMR is a Python library that retargets human motion from multiple formats onto 17+ humanoid robots in real time on CPU, tuned for RL tracking policies and whole-body teleoperation.
github-stars python robotics motion-retargeting reinforcement-learning Created Sat, 23 May 2026 20:41:14 +0000
Graph-R1: Reinforcement learning to train LLMs for reasoning over knowledge graphs
Graph-R1 trains large language models with reinforcement learning to reason over knowledge graphs, cycling through think-query-retrieve-rethink steps for complex knowledge tasks.
github-stars python reinforcement-learning large-language-models knowledge-graphs Created Sat, 23 May 2026 20:41:14 +0000
GS-Playground: High-throughput photorealistic simulation for vision-based robot learning
GS-Playground combines 3D Gaussian Splatting rendering with a velocity-impulse physics engine to enable large-scale visual reinforcement learning at up to 10^4 FPS. Preview release with core simulation API and demos.
github-stars robotics simulation reinforcement-learning 3d-gaussian-splatting Created Sat, 23 May 2026 20:41:14 +0000
ML-From-Scratch: Exploring Machine Learning Fundamentals with Pure Python and NumPy
ML-From-Scratch offers bare-bones Python implementations of key machine learning algorithms using only NumPy, focusing on transparency over efficiency. Explore how it demystifies ML fundamentals.
github-stars machine learning python numpy education Created Sat, 23 May 2026 20:41:14 +0000
UI-Voyager: Self-evolving AI agent for Android GUI automation with SSIM-based trajectory correction
UI-Voyager is a 4B parameter AI agent achieving 81% success on AndroidWorld by self-evolving with SSIM-based trajectory correction, no human labels needed.
github-stars python ai-agent android-automation reinforcement-learning Created Mon, 04 May 2026 10:23:03 +0000
Deploying RL-trained motion tracking policies on legged robots with motion_tracking_controller
motion_tracking_controller is a C++ ROS 2 package deploying RL-trained motion tracking policies on legged robots with ONNX inference and embedded robot control metadata.
github-stars c++ ros2 robotics onnx Created Mon, 04 May 2026 10:23:02 +0000
FinRL-Trading: modular, weight-centric quantitative trading with deployment-consistent backtesting and DRL portfolio allocation
FinRL-Trading offers a modular Python framework for quantitative trading focused on a weight-centric architecture unifying backtesting and live execution, with classical and DRL portfolio methods.
github-stars python quantitative-trading reinforcement-learning portfolio-management Created Mon, 04 May 2026 10:23:02 +0000
Inside Alibaba’s VRAG: Multimodal Retrieval-Augmented Generation with Dynamic Reasoning Graphs
Alibaba’s VRAG models reasoning as a dynamic DAG with multimodal memory and RL-based fine-grained credit assignment, supporting text, image, and video retrieval in a unified framework.
github-stars python multimodal rag reinforcement-learning Created Mon, 04 May 2026 10:23:02 +0000
TradeMaster: A rigorous reinforcement learning platform for quantitative trading research
TradeMaster offers a full pipeline for RL-based quantitative trading with 13+ algorithms and a rigorous 6-axis, 17-measure evaluation framework across multiple asset classes and trading tasks.
github-stars reinforcement-learning quantitative-trading market-simulation portfolio-management Created Mon, 04 May 2026 10:23:02 +0000
FinRL: open-source framework for financial reinforcement learning with a train-test-trade pipeline
FinRL provides an open-source three-layer architecture for financial reinforcement learning with 5 DRL agents and 14+ data sources. Great for learning DRL in finance.
github-stars reinforcement learning finance deep learning stable baselines Created Mon, 04 May 2026 10:23:01 +0000
Inside ToddlerBot: an open-source Python platform for multi-skill humanoid locomotion with depth-based skill classification
ToddlerBot offers a full Python stack for training, classifying, and deploying multi-skill humanoid locomotion policies using stereo depth data and reinforcement learning.
github-stars robotics reinforcement-learning python mujoco Created Mon, 04 May 2026 10:23:01 +0000