<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Reinforcement-Learning on Noureddine RAMDI</title><link>https://ramdi.fr/tags/reinforcement-learning/</link><description>Recent content in Reinforcement-Learning on Noureddine RAMDI</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 23 May 2026 20:41:27 +0000</lastBuildDate><atom:link href="https://ramdi.fr/tags/reinforcement-learning/index.xml" rel="self" type="application/rss+xml"/><item><title>Exploring GMR: real-time cross-embodiment human motion retargeting for humanoid robots</title><link>https://ramdi.fr/github-stars/exploring-gmr-real-time-cross-embodiment-human-motion-retargeting-for-humanoid-robots/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/exploring-gmr-real-time-cross-embodiment-human-motion-retargeting-for-humanoid-robots/</guid><description>GMR is a Python library that retargets human motion from multiple formats onto 17+ humanoid robots in real time on CPU, tuned for RL tracking policies and whole-body teleoperation.</description></item><item><title>Graph-R1: Reinforcement learning to train LLMs for reasoning over knowledge graphs</title><link>https://ramdi.fr/github-stars/graph-r1-reinforcement-learning-to-train-llms-for-reasoning-over-knowledge-graphs/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/graph-r1-reinforcement-learning-to-train-llms-for-reasoning-over-knowledge-graphs/</guid><description>Graph-R1 trains large language models with reinforcement learning to reason over knowledge graphs, cycling through think-query-retrieve-rethink steps for complex knowledge tasks.</description></item><item><title>GS-Playground: High-throughput photorealistic simulation for vision-based robot learning</title><link>https://ramdi.fr/github-stars/gs-playground-high-throughput-photorealistic-simulation-for-vision-based-robot-learning/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/gs-playground-high-throughput-photorealistic-simulation-for-vision-based-robot-learning/</guid><description>GS-Playground combines 3D Gaussian Splatting rendering with a velocity-impulse physics engine to enable large-scale visual reinforcement learning at up to 10^4 FPS. Preview release with core simulation API and demos.</description></item><item><title>ML-From-Scratch: Exploring Machine Learning Fundamentals with Pure Python and NumPy</title><link>https://ramdi.fr/github-stars/ml-from-scratch-exploring-machine-learning-fundamentals-with-pure-python-and-numpy/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/ml-from-scratch-exploring-machine-learning-fundamentals-with-pure-python-and-numpy/</guid><description>ML-From-Scratch offers bare-bones Python implementations of key machine learning algorithms using only NumPy, focusing on transparency over efficiency. Explore how it demystifies ML fundamentals.</description></item><item><title>UI-Voyager: Self-evolving AI agent for Android GUI automation with SSIM-based trajectory correction</title><link>https://ramdi.fr/github-stars/ui-voyager-self-evolving-ai-agent-for-android-gui-automation-with-ssim-based-trajectory-correction/</link><pubDate>Mon, 04 May 2026 10:23:03 +0000</pubDate><guid>https://ramdi.fr/github-stars/ui-voyager-self-evolving-ai-agent-for-android-gui-automation-with-ssim-based-trajectory-correction/</guid><description>UI-Voyager is a 4B parameter AI agent achieving 81% success on AndroidWorld by self-evolving with SSIM-based trajectory correction, no human labels needed.</description></item><item><title>Deploying RL-trained motion tracking policies on legged robots with motion_tracking_controller</title><link>https://ramdi.fr/github-stars/deploying-rl-trained-motion-tracking-policies-on-legged-robots-with-motion-tracking-controller/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/deploying-rl-trained-motion-tracking-policies-on-legged-robots-with-motion-tracking-controller/</guid><description>motion_tracking_controller is a C++ ROS 2 package deploying RL-trained motion tracking policies on legged robots with ONNX inference and embedded robot control metadata.</description></item><item><title>FinRL-Trading: modular, weight-centric quantitative trading with deployment-consistent backtesting and DRL portfolio allocation</title><link>https://ramdi.fr/github-stars/finrl-trading-modular-weight-centric-quantitative-trading-with-deployment-consistent-backtesting-and-drl-portfolio-allocation/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/finrl-trading-modular-weight-centric-quantitative-trading-with-deployment-consistent-backtesting-and-drl-portfolio-allocation/</guid><description>FinRL-Trading offers a modular Python framework for quantitative trading focused on a weight-centric architecture unifying backtesting and live execution, with classical and DRL portfolio methods.</description></item><item><title>Inside Alibaba’s VRAG: Multimodal Retrieval-Augmented Generation with Dynamic Reasoning Graphs</title><link>https://ramdi.fr/github-stars/inside-alibabas-vrag-multimodal-retrieval-augmented-generation-with-dynamic-reasoning-graphs/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-alibabas-vrag-multimodal-retrieval-augmented-generation-with-dynamic-reasoning-graphs/</guid><description>Alibaba&amp;rsquo;s VRAG models reasoning as a dynamic DAG with multimodal memory and RL-based fine-grained credit assignment, supporting text, image, and video retrieval in a unified framework.</description></item><item><title>TradeMaster: A rigorous reinforcement learning platform for quantitative trading research</title><link>https://ramdi.fr/github-stars/trademaster-a-rigorous-reinforcement-learning-platform-for-quantitative-trading-research/</link><pubDate>Mon, 04 May 2026 10:23:02 +0000</pubDate><guid>https://ramdi.fr/github-stars/trademaster-a-rigorous-reinforcement-learning-platform-for-quantitative-trading-research/</guid><description>TradeMaster offers a full pipeline for RL-based quantitative trading with 13+ algorithms and a rigorous 6-axis, 17-measure evaluation framework across multiple asset classes and trading tasks.</description></item><item><title>FinRL: open-source framework for financial reinforcement learning with a train-test-trade pipeline</title><link>https://ramdi.fr/github-stars/finrl-open-source-framework-for-financial-reinforcement-learning-with-a-train-test-trade-pipeline/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/finrl-open-source-framework-for-financial-reinforcement-learning-with-a-train-test-trade-pipeline/</guid><description>FinRL provides an open-source three-layer architecture for financial reinforcement learning with 5 DRL agents and 14+ data sources. Great for learning DRL in finance.</description></item><item><title>Inside ToddlerBot: an open-source Python platform for multi-skill humanoid locomotion with depth-based skill classification</title><link>https://ramdi.fr/github-stars/inside-toddlerbot-an-open-source-python-platform-for-multi-skill-humanoid-locomotion-with-depth-based-skill-classification/</link><pubDate>Mon, 04 May 2026 10:23:01 +0000</pubDate><guid>https://ramdi.fr/github-stars/inside-toddlerbot-an-open-source-python-platform-for-multi-skill-humanoid-locomotion-with-depth-based-skill-classification/</guid><description>ToddlerBot offers a full Python stack for training, classifying, and deploying multi-skill humanoid locomotion policies using stereo depth data and reinforcement learning.</description></item></channel></rss>