Graph-R1 trains large language models with reinforcement learning to reason over knowledge graphs, cycling through think-query-retrieve-rethink steps for complex knowledge tasks.
Explore the Hands-On Large Language Models repo, a Jupyter notebook-based practical guide from fundamentals to fine-tuning, designed for hands-on LLM learning on free Colab GPUs.
MegaTrain enables training 100B+ parameter LLMs on a single GPU by offloading all parameters to CPU RAM and streaming layers to GPU. Supports HuggingFace models and multi-GPU data parallelism without NCCL.