Explore how ai-by-hand-excel implements deep learning architectures like Transformers entirely in Excel formulas, exposing the math behind AI step-by-step without code.
daVinci-MagiHuman uses a 15B-parameter single-stream transformer with a sandwich architecture to generate video and audio from text, achieving competitive quality and fast inference on a single H100 GPU.
MicroGPT-C uses a deterministic C scaffold to coordinate tiny GPT-2 models, achieving 90%+ accuracy on logic games with 8x memory compression and infinite sequence lengths.
A curated repo breaking down large language model internals with numeric attention math, tokenization, and transformer architecture, targeting engineers who want to understand LLMs under the hood.
llm-madness offers a Python-built GPT-style transformer training pipeline with tokenizer training, memory-mapped datasets, and a unique web UI for per-layer attention inspection and loss visualization.
NOVA3R implements a non-pixel-aligned visual transformer for amodal 3D reconstruction from unposed multi-view images, recovering occluded geometry with physical plausibility.