Vision-Transformer on Noureddine RAMDI

Vision-Transformer on Noureddine RAMDIhttps://ramdi.fr/tags/vision-transformer/Recent content in Vision-Transformer on Noureddine RAMDIHugoenSat, 23 May 2026 20:41:27 +0000Nougat: Vision Transformer OCR for academic PDFs extracting LaTeX math and tableshttps://ramdi.fr/github-stars/nougat-vision-transformer-ocr-for-academic-pdfs-extracting-latex-math-and-tables/Sat, 23 May 2026 20:41:14 +0000https://ramdi.fr/github-stars/nougat-vision-transformer-ocr-for-academic-pdfs-extracting-latex-math-and-tables/Nougat is Meta’s neural OCR system for academic PDFs, extracting LaTeX math and tables into structured Markdown using a Vision Transformer encoder-decoder. It offers CLI, API, and training tools.OVIE: Monocular novel view synthesis without multi-view supervisionhttps://ramdi.fr/github-stars/ovie-monocular-novel-view-synthesis-without-multi-view-supervision/Tue, 05 May 2026 13:37:39 +0000https://ramdi.fr/github-stars/ovie-monocular-novel-view-synthesis-without-multi-view-supervision/OVIE trains novel view synthesis models using unpaired internet images, avoiding the need for calibrated multi-view datasets. It uses Vision Transformers and foundation models for pose and depth encoding.