<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Vision-Transformer on Noureddine RAMDI</title><link>https://ramdi.fr/tags/vision-transformer/</link><description>Recent content in Vision-Transformer on Noureddine RAMDI</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 23 May 2026 20:41:27 +0000</lastBuildDate><atom:link href="https://ramdi.fr/tags/vision-transformer/index.xml" rel="self" type="application/rss+xml"/><item><title>Nougat: Vision Transformer OCR for academic PDFs extracting LaTeX math and tables</title><link>https://ramdi.fr/github-stars/nougat-vision-transformer-ocr-for-academic-pdfs-extracting-latex-math-and-tables/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/nougat-vision-transformer-ocr-for-academic-pdfs-extracting-latex-math-and-tables/</guid><description>Nougat is Meta&amp;rsquo;s neural OCR system for academic PDFs, extracting LaTeX math and tables into structured Markdown using a Vision Transformer encoder-decoder. It offers CLI, API, and training tools.</description></item><item><title>OVIE: Monocular novel view synthesis without multi-view supervision</title><link>https://ramdi.fr/github-stars/ovie-monocular-novel-view-synthesis-without-multi-view-supervision/</link><pubDate>Tue, 05 May 2026 13:37:39 +0000</pubDate><guid>https://ramdi.fr/github-stars/ovie-monocular-novel-view-synthesis-without-multi-view-supervision/</guid><description>OVIE trains novel view synthesis models using unpaired internet images, avoiding the need for calibrated multi-view datasets. It uses Vision Transformers and foundation models for pose and depth encoding.</description></item></channel></rss>