Overview
Posts
6
GitHub Stars
1328
Noureddine RAMDI
🚀
Noureddine RAMDI
Dinour
Lead Developer & AI Enthusiast — Software Architecture, AI/LLM, Infrastructure Automation
France
noureddine@ramdi.fr
https://ramdi.fr
Organizations
Overview
Posts
6
GitHub Stars
1328
1
results for
Scientific-Papers
Clear filter
paperetl: a modular ETL pipeline for scientific papers with multi-format ingestion and unified schema
paperetl is a Python ETL library that normalizes PDFs, PubMed, arXiv, TEI, and CSV metadata into a unified article schema, supporting SQLite, JSON, YAML, and Elasticsearch storage.
github-stars
python
etl
scientific-papers
grobid
Created
Sat, 23 May 2026 20:41:14 +0000
Previous
Next