A curated and frequently updated bibliography accompanying the IJCAI 2024 survey paper on LLM-based multi-agent systems, organizing research into five key categories and revealing emerging trends.
The LLM Engineer Handbook catalogs the full lifecycle of large language model engineering, from pretraining to prompt management, guiding engineers beyond demos to production-ready LLM apps.
Neon Vision Editor is a native Swift code editor for macOS/iOS/iPadOS that pairs a minimalist design with sandbox compliance through a CLI helper built on macOS Launch Services.
NetAlertX is a Python-based network monitoring tool offering continuous asset discovery, IP management, and multi-channel alerts, deployable via Docker.
NetFluss is a Swift macOS menubar app that monitors real-time network speeds, per-app traffic, and router bandwidth with a built-in DNS switcher using a privileged helper.
Network Switch is a Kotlin Android app for quickly toggling among 34 network modes via root or Shizuku, relying on Android hidden APIs on non-root devices.
Nextcloud Talk offers self-hosted video conferencing and chat tightly integrated with Nextcloud. It supports federation, Matterbridge sync, and TURN for NAT traversal.
NomAI combines a Flutter app with a FastAPI backend using a multi-step LLM pipeline and web-grounded reasoning for nutrition analysis and meal tracking.
nomore403 is a Go CLI tool that automates HTTP 403/401 bypass testing for security researchers, using heuristic scoring to flag likely bypasses and reduce false positives.
Nougat is Meta’s neural OCR system for academic PDFs, extracting LaTeX math and tables into structured Markdown using a Vision Transformer encoder-decoder. It offers CLI, API, and training tools.
npcpy offers a unique NPC Context-Agent-Tool data layer to enforce AI compliance via software architecture, supporting multimodal LLM apps and multi-agent systems with local and cloud providers.
obsidian-llm-wiki-local generates interlinked Obsidian markdown wikis using local LLMs. Its standout feature is a rejection feedback loop that refines article quality via user input.
OCRFlux is a Python OCR tool optimized for NVIDIA GPUs, enabling fast, high-quality OCR on documents. It runs in a conda environment and relies on poppler-utils for PDF rendering.
oh-my-product extends Google’s Gemini CLI with multi-agent orchestration using tmux and slash commands for parallel AI workflows, offering persistent state and lifecycle controls.
Olive blends a traditional timeline NLE with a node-based compositing system in C++, offering GPU-accelerated rendering and professional color management via OpenColorIO. Currently in alpha.
OmniGen2 unifies visual understanding, text-to-image generation, and image editing using distinct decoding pathways for text and images, built on Qwen2.5-VL with CPU offloading for accessibility.
OmniVoice Studio is a local desktop app offering zero-shot voice cloning, multi-engine TTS, and video dubbing with GPU-aware offloading and an MCP server for agentic AI integration.
Onefetch is a Rust CLI tool that analyzes local Git repos offline, showing project stats with multi-language support and custom output formats.
opcode is a cross-platform Tauri desktop app that wraps the Claude Code CLI with a GUI, session checkpointing, custom AI agents, MCP server management, and usage analytics.
Open Computer Use uses a modular three-stage LLM pipeline to control a cloud Linux desktop, combining grounding, vision, and action models for flexible AI-driven automation.