Noureddine RAMDI Dinour

Lead Developer & AI Enthusiast — Software Architecture, AI/LLM, Infrastructure Automation

Organizations

8 results for Autonomous-Agents

Clear filter

Claw-Eval: a rigorous Python harness for trustworthy evaluation of LLM-powered autonomous agents
Claw-Eval offers a Python-based evaluation harness for LLM autonomous agents, featuring 300 tasks and a strict Pass^3 metric to ensure reliable, multi-dimensional benchmarking.
github-stars python llm agent-evaluation sandboxing Created Sat, 23 May 2026 20:41:14 +0000
LLM-MM-Agent: autonomous mathematical modeling with hierarchical method selection
LLM-MM-Agent uses LLMs as autonomous agents for end-to-end mathematical modeling, featuring a unique hierarchical method library with actor-critic selection. Supports GPT-4o and DeepSeek-R1.
github-stars python llm mathematical-modeling autonomous-agents Created Sat, 23 May 2026 20:41:14 +0000
Minds Platform: An enterprise-grade AI foundation for autonomous agents and semantic search
Minds Platform offers a Python-based AI foundation with autonomous agents and semantic search, designed for flexible enterprise deployment across cloud and on-prem environments.
github-stars python ai semantic-search autonomous-agents Created Fri, 15 May 2026 14:23:51 +0000
Goal-Driven: orchestrating long-lived AI agents with prompt-based verification loops
Goal-Driven offers a prompt-based master-subagent architecture to sustain long-running AI problem-solving sessions through a verification-driven orchestration loop without code or frameworks.
github-stars ai multi-agent prompt-engineering orchestration Created Tue, 05 May 2026 16:46:42 +0000
Mapping the AI agent self-evolution ecosystem with the awesome-agent-evolution taxonomy
The awesome-agent-evolution repo organizes 50+ open-source projects into a clear taxonomy of AI agent self-evolution and infrastructure layers, offering a practical ecosystem map for developers.
github-stars ai autonomous-agents taxonomy memory Created Tue, 05 May 2026 16:46:42 +0000
BoxPwnr: benchmarking autonomous LLM agents on cybersecurity challenges with iterative command execution
BoxPwnr benchmarks LLM-based autonomous agents on cybersecurity challenges using iterative command execution in a Kali Docker container, supporting 20+ LLM models and 13+ platforms.
github-stars python llm cybersecurity benchmarking Created Mon, 04 May 2026 10:23:01 +0000
Running autonomous software engineering agents with AWS CDK and EC2 workers
Explore how aws-samples/remote-swe-agents runs autonomous software engineering agents in dedicated EC2 instances orchestrated by AWS CDK with a Next.js interface and Amazon Bedrock LLM integration.
github-stars aws typescript autonomous-agents aws-cdk Created Mon, 04 May 2026 10:23:01 +0000
Symphony: orchestrating autonomous coding agents with work-level management
Symphony by OpenAI orchestrates autonomous coding agents via work boards and proof-of-work validation, shifting AI coding from direct supervision to task-level management.
github-stars elixir autonomous-agents agent-orchestration harness-engineering Created Sun, 03 May 2026 11:08:03 +0000