Autonomous-Agents on Noureddine RAMDI

Autonomous-Agents on Noureddine RAMDIhttps://ramdi.fr/tags/autonomous-agents/Recent content in Autonomous-Agents on Noureddine RAMDIHugoenSat, 23 May 2026 20:41:27 +0000Claw-Eval: a rigorous Python harness for trustworthy evaluation of LLM-powered autonomous agentshttps://ramdi.fr/github-stars/claw-eval-a-rigorous-python-harness-for-trustworthy-evaluation-of-llm-powered-autonomous-agents/Sat, 23 May 2026 20:41:14 +0000https://ramdi.fr/github-stars/claw-eval-a-rigorous-python-harness-for-trustworthy-evaluation-of-llm-powered-autonomous-agents/Claw-Eval offers a Python-based evaluation harness for LLM autonomous agents, featuring 300 tasks and a strict Pass^3 metric to ensure reliable, multi-dimensional benchmarking.LLM-MM-Agent: autonomous mathematical modeling with hierarchical method selectionhttps://ramdi.fr/github-stars/llm-mm-agent-autonomous-mathematical-modeling-with-hierarchical-method-selection/Sat, 23 May 2026 20:41:14 +0000https://ramdi.fr/github-stars/llm-mm-agent-autonomous-mathematical-modeling-with-hierarchical-method-selection/LLM-MM-Agent uses LLMs as autonomous agents for end-to-end mathematical modeling, featuring a unique hierarchical method library with actor-critic selection. Supports GPT-4o and DeepSeek-R1.Minds Platform: An enterprise-grade AI foundation for autonomous agents and semantic searchhttps://ramdi.fr/github-stars/minds-platform-an-enterprise-grade-ai-foundation-for-autonomous-agents-and-semantic-search/Fri, 15 May 2026 14:23:51 +0000https://ramdi.fr/github-stars/minds-platform-an-enterprise-grade-ai-foundation-for-autonomous-agents-and-semantic-search/Minds Platform offers a Python-based AI foundation with autonomous agents and semantic search, designed for flexible enterprise deployment across cloud and on-prem environments.Goal-Driven: orchestrating long-lived AI agents with prompt-based verification loopshttps://ramdi.fr/github-stars/goal-driven-orchestrating-long-lived-ai-agents-with-prompt-based-verification-loops/Tue, 05 May 2026 16:46:42 +0000https://ramdi.fr/github-stars/goal-driven-orchestrating-long-lived-ai-agents-with-prompt-based-verification-loops/Goal-Driven offers a prompt-based master-subagent architecture to sustain long-running AI problem-solving sessions through a verification-driven orchestration loop without code or frameworks.Mapping the AI agent self-evolution ecosystem with the awesome-agent-evolution taxonomyhttps://ramdi.fr/github-stars/mapping-the-ai-agent-self-evolution-ecosystem-with-the-awesome-agent-evolution-taxonomy/Tue, 05 May 2026 16:46:42 +0000https://ramdi.fr/github-stars/mapping-the-ai-agent-self-evolution-ecosystem-with-the-awesome-agent-evolution-taxonomy/The awesome-agent-evolution repo organizes 50+ open-source projects into a clear taxonomy of AI agent self-evolution and infrastructure layers, offering a practical ecosystem map for developers.BoxPwnr: benchmarking autonomous LLM agents on cybersecurity challenges with iterative command executionhttps://ramdi.fr/github-stars/boxpwnr-benchmarking-autonomous-llm-agents-on-cybersecurity-challenges-with-iterative-command-execution/Mon, 04 May 2026 10:23:01 +0000https://ramdi.fr/github-stars/boxpwnr-benchmarking-autonomous-llm-agents-on-cybersecurity-challenges-with-iterative-command-execution/BoxPwnr benchmarks LLM-based autonomous agents on cybersecurity challenges using iterative command execution in a Kali Docker container, supporting 20+ LLM models and 13+ platforms.Running autonomous software engineering agents with AWS CDK and EC2 workershttps://ramdi.fr/github-stars/running-autonomous-software-engineering-agents-with-aws-cdk-and-ec2-workers/Mon, 04 May 2026 10:23:01 +0000https://ramdi.fr/github-stars/running-autonomous-software-engineering-agents-with-aws-cdk-and-ec2-workers/Explore how aws-samples/remote-swe-agents runs autonomous software engineering agents in dedicated EC2 instances orchestrated by AWS CDK with a Next.js interface and Amazon Bedrock LLM integration.Symphony: orchestrating autonomous coding agents with work-level managementhttps://ramdi.fr/github-stars/symphony-orchestrating-autonomous-coding-agents-with-work-level-management/Sun, 03 May 2026 11:08:03 +0000https://ramdi.fr/github-stars/symphony-orchestrating-autonomous-coding-agents-with-work-level-management/Symphony by OpenAI orchestrates autonomous coding agents via work boards and proof-of-work validation, shifting AI coding from direct supervision to task-level management.