Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Updated Apr 30, 2025 - Python
An agent benchmark with tasks in a simulated software company.
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your Claude Code/Codex/Gemini agent will be an AI research agent with full horsepower. Maintained by Orchestra Research.
A Python toolkit for chain-of-thought prompting 🐍
Generate full-fledged PDF reports using LLMs like GPT, Claude, and Llama
Automated Deep Research with LLMs, web search, paper parsing, and didactic summarization.
Deploy your autonomous agents to production-grade environments with a 99% uptime guarantee, infinite scalability, and self-healing.
🌪️ AI research assistant that generates Wikipedia-quality articles through multi-perspective analysis. Based on Stanford's STORM methodology.
Multimodal generative AI resources: talking heads, STT, TTS, image & video generation, and more.
AIRAS - an open-source project for research automation
Template repository for the Werewolf hackathon
Interactive tool for analyzing attention patterns in transformer models with layer-wise visualizations, token importance scoring, and attention flow diagrams
Transformers + Mambas + LSTMs, all in one model
PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final, consensus-driven result. Designed for testing, comparing, and orchestrating local models with ease.
Official implementation of "Automated Algorithmic Discovery for Gravitational-Wave Detection Guided by LLM-Informed Evolutionary Monte Carlo Tree Search" (arXiv:2508.03661).
MCP persistent memory systems for LLMs - CASCADE 6-layer memory + Faiss GPU search (<2ms). Give any AI persistent memory across conversations. Open source, MIT license.
An open-source implementation of Mamba 2 in one file of PyTorch
Brain-inspired cognitive architecture implementing basal ganglia RL, hippocampal memory consolidation, and prefrontal meta-cognition. Multi-agent system with dynamic attention control, procedural learning, and theory of mind - genuine cognitive continuity beyond context windows.
A comprehensive collection of PyTorch implementations for the VGG (Visual Geometry Group) models
AI Research Trend Dashboard — Real-time visualization of 30+ AI research subfields using OpenAlex (automatic yearly updates, mini trend charts, rankings, heatmap).