Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
-
Updated
Jan 9, 2026 - Go
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com
China Unicom's Yuanjing Wanwu Agent Platform is an enterprise-grade, multi-tenant AI agent development platform. It helps users build applications such as intelligent agents, workflows, and rag, and also supports model management. The platform features a developer-friendly license, and we welcome all developers to build upon the platform.
Distributed vector search for AI-native applications
Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.
♾️ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing.
🕵️♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.
AI coding agent for your terminal.
Kubernetes for AI Agents. Build and run AI like microservices - scalable, observable, and identity-aware from day one.
The Kubernetes operator for K8ssandra
A lightweight, production-ready RAG (Retrieval Augmented Generation) library in Go.
Cut LLM costs by up to 80% and unlock sub-millisecond responses with intelligent semantic caching.A drop-in, provider-agnostic LLM proxy written in Go with sub-millisecond response
🧠 LLM-Driven Intelligent Memory & Context Management System (AI记忆管理与智能上下文感知平台) AI记忆管理平台 | 智能上下文感知 | RAG检索增强生成 | 向量检索引擎
An agent framework for Go with graph-aware memory, UTCP-native tools, and multi-agent orchestration. Built for production.
The implementation of Model Context Protocol (MCP) server for VictoriaMetrics
A diverse, simple, and secure all-in-one LLMOps platform
Go implementation of @qdrant/fastembed.
Add a description, image, and links to the rag topic page so that developers can more easily learn about it.
To associate your repository with the rag topic, visit your repo's landing page and select "manage topics."