#

ml-validation

Here are 4 public repositories matching this topic...

giskard-oss

Giskard-AI / giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

ai-security mlops fairness-ai responsible-ai ml-validation red-team-tools trustworthy-ai ml-testing llm ai-red-team ai-testing llmops llm-security llm-eval llm-evaluation rag-evaluation agent-evaluation

Updated Nov 18, 2025
Python

moonwatcher-ai / moonwatcher

Evaluation & testing framework for computer vision models

computer-vision ai-safety ethical-artificial-intelligence ai-security mlops ml-safety ml-validation trustworthy-ai ml-testing

Updated Jun 20, 2024
Python

Doleus / doleus

Build confidence in your AI with systematic slice-based testing

python machine-learning quality-control computer-vision pytorch slice quality-assurance fairness ai-safety ethical-artificial-intelligence ai-security mlops ml-safety ml-validation trustworthy-ai ml-testing torchmetrics eu-ai-act

Updated Dec 16, 2025
Python

Phinchanbora / llm-evaluation

🎯 Benchmark LLMs effectively with over 10 tests and 108,000 real questions to assess model performance and enhance AI evaluation.

deep-learning evaluation llama gpt evaluation-metrics ai-security ml-validation red-team-tools trustworthy-ai ml-testing large-language-models llm chatgpt llm-security llm-eval llm-evaluation rag-evaluation llm-evaluation-framework

Updated Jan 11, 2026
Python

Improve this page

Add a description, image, and links to the ml-validation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ml-validation topic, visit your repo's landing page and select "manage topics."