Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
Wrapper for simplified use of Llama2 GGUF quantized models.
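Such wrappers typically sit on top of llama-cpp-python; the sketch below shows what loading and querying a GGUF model on CPU looks like at that level. The model path, thread count, and prompt are placeholders, not details taken from the project above.

```python
# Minimal sketch of CPU inference with a GGUF-quantized Llama 2 model via llama-cpp-python.
# The model path and prompt are illustrative placeholders.
from llama_cpp import Llama

llm = Llama(
    model_path="models/llama-2-7b-chat.Q4_K_M.gguf",  # any local GGUF file
    n_ctx=2048,    # context window
    n_threads=8,   # CPU threads to use
)

out = llm("Q: What is CPU-only inference good for?\nA:", max_tokens=128, stop=["Q:"])
print(out["choices"][0]["text"])
```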
Privacy-focused RAG chatbot for network documentation. Chat with your PDFs locally using Ollama, Chroma & LangChain. CPU-only, fully offline.
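A fully offline PDF Q&A pipeline of that shape can be assembled from the named pieces roughly as follows. This is a hedged sketch rather than the project's actual code; the PDF name and the Ollama model names are illustrative assumptions.

```python
# Sketch of an offline PDF RAG pipeline: Ollama for the LLM and embeddings,
# Chroma as the vector store, LangChain for orchestration.
# "network-guide.pdf" and the model names are placeholders.
from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_community.llms import Ollama
from langchain.chains import RetrievalQA

docs = PyPDFLoader("network-guide.pdf").load()
chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100).split_documents(docs)

vectordb = Chroma.from_documents(chunks, OllamaEmbeddings(model="nomic-embed-text"))
qa = RetrievalQA.from_chain_type(llm=Ollama(model="llama2"),
                                 retriever=vectordb.as_retriever())

print(qa.invoke({"query": "Which VLAN carries management traffic?"})["result"])
```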
Self-hosted Anthropic API Compatible Inference Server with Claude Code support, Interleaved Thinking, and HuggingFace Spaces deployment
CPU-first, turn-aware local voice assistant with multiprocessing, streaming STT→LLM→TTS, and interruption-safe orchestration.
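The orchestration pattern behind such an assistant, independent of any particular STT, LLM, or TTS engine, reduces to a chain of worker processes passing turns through queues. The stage functions below are placeholders standing in for real engines, not the project's components.

```python
# Skeleton of a turn-aware STT -> LLM -> TTS pipeline using multiprocessing queues.
# The stage bodies are placeholders; a real assistant would plug in actual
# speech-to-text, LLM, and text-to-speech engines and add interruption handling.
import multiprocessing as mp

def stt_stage(audio_q: mp.Queue, text_q: mp.Queue):
    while (chunk := audio_q.get()) is not None:
        text_q.put(f"transcript of {chunk}")   # placeholder transcription
    text_q.put(None)

def llm_stage(text_q: mp.Queue, reply_q: mp.Queue):
    while (utterance := text_q.get()) is not None:
        reply_q.put(f"reply to: {utterance}")  # placeholder LLM response
    reply_q.put(None)

def tts_stage(reply_q: mp.Queue):
    while (reply := reply_q.get()) is not None:
        print("speaking:", reply)              # placeholder audio output

if __name__ == "__main__":
    audio_q, text_q, reply_q = mp.Queue(), mp.Queue(), mp.Queue()
    stages = [mp.Process(target=stt_stage, args=(audio_q, text_q)),
              mp.Process(target=llm_stage, args=(text_q, reply_q)),
              mp.Process(target=tts_stage, args=(reply_q,))]
    for p in stages:
        p.start()
    for chunk in ["chunk-1", "chunk-2"]:  # simulated microphone frames
        audio_q.put(chunk)
    audio_q.put(None)                     # end-of-turn sentinel
    for p in stages:
        p.join()
```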
🤖 AI Text Completion App built with Streamlit and Llama-3.2-1B. Generate creative text completions with an intuitive web interface. GPU & CPU optimized, easy to deploy, perfect for content creation and AI experimentation.
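An app of this kind usually amounts to a Streamlit form feeding a Hugging Face text-generation pipeline; a rough sketch follows. The model identifier and UI text are assumptions (Llama-3.2-1B is gated on Hugging Face and requires an accepted license), and this is not the app's actual source.

```python
# Rough sketch of a Streamlit text-completion app running a small Llama model on CPU.
# Model name and UI strings are illustrative placeholders.
import streamlit as st
from transformers import pipeline

@st.cache_resource  # load the model once per server process
def load_generator():
    return pipeline("text-generation", model="meta-llama/Llama-3.2-1B", device=-1)  # -1 = CPU

st.title("AI Text Completion")
prompt = st.text_area("Prompt", "Once upon a time")
if st.button("Complete"):
    gen = load_generator()
    result = gen(prompt, max_new_tokens=100, do_sample=True)[0]["generated_text"]
    st.write(result)
```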
A RAG system for chatting with local documents using Foundry and LLM models on CPU.
Personal project: a local RAG chatbot using Mistral v0.2 / TinyLlama with TF-IDF retrieval and a Streamlit interface, for CPU-optimized inference without GPU requirements.
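TF-IDF retrieval keeps the whole stack CPU-friendly because it needs no embedding model; a minimal retrieval step with scikit-learn might look like the sketch below, where the document chunks are illustrative placeholders.

```python
# Minimal TF-IDF retrieval step for a CPU-only RAG chatbot: rank document chunks
# by cosine similarity to the question, then pass the top hits to the LLM as context.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

chunks = [
    "TinyLlama is a 1.1B-parameter model that runs comfortably on CPU.",
    "Mistral 7B v0.2 supports a 32k context window.",
    "TF-IDF retrieval needs no embedding model or GPU.",
]

vectorizer = TfidfVectorizer()
chunk_vectors = vectorizer.fit_transform(chunks)

def retrieve(question: str, k: int = 2):
    scores = cosine_similarity(vectorizer.transform([question]), chunk_vectors)[0]
    return [chunks[i] for i in scores.argsort()[::-1][:k]]

print(retrieve("Which model runs without a GPU?"))
```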
FastAPI service for car damage detection and damage type classification using PyTorch
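Serving a PyTorch classifier behind FastAPI generally follows the pattern sketched below; here a stock torchvision ResNet stands in for the actual damage-detection weights, and the endpoint path and response shape are assumptions.

```python
# Generic sketch of a FastAPI endpoint serving a PyTorch image classifier on CPU.
# A pretrained torchvision ResNet stands in for the car-damage model.
import io
import torch
from fastapi import FastAPI, File, UploadFile
from PIL import Image
from torchvision import models

app = FastAPI()
weights = models.ResNet18_Weights.DEFAULT
model = models.resnet18(weights=weights).eval()
preprocess = weights.transforms()

@app.post("/predict")
async def predict(file: UploadFile = File(...)):
    image = Image.open(io.BytesIO(await file.read())).convert("RGB")
    with torch.no_grad():
        logits = model(preprocess(image).unsqueeze(0))
    return {"class_index": int(logits.argmax(dim=1))}
```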
High-performance facial landmark detection and tracking library by Deepixel. CPU-only, real-time inference using TensorFlow Lite, OpenCV, and DeepCore. Outputs 106 facial landmarks with head pose estimation and Python API support.
🚀 Advanced offline AI assistant with one-click installation. Complete privacy, no GPU required. Built by RaxCore.