Загрузка...

Cluaiz Engine: 66.8 TPS with Bonsai 4B (1.58bit) on RTX 3050 #aiinference #bitnet #gguf #cluaiz #ai

Cluaiz is ai model runtime engine, It running ai modles on your phone Computer Server, it using you can run ai unlimited free, it's written rust so that have very efficient netively run on divice.
Website: https://cluaiz.com
Repo: ❤https://github.com/cluaiz/cluaiz.

Got tired of standard Python/Docker wrappers choking on my 4GB GPU, so I built a bare-metal inference engine.
Watch the Cluaiz kernel execute the Bonsai 4B (BitNet 1.58bit) model at a blazing 66.8 Tokens/Second on a budget RTX 3050 (4GB VRAM). Built with Rust and C++ for direct-to-silicon execution, zero efficiency tax, and dynamic KV-cache management. True offline, high-speed local AI for consumer hardware.
⚙️ Hardware: RTX 3050 (4GB VRAM)
🧠 Model: prism-ml/Bonsai-4B-gguf (1.58-bit)
⚡ Engine: Cluaiz (Rust/C++)
Drop a comment if your local LLM setup is giving you OOM errors!
GitHub: https://github.com/cluaiz/cluaiz
website cluaiz.com
#LocalLLaMA #BitNet #RTX3050 #RustLang #ArtificialIntelligence #MachineLearning #Ollama #vLLM #OpenSource #Cluaiz #GGUF #AIInference #Tech

Видео Cluaiz Engine: 66.8 TPS with Bonsai 4B (1.58bit) on RTX 3050 #aiinference #bitnet #gguf #cluaiz #ai канала cluaiz
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять