cryptopoly / ChaosEngineAI Sponsor Star 25 Code Issues Pull requests Local AI workstation — discover, run, chat, benchmark, and generate images from open-weight models. DFlash/DDTree speculative decoding, TurboQuant & TriAttention cache compression strategies, MLX + llama.cpp + vLLM + MTPLX backends. desktop-app python machine-learning typescript ai image-generation mlx tauri huggingface apple-silicon openai-api cache-compression llm stable-diffusion llama-cpp vllm local-ai gguf speculative-decoding dflash Updated Jun 19, 2026 Python