Skip to content
#

token-optimization

Here are 241 public repositories matching this topic...

Cut your Claude / OpenAI / Gemini bill 70–95% on AI coding. Local proxy that compresses context, keeps provider caches hot, and verifies LLM output ($0 hallucination guard). Drop-in for Cursor, Claude Code, Codex, Aider + 34 more and custom providers — 30s, no code changes

  • Updated Jun 17, 2026
  • Python
prompt-refiner

🚀 Lightweight Python library for building production LLM applications with smart context management and automatic token optimization. Save 10-20% on API costs while fitting RAG docs, chat history, and prompts into your token budget.

  • Updated Apr 12, 2026
  • Python
token-goat

Token burn reducer and focus keeper for Claude Code, Codex, Gemini CLI, Cline, Windsurf, Aider, Cursor, Copilot, pi, and more: session-aware read hints, 130+ bash output filters, compact manifest injection, image shrinking, prompt injection protection, and much more.

  • Updated Jun 17, 2026
  • Python

Improve this page

Add a description, image, and links to the token-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-optimization topic, visit your repo's landing page and select "manage topics."

Learn more