Published on December 25, 2025
Hybrid Context Compaction: Managing Token Growth in Agentic Loops
Tags: llm, context-engineering, token-efficiency, ai
How adding observation masking to our existing LLM summarization reduced token costs, improved response latency, and eased rate limit pressure. Adapted from JetBrains research.
Published on October 26, 2025
Compressing LLM Context Windows: Efficient Data Formats and Context Management
Tags: llm, context-compression, token-efficiency, context-engineering
Explore techniques to reduce token usage in LLM contexts by using compact data formats, summarization, and intelligent retrieval.