Transformers' Math Struggles and Smarter Context Compression for AI Agents

1. Why Can’t Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls

2. ACON: Optimizing Context Compression for Long-horizon LLM Agents