Reduce effective AI batch size so streaming requests finish before timeout. Keep token-based batching but cap each request to 80 messages or about 9k content tokens, and recursively split failed batches instead of marking the whole batch failed. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>