
feat: naive token estimation via tiktoken#2031

Open
lizradway wants to merge 3 commits into strands-agents:main from lizradway:token-estimation

Conversation

@lizradway
Member

@lizradway lizradway commented Apr 1, 2026

Description

  • Adds _estimate_tokens() method to the Model base class for estimating input token count before sending to the model, enabling proactive context management (e.g., triggering compression at a threshold)
  • Uses tiktoken (cl100k_base encoding) as a universal fallback for all 11 providers — individual providers can override with native counting APIs later
  • Handles all content block types: text, toolUse, toolResult, reasoningContent, guardContent, citationsContent, system_prompt_content; gracefully skips non-serializable content (e.g., image bytes) while still counting serializable parts. Non-serializable content can be covered in follow up model-native count tokens API.
  • Adds tiktoken as an optional dependency
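The per-block counting described above can be sketched as follows. This is an illustrative sketch based on the description, not the actual diff: the helper name and block shapes are assumptions, and `encoding` is any object with an `encode(str)` method (such as tiktoken's cl100k_base encoding).

```python
import json


def count_content_block_tokens(block: dict, encoding) -> int:
    """Naively count tokens in one content block, skipping non-serializable parts.

    `encoding` only needs an encode(str) -> sequence method; in the PR this
    would be tiktoken's cl100k_base encoding. Block shapes are illustrative.
    """
    total = 0
    if "text" in block:
        total += len(encoding.encode(block["text"]))
    elif "toolUse" in block:
        tool_use = block["toolUse"]
        total += len(encoding.encode(tool_use.get("name", "")))
        try:
            # Tool input is counted via its JSON serialization.
            total += len(encoding.encode(json.dumps(tool_use.get("input", {}))))
        except (TypeError, ValueError):
            pass  # non-serializable input (e.g. binary data) is skipped
    elif "toolResult" in block:
        # Iterate text items only; image/binary items are skipped.
        for item in block["toolResult"].get("content", []):
            if "text" in item:
                total += len(encoding.encode(item["text"]))
    return total
```

A caller would sum this over every block in every message to get the naive input estimate.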

Related Issues

#1294

Documentation PR

This should be internally facing, documentation not required

Type of Change

New feature

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

  • I ran hatch run prepare

Checklist

  • I have read the CONTRIBUTING document
  • I have added any necessary tests that prove my fix is effective or my feature works
  • I have updated the documentation accordingly
  • I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
  • My changes generate no new warnings
  • Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.


@github-actions

github-actions bot commented Apr 1, 2026

Review Summary (Round 2)

Assessment: Approve ✅

All previously identified blocking issues have been addressed in this update.

Changes Verified

  • API review label required: ✅ Method renamed to private _estimate_tokens() - no API review needed
  • Documentation PR required: ✅ Private method is correctly internal-facing
  • Encoding caching suggestion: ✅ Implemented _get_encoding() with _cached_encoding
  • Docstring expansion: ✅ Now includes limitations and override guidance
  • Missing guardContent test: ✅ Added test_estimate_tokens_guard_content_block

Additional Improvements
  • Added edge case tests for non-serializable tool specs
  • Added test for toolUse with binary data
  • Comprehensive test coverage for all content block types

Note: Codecov shows 92.85% patch coverage with 4 partial lines - these are likely error handling branches (except clauses) which are acceptable.

Good implementation with clean code structure, appropriate error handling, and thorough test coverage. 👍

@lizradway lizradway marked this pull request as ready for review April 1, 2026 15:34
@lizradway lizradway added the area-context Session or context related label Apr 1, 2026
@lizradway lizradway marked this pull request as draft April 3, 2026 17:02
@github-actions github-actions bot added size/m and removed size/m labels Apr 3, 2026
@lizradway lizradway temporarily deployed to manual-approval April 3, 2026 19:01 — with GitHub Actions Inactive
@lizradway lizradway marked this pull request as ready for review April 3, 2026 19:08
@lizradway lizradway temporarily deployed to manual-approval April 3, 2026 19:08 — with GitHub Actions Inactive
@opieter-aws
Contributor

/strands review

@github-actions

github-actions bot commented Apr 8, 2026

Review Summary (Round 3)

Assessment: Approve ✅

Significant improvements since Round 2 — tiktoken is now an optional dependency with lazy import, system_prompt_content support was added, and token counting is more granular and resilient.

Details
  • Optional dependency: Good decision to make tiktoken optional with a clear ImportError message guiding users to pip install strands-agents[token-estimation].
  • system_prompt_content: The priority logic (structured content over plain string) correctly avoids double-counting.
  • Improved counting: toolUse now counts name + input separately, toolResult iterates text items only — both more accurate and resilient to non-serializable content.
  • Test quality: Exact token count assertions make tests deterministic and the test_get_encoding_raises_without_tiktoken monkeypatch test validates the optional import path.
  • Test dependency fragility (non-blocking): token-estimation is not in the all extra — tests rely on tiktoken being a transitive dep of litellm. See inline comment.

All previous blocking issues remain resolved. Clean implementation with thorough test coverage.

if "guardContent" in block:
    guard = block["guardContent"]
    if "text" in guard:
        total += len(encoding.encode(guard["text"]["text"]))
Contributor

If the inner "text" key is missing from guard["text"], this would throw. Do we want to add an extra check here to be more defensive, like the other handlers?
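A defensive variant along the lines suggested here might look like the sketch below (illustrative, not the PR's code; the `.get()` chain plus an isinstance check avoids the KeyError):

```python
def count_guard_content(block: dict, encoding) -> int:
    """Count guardContent text tokens, returning 0 on any missing/malformed key."""
    guard = block.get("guardContent", {})
    text = guard.get("text", {})
    # Only encode when the nested "text" key is actually present.
    if isinstance(text, dict) and "text" in text:
        return len(encoding.encode(text["text"]))
    return 0
```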

global _cached_encoding
if _cached_encoding is None:
    try:
        import tiktoken
Contributor

I get caching, but why do we keep importing inside the method? Is this intentionally lazy loading?

Contributor

Maybe token estimation should be its own file?

Contributor

This is intentional, since tiktoken is an optional dependency
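The lazy-import-plus-caching pattern being discussed can be sketched as follows. The function name, error message, and extras name come from the review comments above; details may differ from the actual diff.

```python
_cached_encoding = None  # module-level cache, populated on first use


def get_encoding():
    """Return a cached cl100k_base encoding, importing tiktoken lazily.

    The import lives inside the function so the base package works without
    the optional dependency; only code paths that actually estimate tokens
    pay the import cost or require tiktoken to be installed.
    """
    global _cached_encoding
    if _cached_encoding is None:
        try:
            import tiktoken  # optional dependency, loaded on first call
        except ImportError as exc:
            raise ImportError(
                "Token estimation requires tiktoken. Install the optional "
                "extra, e.g. pip install 'strands-agents[token-estimation]'."
            ) from exc
        _cached_encoding = tiktoken.get_encoding("cl100k_base")
    return _cached_encoding
```

Subsequent calls return the cached encoding object, so the import and encoding construction happen at most once per process.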


Used for proactive context management (e.g., triggering compression at a
threshold). This is a naive approximation using tiktoken's cl100k_base encoding.
Accuracy varies by model provider but is typically within 5-10% for most providers.
Contributor

Is this AI garbage or an actual claim?

Contributor

This research shows a comparison where they found a Mean Absolute Percentage Error range of 6.5-11.7% when using tiktoken's cl100k_base


for message in messages:
    for block in message["content"]:
        total += _count_content_block_tokens(block, encoding)
Contributor

nit, one trick we can do to improve accuracy: instead of estimating the token count for the whole messages array, keep track of the latest consumed token count and estimate only the newest message.

Then the error margin for the history is 0% (because we literally know its token count), and the only error is in the latest added message.

Contributor

I like this suggestion. There's separate work going on to expose the latest token count, which makes this possible. Once that's set up, we can implement this as a follow-up optimization.
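The incremental idea above can be sketched with a hypothetical helper class, assuming the provider's exact input token count is available after each response (as the follow-up work would expose):

```python
class IncrementalTokenEstimator:
    """Exact count for consumed history plus a naive estimate of the newest message."""

    def __init__(self, estimate_fn):
        self._estimate = estimate_fn  # naive per-message estimator (e.g. tiktoken-based)
        self._consumed = 0            # exact input tokens reported by the provider

    def record_usage(self, input_tokens: int) -> None:
        # Called after each model response with the provider's reported count,
        # so the history portion carries 0% error.
        self._consumed = input_tokens

    def estimate_total(self, new_message_text: str) -> int:
        # Only the newest, not-yet-sent message is approximated.
        return self._consumed + self._estimate(new_message_text)
```

With this shape, the approximation error is confined to a single message instead of accumulating over the whole conversation.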


Labels

area-context Session or context related size/m

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants