
Arm backend: Add experimental support for new TOSAQuantizer #18100

Open
AdrianLundell wants to merge 4 commits into pytorch:main from AdrianLundell:change-1183485

Conversation


@AdrianLundell AdrianLundell commented Mar 11, 2026

Allows initializing TOSA/EthosU/Vgf quantizers with use_composable_quantizer=True to use a new implementation of the quantizer following the Cortex-M backend. See
#17701 for more details.

  • Creates a new temporary TOSAQuantizer API layer for switching between the two versions.
  • Adds a TOSAQuantizationConfig encapsulating TOSA-specific qspec requirements for certain ops.
  • Adds quantizer_support.py for defining which operators are supported by the quantizer.
  • Aligns mark_node_as_annotated in the cortex-m backend with TOSAQuantizer behaviour.
  • Updates the quantizer reporter to handle TOSA qspecs as they are dynamically created.

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell
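A minimal sketch of the "temporary API layer for switching between the two versions" described in the first bullet: a thin factory dispatching between the legacy quantizer and the new composable one based on a constructor flag. All class and function names here are illustrative stand-ins, not the actual executorch API.

```python
class LegacyTOSAQuantizer:
    """Stand-in for the existing annotation-based TOSA quantizer."""

    def annotate(self, model: str) -> str:
        return f"legacy-annotated({model})"


class ComposableTOSAQuantizer:
    """Stand-in for the new Cortex-M-style composable quantizer."""

    def annotate(self, model: str) -> str:
        return f"composable-annotated({model})"


def make_tosa_quantizer(use_composable_quantizer: bool = False):
    """Temporary API layer: one entry point, two implementations.

    The flag defaults to the legacy path so existing callers are
    unaffected; opting in selects the new composable implementation.
    """
    if use_composable_quantizer:
        return ComposableTOSAQuantizer()
    return LegacyTOSAQuantizer()


# Opting in to the experimental implementation:
quantizer = make_tosa_quantizer(use_composable_quantizer=True)
```

Keeping both paths behind one constructor makes the eventual removal of the legacy implementation a local change rather than a call-site migration.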

@AdrianLundell AdrianLundell added partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm ciflow/trunk release notes: arm Changes to the ARM backend delegate labels Mar 11, 2026

pytorch-bot bot commented Mar 11, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18100

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 5c02aa7 with merge base 096f10c:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 11, 2026
@zingo zingo added this to the 1.2.0 milestone Mar 11, 2026

zingo commented Mar 11, 2026

Hi @SS-JIA / @digantdesai this adds a file, do you want/need to check this?
This is also something we would like to get into 1.2 if possible.

@AdrianLundell

The failing jobs are unrelated.


# Lazily import heavy quantizer classes to avoid circular imports with
# Cortex-M quantization configs.
_LAZY_EXPORTS = {

_LAZY_IMPORTS?

(torch.ops.aten.sqrt.default,),
(torch.ops.aten.silu.default,),
(torch.ops.aten.silu_.default,),
(torch.ops.aten.logit.default,),

torch.ops.aten.logit.default listed twice (also at line 126).

if qspec.is_dynamic != key.is_dynamic:
continue
return val
return "UNREGISTRED_QSPEC"
Typo: UNREGISTERED_QSPEC

targeted_nodes_description = str(self.node_finder)
quantization_config_path = SUPPORTED_QCONFIGS.get(
self.quantization_config, "CUSTOM_QCONFIG"
self.quantization_config, "UNREGISTRED_QCONFIG"
Typo: "UNREGISTERED_QCONFIG"

continue
if qspec.is_dynamic != key.is_dynamic:
continue
return val
Should we add a check for qspec.ch_axis?
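A sketch of what the suggested check could look like: comparing `ch_axis` alongside the other fields so per-channel and per-tensor specs with otherwise identical settings do not match the same registry entry. The field names loosely mirror a quantization spec, but the types here are simplified stand-ins, not the actual torchao classes.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class QSpec:
    """Simplified stand-in for a quantization spec."""
    dtype: str
    is_dynamic: bool
    ch_axis: Optional[int] = None  # None => per-tensor quantization


# Illustrative registry of known qspecs -> display names.
REGISTERED = {
    QSpec("int8", False, None): "INT8_PER_TENSOR",
    QSpec("int8", False, 0): "INT8_PER_CHANNEL",
}


def lookup(qspec: QSpec) -> str:
    for key, val in REGISTERED.items():
        if qspec.dtype != key.dtype:
            continue
        if qspec.is_dynamic != key.is_dynamic:
            continue
        if qspec.ch_axis != key.ch_axis:  # the extra check suggested above
            continue
        return val
    return "UNREGISTERED_QSPEC"
```

Without the `ch_axis` comparison, a per-channel int8 spec would match whichever int8 entry the loop reaches first.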

"//executorch/backends/arm:constants",
"//executorch/backends/arm:ethosu",
"//executorch/backends/arm:vgf",
"//executorch/backends/cortex_m/quantizer:quantizer",
Pulling this in exposed that cortex_m/quantizer:quantizer is broken on main. Here's a summary of what needs to change:

  backends/cortex_m/quantizer/TARGETS — add missing srcs and deps:
  python_library(
      name = "quantizer",
      srcs = [
          "__init__.py",
          "node_finders.py",       # MISSING
          "operator_configs.py", # NEEDS REMOVING
          "pattern_checkers.py",   # MISSING
          "pattern_matcher.py",    # MISSING
          "quantization_configs.py",
          "quantizer.py",
          "quantizer_reporter.py", # MISSING
          "quantizer_support.py",  # MISSING
      ],
      deps = [
          "//caffe2:torch",
          "//executorch/backends/arm:common",                        # MISSING
          "//executorch/backends/arm:constants",                     # MISSING
          "//executorch/backends/arm/quantizer:quantization_annotator",  # MISSING
          "//executorch/backends/arm/quantizer:quantization_config",
          "//pytorch/ao:torchao",
          "fbsource//third-party/pypi/tabulate:tabulate",           # MISSING
      ],
  )

I'll put together a separate PR, but feel free to incorporate into this one to unblock.

