Metal backend: compute init/execute times #16639

manuelcandales · 2026-01-15T21:54:05Z

This pull request introduces detailed performance tracking and reporting for the Metal backend, focusing on timing statistics for both initialization and execution phases. It adds infrastructure to collect, reset, and print timing data, and integrates statistics output into the Parakeet model example. Additionally, a preprocessor macro is defined to signal Metal backend availability.

Metal Backend Performance Statistics:

Added timing statistics collection for init() and execute() calls in the Metal backend, including total time, call count, and per-method breakdowns. Accessor and reset functions are provided in stats.h and implemented in metal_backend.cpp.
Introduced a new stats.cpp file with a function to print all collected Metal backend statistics, including per-method breakdowns for both initialization and execution.

Build and Integration Improvements:

Added runtime/stats.cpp to the Metal backend build sources and defined the ET_BUILD_METAL preprocessor macro to indicate Metal backend support.

Example Model Enhancements:

Updated the Parakeet example (main.cpp) to show Metal backend timing stats after model execution if Metal is enabled.

These changes provide better visibility into Metal backend performance and make it easier to profile and optimize model execution on Apple devices.

pytorch-bot · 2026-01-15T21:54:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16639

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 2 Unrelated Failures

As of commit 3cfefbf with merge base d58c8ee ():

NEW FAILURE - The following job has failed:

Test CUDA Builds / test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job (gh)
RuntimeError: Command docker exec -t 1ec5d087864ed692f9008e67efb951820c51bc6c8c36eb8b0d62226b7009910a /exec failed with exit code 1

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / android / run-emulator (gh) (trunk failure)
The process '/usr/bin/sh' failed with exit code 255
pull / test-samsung-models-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copilot

Pull request overview

This PR adds timing instrumentation to the Metal backend for measuring initialization and execution performance, and exposes these statistics in the Parakeet example application.

Changes:

Added timing measurement for Metal backend init() and execute() methods with per-method granularity
Created a new stats API module with accessor and print functions for timing data
Integrated Metal backend timing statistics into the Parakeet example to display performance metrics

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
backends/apple/metal/runtime/metal_backend.cpp	Added timing instrumentation using `std::chrono` to measure and track init/execute times in global variables
backends/apple/metal/runtime/stats.h	Defined public API for accessing Metal backend timing statistics
backends/apple/metal/runtime/stats.cpp	Implemented function to print formatted timing statistics to stdout
backends/apple/metal/CMakeLists.txt	Added stats.cpp to build and defined ET_BUILD_METAL preprocessor macro
examples/models/parakeet/main.cpp	Added performance statistics output section that calls Metal backend stats when built with Metal support

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

backends/apple/metal/runtime/metal_backend.cpp

backends/apple/metal/runtime/stats.h

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

backends/apple/metal/runtime/metal_backend.cpp

backends/apple/metal/runtime/stats.cpp

manuelcandales added 4 commits January 15, 2026 16:37

metal backend execute stats

dd17287

per method statistics: execute

70e5aec

init stats

a1d3c52

conditional compilation

ee2a8b5

Copilot AI review requested due to automatic review settings January 15, 2026 21:54

manuelcandales requested review from cccclai, lucylq and shoumikhin as code owners January 15, 2026 21:54

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 15, 2026

manuelcandales requested review from JacobSzwejbka and mergennachin and removed request for cccclai, Copilot, lucylq and shoumikhin January 15, 2026 21:54

Copilot started reviewing on behalf of manuelcandales January 15, 2026 21:54 View session

manuelcandales added the release notes: none Do not include this in the release notes label Jan 15, 2026

manuelcandales changed the title ~~Parakeet on Metal: compute metal backend init/execute times~~ Metal backend: compute init/execute times Jan 15, 2026

manuelcandales added 2 commits January 15, 2026 17:21

refactor to metal/runtime/stats.[h/.cpp]

9b8d336

define C++ preprocessor macro

88c36d5

Copilot AI review requested due to automatic review settings January 15, 2026 22:37

manuelcandales requested review from kirklandsign and larryliu0820 as code owners January 15, 2026 22:37

Copilot started reviewing on behalf of manuelcandales January 15, 2026 22:37 View session

Copilot AI reviewed Jan 15, 2026

View reviewed changes

backends/apple/metal/runtime/metal_backend.cpp Outdated Show resolved Hide resolved

backends/apple/metal/runtime/stats.h Show resolved Hide resolved

manuelcandales added 3 commits January 15, 2026 17:50

add mutex

a1f0384

rename reset function

69ce03e

clean up

00d2bbf

Copilot AI review requested due to automatic review settings January 15, 2026 23:03

Copilot started reviewing on behalf of manuelcandales January 15, 2026 23:04 View session

Copilot AI reviewed Jan 15, 2026

View reviewed changes

backends/apple/metal/runtime/metal_backend.cpp Outdated Show resolved Hide resolved

backends/apple/metal/runtime/stats.cpp Show resolved Hide resolved

backends/apple/metal/runtime/stats.cpp Show resolved Hide resolved

manuelcandales added 2 commits January 15, 2026 18:15

singleton pattern

e22f542

new line

3cfefbf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Metal backend: compute init/execute times #16639

Metal backend: compute init/execute times #16639

manuelcandales commented Jan 15, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jan 15, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Metal backend: compute init/execute times #16639

Are you sure you want to change the base?

Metal backend: compute init/execute times #16639

Conversation

manuelcandales commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16639

❌ 1 New Failure, 2 Unrelated Failures

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

manuelcandales commented Jan 15, 2026 •

edited

Loading

pytorch-bot bot commented Jan 15, 2026 •

edited

Loading