Improve commutation checking of Pauli product rotations and measurements by alexanderivrii · Pull Request #15815 · Qiskit/qiskit

alexanderivrii · 2026-03-16T09:44:49Z

Summary

This PR improves commutation checking of pairs of Pauli-based objects, that is of PauliProductRotationGate and PauliProductMeasurement. Without this PR, we construct the generators for operations for PPRs and PPMs as SparseObservables and then check if the two SparseObservables commute. With this PR, we first instead construct the generatoors as Paulis (represented using Z and X components) and check if two Paulis commute. The latter check is quite a bit faster than the former for large gates.

~~Based on top of #15810.~~

Details and comments

I run this on the 100 representative HamLib benchmarks from benchpress using the following script

ham_records = json.load(open("./100_representative.json", "r"))

for i, h in enumerate(ham_records):
    nq = h.pop("ham_qubits")
    terms = h.pop("ham_hamlib_hamiltonian_terms")
    coefficients = h.pop("ham_hamlib_hamiltonian_coefficients")

    # Construct circuit from PPRs
    qc = QuantumCircuit(nq)
    for t, c in zip(terms, coefficients):
        ppr = PauliProductRotationGate(Pauli(t), c)
        qc.append(ppr, range(nq))

    # Convert to DAG
    dag = circuit_to_dag(qc)

    # Run commutative optimization on DAG, measuring the time
    time_start = time.perf_counter()
    dagt = CommutativeOptimization().run(dag)
    time_end = time.perf_counter()
    print(f"Test {i}: {time_end-time_start}")

Here is the plot showing improvement in runtime (where commutation checker is used within CommutativeOptimization) .

Note that CommutativeOptimization does not actually improve the quality of any of these HamLib benchmarks, however it tends to make a huge number of commutativity checks.

Here is an additional experiment is in the spirit of our FT compiler pipeline (in which case CommutativeOptimization can remove/merge many gates).

qc = qft_circuit(1000)
qc = transpile(qc, basis_gates=get_clifford_gate_names()+["rz"])
qc = LitinskiTransformation(fix_clifford=False, use_ppr=True)(qc)
time_start = time.perf_counter()
qc = CommutativeOptimization()(qc)
time_end = time.perf_counter()
print(f"Time: {time_end-time_start:.4f}")

Without this PR, the average time for CommutativeOptimization is 3.46 seconds, with this PR it's 0.29.

LLM tools used

Used copilot to suggest various micro-optimization and alternative implementations, but in the end the implementation and all possible bugs are purely mine.

Additional optimization (as part of `CommutativeOptimization`)

In CommutativeOptimization pass we now sort qargs for PauliProductRotationGate and PauliProductMeasurement by qubit index. This allows an even more efficient implementation using the standard method to compute intersection of two sorted vectors.

For pairs of PPRs/PPMs we can construct generators as Paulis rather than SparseObservables, and we can check commutativity by checking the commutativity of Paulis.

qiskit-bot · 2026-03-16T09:44:54Z

One or more of the following people are relevant to this code:

@Cryoris
@Qiskit/terra-core
@ajavadia

Cryoris · 2026-03-31T14:21:29Z

+            let max_q1 = qargs1.iter().map(|q| q.index()).max().unwrap_or(0);
+            let mut in_q1 = vec![usize::MAX; max_q1 + 1];
+            for (i, &q) in qargs1.iter().enumerate() {
+                in_q1[q.index()] = i;
+            }


I think we discussed this offline but I don't remember the answer: would it be faster to sort both qargs (log timing) and then iterate, rather than finding the max (linear)?

I tried this now and it became 30% slower, see #15815 for how the experiments were run.

I agree that for robustness it's best to avoid linear scaling with the total number of qubits in the circuit, so I changed the implementation to use a HashMap instead of a vector. For large Pauli strings over all of circuit qubits (as in the HamLib experiment mentioned in the summary) this makes the implementation about 5% slower, however for short Pauli strings with large qubit indices this improves the implementation. I also moved the optional reversing of the operations to be done earlier, so that we are filling the hashmap with the smaller number of qubits. See 06cbb55 and f4a9df6.

alexanderivrii · 2026-03-31T16:04:45Z

An update: 087362f implements the "sorted vectors" intersection in addition to "unsorted vectors" intersection. Since CommutativeOptimization now canonicalizes PPRs and PPMs by default (canonicalization for PPMs is added in de13108 in this PR), it uses "sorted vectors" intersection.

Running the experiment from the code snippet in this PRs summary (that is, running CommutativeOptimization on our 100 representative HamLib examples) further improves the time by about 25% (in addition to the improvement mentioned in the summary).

On the other hand, skipping canonicalization for PPRs/PPMs in CommutativeOptimization and sorting them inside commutation checker (as suggested in #15815) makes the total runtime slower by about 30%.

Note that CommutativeOptimization may check commutation of the same PPR with many other PPRs, so what these experiments show is that it's better to sort each PPR once and then use "sorted vectors" intersection, rather than sort each PPR every time it's used.

This reverts commit 087362f.

Cryoris

The changes look good to me, but the test coverage is a bit thin. Could we (a) add tests checking PPM commutation with Gucci and (b) scramble the indices of the existing PPR tests in Gucci a bit more (right now there's only 1 index swap)?

* Improved tests for commutations between different types of pauli-based gates * Added tests with varying qubit indices

alexanderivrii · 2026-04-15T06:32:10Z

Marking this "on hold" because we need to fix commutation of pauli product measurement instructions with the same clbit (see #16023).

alexanderivrii · 2026-04-16T12:42:57Z

Following the bugfix in #16023, I have updated the commutation checker to efficiently check the commutation of two PPMs writing to the same clbit (and removed the temporary function).

I have also updated the commutative optimization pass to only canonicalize but not try to merge PPM gates (in particular it does not need to worry about commutativity of two PPM gates).

@Cryoris, this is ready for review now.

coveralls · 2026-04-16T13:05:13Z

Coverage Report for CI Build 24510449984

Warning

Build has drifted: This PR's base is out of sync with its target branch, so coverage data may include unrelated changes.
Quick fix: rebase this PR. Learn more →

Coverage increased (+0.01%) to 87.49%

Details

Coverage increased (+0.01%) from the base build.
Patch coverage: 4 uncovered changes across 1 file (115 of 119 lines covered, 96.64%).
18 coverage regressions across 5 files.

Uncovered Changes

File	Changed	Covered	%
crates/transpiler/src/passes/commutative_optimization.rs	63	59	93.65%

Coverage Regressions

18 previously-covered lines in 5 files lost coverage.

File	Lines Losing Coverage	Coverage
crates/circuit/src/parameter/symbol_expr.rs	6	73.93%
crates/qasm2/src/parse.rs	6	97.63%
crates/qasm2/src/lex.rs	4	91.77%
crates/circuit/src/parameter/parameter_expression.rs	1	90.53%
crates/transpiler/src/commutation_checker.rs	1	88.27%

Coverage Stats


Relevant Lines:	119596
Covered Lines:	104634
Line Coverage:	87.49%
Coverage Strength:	979624.38 hits per line

💛 - Coveralls

Cryoris · 2026-04-16T15:47:51Z

+    // To check commutation of two Pauli-based gates, we extract their Pauli generators
+    // and check whether they commute.
+    // Note that we have previously removed all PPRs equivalent to identity up to a global
+    // phase, so this is both a necessary and a sufficient condition.


Suggested change

// To check commutation of two Pauli-based gates, we extract their Pauli generators

// and check whether they commute.

// Note that we have previously removed all PPRs equivalent to identity up to a global

// phase, so this is both a necessary and a sufficient condition.

// To check commutation of two Pauli-based gates, we extract their Pauli generators

// and check whether they commute. This is not done through the commutation checker,

// since we here know that the Pauli strings are sorted by qubit index already, which

// allows for a more efficient check.

//

// Note that we have previously removed all PPRs equivalent to identity up to a global

// phase, so this is both a necessary and a sufficient condition.

For additional clarity, I have moved the Pauli-based commutation code to a separate function, and improved comments/docstrings, see fb03d53.

* moving commutation of pauli-based gates to a separate function * clarifying comments

Cryoris · 2026-04-21T09:53:28Z

+    //   As a result, commutation of Pauli generators is both a necessary and sufficient condition.
+    let (z1, x1, z2, x2) = match (op1, op2) {
+        (OperationRef::PauliProductMeasurement(_), _) => {
+            unreachable!(


unreachable is for unreachable statements in the compiled path -- but this is well reachable if someone calls this function with PPMs. It would be nice to make this safer.

We could for example change the type of op1 to be a &PauliProductRotation which avoids the fallible case and gives the caller the responsibility of giving the right object

This line is truly unreachable, after the change in a685517. Since we can't merge PPMs with anything, we no longer try to commute it with other gates, meaning that the first gate passed to commute is not a PPM. I have also added explicit tests for circuits with multiple PPMs.

Actually, are there any other gates that can't be merged/canceled with anything?

Further improved the unreachability message in a67a060.

Cryoris · 2026-04-23T11:39:05Z

    let tol = 1e-12_f64.max(1. - approximation_degree);
+    let error_cutoff_fn = |_inst: &PackedInstruction| -> f64 { tol };


I'm still not thrilled about removing identities here, especially if we might be using a different default tolerance than the RemoveIdentityEquiv pass. The other inconsistent case is if we have a Target and RemoveIdentityEquiv takes it into account but CommutativeOptimization doesn't.

In #16070 we should add this Target support. Until then, can we at least use the same constant from the remove_identity_equiv.rs file as tolerance, to ensure we query it from the same place?

Done in d13a51a. I now think that we should have a single place of truth to define MINIMUM_TOL for all passes (currently every pass redefines the same constant).

Cryoris · 2026-04-23T11:39:44Z

+    circuits containing :class:`.PauliProductRotationGate` and :class:`.PauliProductMeasurement`
+    objects.
+  - |
+    The :class:`.CommutativeOptimization` transpiler pass now removes gates that are quivalent to


Suggested change

The :class:`.CommutativeOptimization` transpiler pass now removes gates that are quivalent to

The :class:`.CommutativeOptimization` transpiler pass now removes gates that are equivalent to

Oops, I can't type a sentence without a typo -- fixed in d13a51a. (And apparently, I can't also type a commit message with typos.)

Cryoris

Thanks for the improvements, Sasha!

alexanderivrii added 5 commits March 15, 2026 11:40

Coercing integer types to floats when appending a PPR to a circuit

5d2d7d0

Extending RemoveIdentityEquiv and CommutativeOptimization to handle PPRs

dff5668

reno

49960e1

Improve commutation checking of PPRs and PPMs

0dde194

For pairs of PPRs/PPMs we can construct generators as Paulis rather than SparseObservables, and we can check commutativity by checking the commutativity of Paulis.

reno

4cd1ff4

alexanderivrii requested a review from a team as a code owner March 16, 2026 09:44

alexanderivrii added performance Changelog: Added Add an "Added" entry in the GitHub Release changelog. labels Mar 16, 2026

alexanderivrii added this to the 2.5.0 milestone Mar 16, 2026

alexanderivrii added this to Qiskit 2.5 Mar 16, 2026

github-project-automation Bot moved this to Ready in Qiskit 2.5 Mar 16, 2026

jan-an self-requested a review March 23, 2026 09:39

alexanderivrii mentioned this pull request Mar 23, 2026

Extend RemoveIdentityEquivalent and CommutativeOptimization with PauliProductRotations #15810

Merged

ShellyGarion added the fault tolerance related to fault tolerance compilation label Mar 26, 2026

Merge branch 'main' into opt_ppr_ppm_in_cc

789e1c6

Cryoris reviewed Mar 31, 2026

View reviewed changes

alexanderivrii added 2 commits March 31, 2026 17:22

canonicalizing PPM instruction as well

de13108

faster method based on comparing two sorted vectors

087362f

alexanderivrii requested review from a team and Cryoris March 31, 2026 16:08

Cryoris reviewed Apr 1, 2026

View reviewed changes

Comment thread crates/transpiler/src/commutation_checker.rs Outdated

alexanderivrii added 2 commits April 1, 2026 10:06

Revert "faster method based on comparing two sorted vectors"

08f9fa5

This reverts commit 087362f.

moving the efficient check to commutative optimization

1beefa4

Cryoris reviewed Apr 1, 2026

View reviewed changes

alexanderivrii added 3 commits April 3, 2026 11:05

merge with main

e45c48a

More tests for commutations of pauli-based gates

6cbce16

* Improved tests for commutations between different types of pauli-based gates * Added tests with varying qubit indices

Adding CommutativeOptimization tests

b654130

alexanderivrii requested a review from Cryoris April 3, 2026 09:22

fix to actually simplify the circuit + tests

55e93d1

alexanderivrii mentioned this pull request Apr 15, 2026

Fixed commutation checking between two Pauli product measurements #16023

Merged

3 tasks

alexanderivrii added the on hold Can not fix yet label Apr 15, 2026

Cryoris moved this from Ready to In review in Qiskit 2.5 Apr 15, 2026

mergify Bot mentioned this pull request Apr 15, 2026

Fixed commutation checking between two Pauli product measurements (backport #16023) #16040

Merged

3 tasks

alexanderivrii added 3 commits April 16, 2026 14:11

merge with main + a few fixes

c90835c

improve commutation checker

b3e6456

updating commutative optimization for PPMs

a685517

alexanderivrii removed the on hold Can not fix yet label Apr 16, 2026

Cryoris reviewed Apr 17, 2026

View reviewed changes

alexanderivrii added 4 commits April 17, 2026 11:31

merge with main

9f7a432

code cleanup

fb03d53

* moving commutation of pauli-based gates to a separate function * clarifying comments

typos and comments, following review

c2c3ca5

combining tests

a669772

alexanderivrii requested a review from Cryoris April 17, 2026 19:25

Cryoris reviewed Apr 21, 2026

View reviewed changes

Addressing review comments

a67a060

alexanderivrii requested a review from Cryoris April 23, 2026 06:10

Cryoris reviewed Apr 23, 2026

View reviewed changes

ore review comments

d13a51a

Cryoris approved these changes Apr 23, 2026

View reviewed changes

Cryoris enabled auto-merge April 23, 2026 12:02

Cryoris added this pull request to the merge queue Apr 23, 2026

Merged via the queue into Qiskit:main with commit e8f058e Apr 23, 2026
26 checks passed

github-project-automation Bot moved this from In review to Done in Qiskit 2.5 Apr 23, 2026

		let tol = 1e-12_f64.max(1. - approximation_degree);
		let error_cutoff_fn = \|_inst: &PackedInstruction\| -> f64 { tol };

	The :class:`.CommutativeOptimization` transpiler pass now removes gates that are quivalent to
	The :class:`.CommutativeOptimization` transpiler pass now removes gates that are equivalent to

Conversation

alexanderivrii commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Details and comments

LLM tools used

Additional optimization (as part of CommutativeOptimization)

Uh oh!

qiskit-bot commented Mar 16, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alexanderivrii Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alexanderivrii commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Cryoris left a comment

Choose a reason for hiding this comment

Uh oh!

alexanderivrii commented Apr 15, 2026

Uh oh!

alexanderivrii commented Apr 16, 2026

Uh oh!

coveralls commented Apr 16, 2026

Coverage Report for CI Build 24510449984

Coverage increased (+0.01%) to 87.49%

Details

Uncovered Changes

Coverage Regressions

Coverage Stats

💛 - Coveralls

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alexanderivrii Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alexanderivrii Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Cryoris left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

alexanderivrii commented Mar 16, 2026 •

edited

Loading

Additional optimization (as part of `CommutativeOptimization`)

alexanderivrii Apr 4, 2026 •

edited

Loading

alexanderivrii commented Mar 31, 2026 •

edited

Loading

alexanderivrii Apr 21, 2026 •

edited

Loading

alexanderivrii Apr 23, 2026 •

edited

Loading