最適化問題をハミルトニアンとして表現する一般的な方法はありますか？

たとえば、次の形式の最適化問題があるとします。

min_{x} f (x) g_{i} (x) \leq 0, i = 1, . . ., m h_{j} (x) = 0, j = 1, . . ., p,

$\min_x f(x) \\ g_i(x) \leq 0, i = 1, ..., m \\ h_j(x) = 0, j = 1, ..., p,$

ここで、 $f(x)$ は目的関数、 $g_i(x)$ は不等式制約、 $h_j(x)$ は等式制約です。

最近、私は断熱量子計算について読んでいました。ウィキペディアは言う：

最初に、（潜在的に複雑な）ハミルトニアンが見つかり、その基底状態が対象の問題の解法を記述します。次に、単純なハミルトニアンをもつシステムが準備され、基底状態に初期化されます。最後に、単純なハミルトニアンは、断熱的に進化して、目的の複雑なハミルトニアンになります。断熱定理により、システムは基底状態のままになるため、最後にシステムの状態は問題の解決策を記述します。断熱量子計算は、回路モデルにおける従来の量子計算と多項的に同等であることが示されています。

断熱量子計算で使用されるハミルトニアン形式で最適化問題を（たとえば、上記のように）表現する一般的な方法はありますか？

adiabatic-model optimization

— brzepkowski
ソース

どれほど正式な回答が必要かはわかりませんが、通常、ソリューションから大きく離れ、ソリューションで最小になるコスト関数を定義します。次に、このコスト関数をパウリスピン言語に変換します（このステップは明確にしたいと思いますか？）。コスト関数がスピン言語になると、ハミルトニアンになります。たとえば、バイナリ文字列を検索する場合、（I-Zi）/ 2がビットiの値を返すという事実を使用できます。もしこれがあなたの望むものなら、私が時間があれば明日書こうとすることができます

— bRost03

答えの例をいくつか示していただけますか？それは素晴らしいでしょう:)

— brzepkowski

多くの例については、arxiv.org / abs / 1302.5843（Lucas Ising 2014）を参照してください。

— Paradox

コメントで要求されているように、ここに作業例があります。本体は、特定の問題に対する $f(x)$ 最小化を扱います。下部には、制約の簡単な説明と、一般的なケースに関する簡単な説明があります。

これから加重最大カット問題を解決しましょう

比較的単純な例です
古典的に難しい
文献で比較的一般的な例です（例：https : //journals.aps.org/prl/abstract/10.1103/PhysRevLett.90.067903）
物理的なハミルトニアン（Isingスピングラス）と明確な関係がある

問題を理解するために、我々は無向グラフで始まる $n$ 頂点 $\{V\}$ の各頂点、 $v_i\in V$ 重量を有し $w_i\geq0$ と接続各エッジ $v_i$ 及び $v_j$ 重み有する $w_{ij}\geq0$ 。次に、グラフを2つに分割します。カットは直線である必要はありませんが、自己交差してはならず、エッジを2回カットすることはできません。次に、「支払い」を計算します $P$ 私たちのカットのために。ペイアウトは、カットしたエッジのウェイトの合計と、カットの片側の頂点のウェイトの合計です。 $^1$

ソース

この画像では、支払いは、エッジの $1+4+3+3+2 = 13$ に加えて、頂点の $5+6+1 = 12$ $\to P=25$ （各頂点内の数がその重みであると仮定）。最適化の問題は、特定のグラフの $P$ を最大化することです。 $^2$

これを数学的に書くために、ビット文字列の観点から考えることができます。我々は、文字列により切断を定義 $s\in\{0,1\}^n$ ここで $s_i=0\to v_i$ されていない和で計数および $s_i=1\to v_i$ れる和で計数しました。計算を少しわかりやすくするために、グラフが完全に接続されていない場合は、グラフを完全に接続し、接続されていないペアに対して $w_{ij}=0$ を設定します。 $v_i,v_j$

たとえば、上の画像をもう一度見て、頂点内の数値を、上記で想定したような重みではなく、頂点インデックスであると解釈してみましょう。次に、描かれたカットは $s=100011$ 対応します。 $s_1=s_5=s_6=1\to v_1, v_5, v_6$ はカットの「良い」側にあり、カウントされますが、 $s_2=s_3=s_4=0$ は「悪い」側ですカットの側面とカウントされません。

これにより、

P (s) = \sum_{i} s_{i} w_{i} + \sum_{i, j} s_{i} (1 - s_{j}) w_{i j}

$P(s) = \sum_i s_i w_i + \sum_{i,j} s_i(1-s_j)w_{ij}$

最初の項は、カットの「良い」側のすべての頂点の重みを数えるだけです。2番目の項は、エッジが接続する頂点がカットの反対側にある場合、エッジの重みをカウントします。それはときにのみ、エッジをカウントしますので、このないのダブルカウントに注意してください $s_i=1, s_j=0$ ときといない $s_i=0, s_j=1$ 。

したがって、ここでの最適化の問題は、を最大化する文字列 $s$ を見つけることです。ここでの考え方は、をシステムのエネルギーの尺度として、をシステムの状態として考えることです。これは、をハミルトニアンに関連付けることができることを意味します。ここで、を最大化しようとしていることを少し微妙に説明しますが、通常はハミルトニアンの基底状態を見つけることについて話します。これは問題ではありませんが、私はそれを指摘したかった-私たちは、代わりに（あなたがする場合は抗基底状態）最高エネルギーの励起状態を見たり、使用することができます $P(s)$ $P(s)$ $s$ $P(s)$ $P(s)$ $-P(s)$ エネルギー関数は通常のように基底状態で動作します。最高の励起状態で作業し、 $P$ を最大化しましょう。

その最高のエネルギー状態が次のようになるようにハミルトニアンを作成します $|s_0\rangle$ よう $P(s_0)$ 最大です。基本的に我々は有効にする $P(s)$ に、エネルギー関数を、、エネルギー事業者。注意してこれを行います我々は持っている $\hat{H}$ $|s\rangle\in\{|0\rangle,|1\rangle\}$

\frac{I - Z}{2} | s ⟩ = s | s ⟩ \to define {\hat{s}}_{i} = \frac{I - Z_{i}}{2}

$\frac{I-Z}{2}|s\rangle=s|s\rangle\to\text{ define } \hat{s}_i=\frac{I-Z_i}{2}$

ここで、 $Z_i$ はキュービット作用するパウリ $Z$ です。今、私たちは、交換することにより、当社のハミルトニアンを得ると（とし、1 で） $i$ $s$ $\hat{s}$ $I$ $P$

H = \sum_{i} {\hat{s}}_{i} w_{i} + \sum_{i, j} {\hat{s}}_{i} (I - {\hat{s}}_{j}) w_{i, j} = \sum_{i} \frac{I - Z_{i}}{2} w_{i} + \sum_{i, j} \frac{I - Z_{i}}{2} (I - \frac{I - Z_{j}}{2}) w_{i, j}

$H=\sum_i \hat{s}_i w_i + \sum_{i,j} \hat{s}_i(I-\hat{s}_j)w_{i,j}=\sum_i\frac{I-Z_i}{2} w_i + \sum_{i,j} \frac{I-Z_i}{2}\left(I-\frac{I-Z_j}{2}\right)w_{i,j}$

これは、拡大すると見て、クリーンアップすることができ $\sum_{i,j}(Z_i-Z_j)=0\to$

H = \sum_{i} \frac{w_{i}}{2} (I - Z_{i}) + \sum_{i, j} \frac{w_{i j}}{4} (I - Z_{i} Z_{j}) = \sum_{i} \frac{w_{i}}{2} (I - Z_{i}) + \sum_{i < j} \frac{w_{i j}}{2} (I - Z_{i} Z_{j})

$H=\sum_i \frac{w_i}{2}\left(I-Z_i\right) + \sum_{i,j} \frac{w_{ij}}{4}\left(I-Z_iZ_j\right)=\sum_i \frac{w_i}{2}\left(I-Z_i\right) + \sum_{i<j} \frac{w_{ij}}{2}\left(I-Z_iZ_j\right)$

2を掛けて一定のエネルギーシフトを削除することで、これをさらにクリーンアップできます（ $I$ 項を削除します）。スケーリングおよびシフトされた固有値を持つ同じ固有状態を持つ新しいハミルトニアン（明らかに、最大エネルギーはこれらの変換の影響を受けません）

H = - \sum_{i} w_{i} Z_{i} - \sum_{i < j} w_{i j} Z_{i} Z_{j}

$H=-\sum_i w_iZ_i - \sum_{i<j} w_{ij}Z_iZ_j$

あなたが物性物理学者なら、おそらくこのハミルトニアンをイジングスピングラスとして認識するでしょう。問題にはあまり関係ありませんが、それはクールだと思います。

これで、ハミルトニアンが得られ、その（反）基底状態はビット文字列 $s_0$ をエンコードし、 $P(s)$ を最大化して問題を解決します。

最後に必要なのは初期ハミルトニアン $H_0$ であり、これをゆっくりと（断熱的に）最終ハミルトニアン $H$ 変換して、完全なハミルトニアン

H_{T} (t) = (1 - f (t)) H_{0} + f (t) H : f (0) = 0, f (t_{f}) = 1

$H_T(t)=(1-f(t))H_0 + f(t)H: f(0)=0, f(t_f)=1$

開始点として、 $f(t)\propto t$ は簡単にするためによく使用されます。望ましい精度とスペクトルギャップによって決定される最小の $t_f$ 。スペクトルギャップは、（反）基底状態と次のエネルギー状態の間のすべてのにわたる最小エネルギー差です。ギャップの分析は非常に重要であり（https://arxiv.org/abs/quant-ph/0509162を参照）、アルゴリズムの複雑さ/効率を決定します。ギャップが0のアルゴリズムは、まったく機能することが保証されていません。 $^3$ $t$

したがって、次のような $H_0$ が必要です。

（対）基底状態を簡単に見つけて準備できます
The spectral gap of $H$ is not exponentially small in the size of the problem

For this problem, a good initial Hamiltonian is $H_0 = \sum_i X_i$ because it's highest energy state is easy to find, it's $|+\rangle^{\otimes n}$ . It's easy to prepare, just apply $H^{\otimes n}$ to $|0\rangle^{\otimes n}$ . I don't have time to get into the analysis of the spectral gap but this Hamiltonian is unlikely to be ideal in that regard (see https://arxiv.org/abs/1701.05584).

With this choice of $H_0$ and taking $f(t)=t/t_f$ we are done. Our Hamiltonian is

H (t) = (1 - f (t)) \sum_{i} X_{i} - f (t) [\sum_{i} w_{i} Z_{i} + \sum_{i < j} w_{i j} Z_{i} Z_{j}]

$H(t) = \left(1-f(t)\right)\sum_i X_i-f(t)\left[\sum_i w_iZ_i + \sum_{i<j} w_{ij}Z_iZ_j\right]$

Starting in state $|\psi_0\rangle = H^{\otimes n}|0\rangle^{\otimes n}$ , evolving according to the above Hamiltonian for time $t_f$ (choosing a suitable $t_f$ is, again, generally highly non-trivial) then measuring in the computational basis should return (with high probability) the string $s=s_0$ which maximizes $P(s)$ .

$^1$ This is ambiguous since by symmetry either side will do. We can make this rigorous by, for example, making the cut directed then taking the vertices to the left of the cut when walking along the direction of the cut.

$^2$ I had said in the comment we minimize a cost function, if you like this better just take cost $=-$ payout and minimize cost.

$^3$ I'm sweeping some details about what "slow" means under the rug but can be related to the energy scale of the problem (i.e. multiplying $H$ by a constant will change the speed).

Constraints

Let's say we want to modify the problem above to require that exactly $5$ vertices are on the "good" side of our cut. Mathematically this is $\sum_i s_i-5=0$ . To enforce this, we add a penalty term into our Hamiltonian for solutions that break this constraint. So we add a term like $H_c = -\alpha\left(\sum_i \hat{s}_i -5I\right)^2$ choosing $\alpha$ big enough to ensure a state violating this constraint can't be the highest energy state.

Let's say instead we want to require that there are no more than $5$ vertices on the "good" side of our cut. This, it seems, is rather hard to do. In https://arxiv.org/abs/1702.06248 they state that approximating an inequality constraint to order $k$ requires $\mathcal{O}\left(N^{2k}\right)$ $k$ -spin couplings which would require even more overhead to break them down into 2-qubit couplings which is often necessary on a given architecture. Essentially the strategy is to approximate a step function using a $k^\text{th}$ order polynomial. This seems like a terrible way to go about it - but I can't think of better way. This is coming from Troyer in 2017 so it's relatively unlikely, though certainly possible, that a better way is currently known.

The general case

The question asks about a general method for encoding an optimization problem into a Hamiltonian. Specifically we want to minimize $f(x)$ subject to a set of constraints. In the section above I discussed adding the constraints to the Hamiltonian. So for a completely general $f(x)$ , is there a way to encode it into a Hamiltonian? The general method for this in the literature is to assume we have access to an efficient quantum oracle that implements $f(x)$ . We can think of this as having a black box operation (i.e. quantum oracle) $\hat{f}(x)$ such that $\hat{f}(x)|x\rangle=f(x) |x\rangle$ . Then we may construct our Hamiltonian as

H = \sum_{x} \hat{f} (x) | x ⟩ ⟨ x |

$H = \sum_x \hat{f}(x)|x\rangle\langle x|$ Of course this just pushes the difficult part into finding/constructing

\hat{f} (x)

$\hat{f}(x)$ . In fact, simple counting arguments show that almost all (in the mathematical sense) quantum oracles are exponentially inefficient to implement (see http://www.ar-tiste.com/imp-oracles/imps2.pdf). So while this is a general encoding of an optimization problem into a Hamiltonian - it's not really practical. It would seem to be the case that if you want to encode your optimization problem into a Hamiltonian in a useful way - you'll need to leverage some structure of

f (x)

$f(x)$ . My understanding is that the specifics of exactly how to do this and how to do this in the best manner is not fully understood and is the subject of active research.

— bRost03
ソース

The maxcut problem is well explained in this answer. However he optimization problem is stated in a way that it deviates a bit from the max-cut problem regarding the equality and inequality constraints .

— Bram

I don't do too much with optimization in my work. Can you give a specific example that conforms to the given form? I can take a stab at coming up with a Hamiltonian for it

— bRost03

I have edited the answer to include an equality constraint and discuss the difficulty of implementing an inequality constraint

— bRost03

Edited further to add a blurb about the general case

— bRost03

Great answer! I was especially interested in the part explaining transition between

s

$s$ and

\hat{s}

$\hat{s}$ .

— brzepkowski