Which search range should I use to determine the optimal C and gamma parameters of an SVM?

32

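I am using an SVM for classification and am trying to determine the optimal parameters for the linear and RBF kernels. For the linear kernel I use cross-validated parameter selection to determine C, and for the RBF kernel I use a grid search to determine C and gamma.

I have 20 (numeric) features and 70 training examples that need to be classified into 7 classes.

Which search range should I use to determine the optimal values of the C and gamma parameters?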

classification svm kernel-trick

— キウィア

31

Check out A Practical Guide to SVM Classification for some pointers, particularly page 5.

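We recommend a "grid-search" on $C$ and $\gamma$ using cross-validation. Various pairs of $(C,\gamma)$ values are tried and the one with the best cross-validation accuracy is picked. We found that trying exponentially growing sequences of $C$ and $\gamma$ is a practical method to identify good parameters (for example, $C = 2^{-5},2^{-3},\ldots,2^{15};\gamma = 2^{-15},2^{-13},\ldots,2^{3}$ ).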

Remember to normalize your data first and, if you can, gather more data, because from the looks of it your problem might be heavily underdetermined.
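For illustration, here is a minimal sketch of such a grid search. It assumes scikit-learn (the guide itself works with LIBSVM's tools), and the data is a synthetic stand-in with the same shape as in the question, so the printed result says nothing about the real problem. Scaling is placed inside the pipeline so that it is refit on each training fold.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic placeholder data (assumption): 70 samples, 20 features, 7 classes.
X, y = make_classification(
    n_samples=70,
    n_features=20,
    n_informative=10,
    n_classes=7,
    n_clusters_per_class=1,
    flip_y=0.0,
    random_state=0,
)

# Exponentially growing sequences from the quoted guide.
C_range = 2.0 ** np.arange(-5, 16, 2)      # 2^-5, 2^-3, ..., 2^15
gamma_range = 2.0 ** np.arange(-15, 4, 2)  # 2^-15, 2^-13, ..., 2^3

# Standardize the features, then fit an RBF-kernel SVM.
pipeline = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
param_grid = {"svc__C": C_range, "svc__gamma": gamma_range}

# 5-fold cross-validation over every (C, gamma) pair.
search = GridSearchCV(pipeline, param_grid, cv=5)
search.fit(X, y)

print(search.best_params_, search.best_score_)
```

If the best pair sits on the edge of the grid, it is probably worth extending the range in that direction and searching again.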

— ciri

Should peer testing be done manually? Is there no library that does it?

— x-rw

11

Check out section 2.3.2 of this paper by Chapelle and Zien. They have a nice heuristic to select a good search range for $\sigma$ of the RBF kernel and $C$ for the SVM. I quote

To determine good values of the remaining free parameters (e.g., by CV), it is important to search on the right scale. We therefore fix default values for $C$ and $\sigma$ that have the right order of magnitude. In a $c$ -class problem we use the $1/c$ quantile of the pairwise distances $D^\rho_{ij}$ of all data-points as a default for $\sigma$ . The default for $C$ is the inverse of the empirical variance $s^2$ in feature space, which can be calculated by $s^2 = \frac{1}{n} \sum_i K_{ii} - \frac{1}{n^2}\sum_{i,j} K_{ij}$ from an $n\times n$ kernel matrix $K$ .

Afterwards, they use multiples (e.g. $2^k$ for $k\in \{-2,...,2\}$ ) of the default value as search range in a grid-search using cross-validation. That always worked very well for me.

Of course, as @ciri said, normalizing the data etc. is always a good idea.
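A rough sketch of how the heuristic could be computed with NumPy follows. Two things here are assumptions rather than taken from the paper: plain Euclidean distances stand in for $D^\rho_{ij}$, and the random data and the helper name `default_sigma_and_C` are placeholders for illustration only.

```python
import numpy as np

def default_sigma_and_C(X, n_classes):
    """Default sigma and C in the spirit of the quoted heuristic.

    Assumption: plain Euclidean pairwise distances are used.
    """
    # Squared Euclidean distance between every pair of rows.
    diff = X[:, None, :] - X[None, :, :]
    sq_dists = np.sum(diff ** 2, axis=-1)

    # Distances for distinct pairs only (upper triangle, no diagonal).
    upper = np.triu_indices(len(X), k=1)
    dists = np.sqrt(sq_dists[upper])

    # sigma default: the 1/c quantile of the pairwise distances.
    sigma = np.quantile(dists, 1.0 / n_classes)

    # RBF kernel matrix with width sigma.
    K = np.exp(-sq_dists / (2.0 * sigma ** 2))

    # s^2 = (1/n) sum_i K_ii - (1/n^2) sum_ij K_ij
    s2 = np.mean(np.diag(K)) - np.mean(K)
    return sigma, 1.0 / s2

# Placeholder data (assumption): 70 samples, 20 features, 7 classes.
rng = np.random.default_rng(0)
X = rng.standard_normal((70, 20))

sigma0, C0 = default_sigma_and_C(X, n_classes=7)

# Search over multiples 2^k, k in {-2, ..., 2}, of the defaults.
multipliers = 2.0 ** np.arange(-2, 3)
sigma_range = sigma0 * multipliers
C_range = C0 * multipliers

# Equivalent gamma values for libraries that use exp(-gamma * d^2).
gamma_range = 1.0 / (2.0 * sigma_range ** 2)

print(sigma_range, C_range, gamma_range)
```

The resulting `C_range` and `gamma_range` can then be handed to whatever cross-validated grid search you already use.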

— fabee

I think there are several equivalent RBF kernel formulations, one with gamma and another with sigma, i.e. gamma = 1/(2 sigma^2). Does the parameter in the above heuristic correspond to gamma, sigma or sigma^2? I have found other descriptions of the same heuristic which are for gamma.

— machinery

If you check the linked paper, it is $\frac{1}{2\sigma^2}$
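As a quick sanity check of that relation, the small sketch below compares the two parameterizations numerically. The function names and the sample points are made up for illustration.

```python
import math

def gamma_from_sigma(sigma):
    # gamma = 1 / (2 * sigma^2)
    return 1.0 / (2.0 * sigma ** 2)

def squared_distance(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y))

def rbf_with_sigma(x, y, sigma):
    # exp(-||x - y||^2 / (2 * sigma^2))
    return math.exp(-squared_distance(x, y) / (2.0 * sigma ** 2))

def rbf_with_gamma(x, y, gamma):
    # exp(-gamma * ||x - y||^2)
    return math.exp(-gamma * squared_distance(x, y))

# Arbitrary example points and width.
x, y = [1.0, 2.0], [2.0, 0.0]
sigma = 1.5

k_sigma = rbf_with_sigma(x, y, sigma)
k_gamma = rbf_with_gamma(x, y, gamma_from_sigma(sigma))

# Both formulations give the same kernel value.
print(math.isclose(k_sigma, k_gamma))
```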

— fabee

@fabee Should peer testing be done manually? Is there no library that does it?

— x-rw