Ridge detection

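Ridge detection is the attempt to locate ridges in an image. In mathematics and computer vision, the ridges (or ridge set) of a smooth function of two variables are the set of curves formed by all points that are local maxima of the function in at least one dimension, much like the notion of a ridge in geography. For a function of $N$ variables, the ridges are the set of curves formed by all points that are local maxima in $N-1$ dimensions. A ridge can thus be viewed as an extension of the concept of a local maximum. The valleys of a function are defined analogously, with local minima taking the place of local maxima in the definition.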

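Ridge sets and valley sets capture intuitive geometric information about a function and therefore give a compact representation of its features. Image analysis and computer vision use them, through ridge detection and valley detection, to capture the elongated structure of objects in an image. Applications include image segmentation, where ridges and valleys are used in the watershed algorithm, and estimating the likely shape of objects from the ridges and valleys in an image. When computed at a single scale only, this kind of geometric representation is easily disturbed by noise. According to scale-space theory, multi-scale ridges and valleys can be obtained by first processing the image with a Gaussian pyramid, which is expected to give a more stable description of the objects.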

Differential geometric definition of ridges and valleys at a single scale in a two-dimensional image

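Let $f(x,y)$ be a two-dimensional function, and let $L$ be the scale-space representation of $f(x,y)$, obtained by convolving $f(x,y)$ with the Gaussian function

g(x,y,t)={\frac {1}{2\pi t}}e^{-(x^{2}+y^{2})/2t}.

In the single-scale setting, the scale parameter $t$ of the Gaussian is held at a fixed value.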
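The convolution that defines $L$ can be sketched numerically as follows. This is a minimal illustration rather than a reference implementation: the function names `gaussian_kernel_1d` and `scale_space`, the truncation of the kernel at four standard deviations, and the reflecting boundary are all assumptions made here. It relies on the fact that the Gaussian is separable, so the two-dimensional convolution can be carried out as two one-dimensional passes.

```python
import numpy as np

def gaussian_kernel_1d(t, truncate=4.0):
    """Sampled 1-D Gaussian with variance t, renormalised to sum to 1.

    The truncation radius (truncate * sqrt(t)) is an assumption of this sketch.
    """
    radius = max(1, int(truncate * np.sqrt(t) + 0.5))
    x = np.arange(-radius, radius + 1, dtype=float)
    kernel = np.exp(-x**2 / (2.0 * t))
    return kernel / kernel.sum()

def scale_space(f, t):
    """Approximate L(x, y; t) = (g(., .; t) * f)(x, y) for a 2-D array f."""
    kernel = gaussian_kernel_1d(t)
    pad = len(kernel) // 2
    # Reflecting boundary handling is an assumption, not part of the definition.
    padded = np.pad(np.asarray(f, dtype=float), pad, mode="reflect")
    # g is separable: smooth along x (axis 1), then along y (axis 0).
    rows = np.apply_along_axis(np.convolve, 1, padded, kernel, mode="valid")
    return np.apply_along_axis(np.convolve, 0, rows, kernel, mode="valid")

# Usage: smoothing a unit impulse reproduces a sampled g(x, y, t).
impulse = np.zeros((65, 65))
impulse[32, 32] = 1.0
L = scale_space(impulse, t=4.0)
```

Here `t` is the variance of the Gaussian, matching the formula above, so the standard deviation is the square root of `t`.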

By a rotation, one can find an orthogonal coordinate system $(p,q)$ in which $L_{pq}=0$, with directional derivative operators

\partial _{p}=\sin \beta \partial _{x}-\cos \beta \partial _{y},\partial _{q}=\cos \beta \partial _{x}+\sin \beta \partial _{y}

Substituting these into the Hessian matrix gives

H={\begin{bmatrix}L_{xx}&L_{xy}\\L_{xy}&L_{yy}\end{bmatrix}}={\begin{bmatrix}\sin \beta &\cos \beta \\-\cos \beta &\sin \beta \end{bmatrix}}{\begin{bmatrix}L_{pp}&L_{pq}\\L_{pq}&L_{qq}\end{bmatrix}}{\begin{bmatrix}\sin \beta &-\cos \beta \\\cos \beta &\sin \beta \end{bmatrix}}

With $L_{pq}=0$ the middle matrix is diagonal, so $L_{pp}$ and $L_{qq}$ are the eigenvalues of the Hessian matrix, and the rotation angle $\beta$ is determined by its eigenvectors:

\cos \beta ={\sqrt {{\frac {1}{2}}\left(1+{\frac {L_{xx}-L_{yy}}{\sqrt {(L_{xx}-L_{yy})^{2}+4L_{xy}^{2}}}}\right)}},

\sin \beta =\operatorname {sgn}(L_{xy}){\sqrt {{\frac {1}{2}}\left(1-{\frac {L_{xx}-L_{yy}}{\sqrt {(L_{xx}-L_{yy})^{2}+4L_{xy}^{2}}}}\right)}}.
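The formulas for $\cos \beta$ and $\sin \beta$ can be checked numerically. The sketch below, with the hypothetical helper names `principal_rotation` and `rotated_hessian`, evaluates the angle for a given Hessian and expands $L_{pp}$, $L_{pq}$ and $L_{qq}$ from the operators $\partial _{p}$ and $\partial _{q}$. Two assumptions are made: the square root in the denominator is non-zero, and $\operatorname {sgn}(0)$ is taken as $+1$ so that $\sin \beta$ does not vanish together with $\cos \beta$ when $L_{xy}=0$ and $L_{xx}<L_{yy}$.

```python
import numpy as np

def principal_rotation(Lxx, Lxy, Lyy):
    """Return (cos_beta, sin_beta) for scalar second derivatives."""
    d = np.sqrt((Lxx - Lyy)**2 + 4.0 * Lxy**2)  # assumed non-zero
    ratio = max(-1.0, min(1.0, (Lxx - Lyy) / d))  # guard against rounding
    cos_b = np.sqrt(0.5 * (1.0 + ratio))
    # Assumption of this sketch: sgn(0) is treated as +1.
    sign = 1.0 if Lxy >= 0 else -1.0
    sin_b = sign * np.sqrt(0.5 * (1.0 - ratio))
    return cos_b, sin_b

def rotated_hessian(Lxx, Lxy, Lyy):
    """Second derivatives in the (p, q) system, expanded from
    d_p = sin(b) d_x - cos(b) d_y and d_q = cos(b) d_x + sin(b) d_y."""
    c, s = principal_rotation(Lxx, Lxy, Lyy)
    Lpp = s * s * Lxx - 2.0 * s * c * Lxy + c * c * Lyy
    Lpq = s * c * (Lxx - Lyy) + (s * s - c * c) * Lxy
    Lqq = c * c * Lxx + 2.0 * s * c * Lxy + s * s * Lyy
    return Lpp, Lpq, Lqq

# Usage: the mixed derivative vanishes and Lpp, Lqq are the eigenvalues.
Lpp, Lpq, Lqq = rotated_hessian(1.0, 2.0, -3.0)
eigenvalues = np.linalg.eigvalsh(np.array([[1.0, 2.0], [2.0, -3.0]]))
```

With this choice of angle, `Lpp` comes out as the smaller eigenvalue and `Lqq` as the larger one.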

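With this rotation in place, the ridges of $f(x,y)$ at a single scale can be defined formally as the set of all points that satisfy the following conditions.^[1]

L_{p}=0,L_{pp}\leq 0,|L_{pp}|\geq |L_{qq}|.

Correspondingly, the valleys are the set of all points that satisfy the following conditions.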

L_{q}=0,L_{qq}\geq 0,|L_{qq}|\geq |L_{pp}|.
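As a rough illustration of how these conditions might be evaluated on a pixel grid, the sketch below builds boolean ridge and valley masks from arrays of first and second derivatives. Several choices are assumptions of this sketch and not part of the definition: the function name `ridge_valley_masks`, derivatives estimated with `numpy.gradient`, the equality $L_{p}=0$ replaced by a tolerance `tol`, and $\operatorname {sgn}(0)$ taken as $+1$. A practical detector would more likely look for zero crossings of $L_{p}$ than rely on a fixed tolerance.

```python
import numpy as np

def ridge_valley_masks(Lx, Ly, Lxx, Lxy, Lyy, tol=1e-9):
    """Boolean masks of points meeting the single-scale ridge/valley conditions."""
    d = np.sqrt((Lxx - Lyy)**2 + 4.0 * Lxy**2)
    safe_d = np.where(d > 0, d, 1.0)
    ratio = np.clip(np.where(d > 0, (Lxx - Lyy) / safe_d, 0.0), -1.0, 1.0)
    sign = np.where(Lxy >= 0, 1.0, -1.0)       # sgn(0) := +1 (assumption)
    c = np.sqrt(0.5 * (1.0 + ratio))           # cos(beta)
    s = sign * np.sqrt(0.5 * (1.0 - ratio))    # sin(beta)
    Lp = s * Lx - c * Ly
    Lq = c * Lx + s * Ly
    Lpp = 0.5 * (Lxx + Lyy - d)                # smaller Hessian eigenvalue
    Lqq = 0.5 * (Lxx + Lyy + d)                # larger Hessian eigenvalue
    ridge = (np.abs(Lp) <= tol) & (Lpp <= 0) & (np.abs(Lpp) >= np.abs(Lqq))
    valley = (np.abs(Lq) <= tol) & (Lqq >= 0) & (np.abs(Lqq) >= np.abs(Lpp))
    return ridge, valley

# Usage: a synthetic image with one vertical ridge line at column 20.
x = np.arange(41, dtype=float)
profile = np.exp(-(x - 20.0)**2 / (2.0 * 4.0**2))
L = np.tile(profile, (31, 1))                  # rows index y, columns index x
Ly, Lx = np.gradient(L)                        # derivatives along axis 0, axis 1
Lxy, Lxx = np.gradient(Lx)
Lyy, _ = np.gradient(Ly)
ridge, valley = ridge_valley_masks(Lx, Ly, Lxx, Lxy, Lyy)
```

In this noise-free example the ridge mask marks only the centre column and the valley mask stays empty; on real images the image would first be smoothed to the chosen scale $t$.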

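Similarly, the coordinates can be rotated to a new system $(u,v)$ in which the $v$ direction is parallel to the image gradient and the $u$ direction is perpendicular to it:

\partial _{u}=\sin \alpha \partial _{x}-\cos \alpha \partial _{y},\partial _{v}=\cos \alpha \partial _{x}+\sin \alpha \partial _{y}

The rotation angle is then given by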

\cos \alpha ={\frac {L_{x}}{\sqrt {L_{x}^{2}+L_{y}^{2}}}},\sin \alpha ={\frac {L_{y}}{\sqrt {L_{x}^{2}+L_{y}^{2}}}}

The following definition can be shown to be equivalent to the one given above^[2]

L_{uv}=0,L_{uu}^{2}-L_{vv}^{2}\geq 0

where

L_{v}^{2}L_{uu}=L_{x}^{2}L_{yy}-2L_{x}L_{y}L_{xy}+L_{y}^{2}L_{xx},

L_{v}^{2}L_{uv}=L_{x}L_{y}(L_{xx}-L_{yy})-(L_{x}^{2}-L_{y}^{2})L_{xy},

L_{v}^{2}L_{vv}=L_{x}^{2}L_{xx}+2L_{x}L_{y}L_{xy}+L_{y}^{2}L_{yy}

The sign of $L_{uu}$ determines whether a point is a ridge or a valley: $L_{uu}<0$ indicates a ridge and $L_{uu}>0$ a valley.
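The three expressions for $L_{uu}$, $L_{uv}$ and $L_{vv}$ translate directly into code. The sketch below uses the hypothetical function name `gradient_frame_derivatives` and assumes a non-zero gradient, since each expression is divided by $L_{v}^{2}=L_{x}^{2}+L_{y}^{2}$. A useful sanity check is that $L_{uu}+L_{vv}=L_{xx}+L_{yy}$, because the Laplacian does not change under rotation.

```python
import numpy as np

def gradient_frame_derivatives(Lx, Ly, Lxx, Lxy, Lyy):
    """Second derivatives in the gradient-aligned (u, v) system.

    Assumes Lx**2 + Ly**2 is non-zero (the frame is undefined otherwise).
    """
    Lv2 = Lx**2 + Ly**2
    Luu = (Lx**2 * Lyy - 2.0 * Lx * Ly * Lxy + Ly**2 * Lxx) / Lv2
    Luv = (Lx * Ly * (Lxx - Lyy) - (Lx**2 - Ly**2) * Lxy) / Lv2
    Lvv = (Lx**2 * Lxx + 2.0 * Lx * Ly * Lxy + Ly**2 * Lyy) / Lv2
    return Luu, Luv, Lvv

# Usage with arbitrary illustrative derivative values at one point.
Luu, Luv, Lvv = gradient_frame_derivatives(3.0, 4.0, 1.0, 2.0, -3.0)
laplacian_gap = (Luu + Lvv) - (1.0 + (-3.0))   # should be close to zero
```

For these values $L_{uv}$ is not zero, so the point is not on a ridge or valley; the example only illustrates the change of coordinates.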

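Computing variable-scale ridges from two-dimensional images

The main problem with ridges defined at a single scale is that they are very sensitive to noise; experiments show that the scale must be tuned carefully to obtain a set of ridges that reflects the structures in the image. To handle this problem when no prior information is available, the notion of scale-space ridges was introduced. Here the scale parameter is treated as an inherent property of the ridge definition and is allowed to vary across scale space. This lets the scale adapt automatically to the size of the structures in the image, and several methods described below build on this idea.

Let $R(x,y,t)$ denote a measure of ridge strength (defined in detail below). For a two-dimensional image, the scale-space ridges are then the set of all points that satisfy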

L_{p}=0,L_{pp}\leq 0,\partial _{t}(R)=0,\partial _{tt}(R)\leq 0,

where $t$ is the scale parameter of the scale-space representation. Similarly, the scale-space valleys are the set of all points that satisfy

L_{q}=0,L_{qq}\geq 0,\partial _{t}(R)=0,\partial _{tt}(R)\leq 0.

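Under this definition, the scale-space ridges can be pictured as a set of one-dimensional curves in a three-dimensional space (the two dimensions of the original image plus one dimension of scale), and the ridges finally displayed are the projections of these curves onto the image plane.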
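The idea that the scale adapts to the size of the structure can be illustrated with a model case that has a closed form. Everything in this sketch is an assumption chosen for illustration: the ridge is modelled as a one-dimensional Gaussian profile of variance `t0`, the ridge strength is taken as the magnitude of $t^{\gamma }L_{xx}$ at the ridge centre, and the names `normalized_ridge_strength` and `selected_scale` are hypothetical. Smoothing the profile to scale $t$ gives a Gaussian of variance $t_{0}+t$, so the strength is $t^{\gamma }/({\sqrt {2\pi }}(t_{0}+t)^{3/2})$, which is maximal at $t=\gamma t_{0}/(3/2-\gamma )$.

```python
import numpy as np

def normalized_ridge_strength(t, t0, gamma):
    """|t**gamma * L_xx| at the centre of a Gaussian ridge of variance t0."""
    return t**gamma / (np.sqrt(2.0 * np.pi) * (t0 + t)**1.5)

def selected_scale(t0, gamma, ts):
    """Scale in the grid ts at which the normalized strength is largest."""
    return ts[np.argmax(normalized_ridge_strength(ts, t0, gamma))]

# Usage: search a grid of scales for a ridge of variance t0 = 4.
ts = np.linspace(0.1, 50.0, 4991)              # grid step of 0.01
t_hat = selected_scale(4.0, 0.75, ts)          # expected near gamma*t0/(1.5-gamma)
```

In this model, choosing $\gamma =3/4$ makes the selected scale coincide with the width $t_{0}$ of the ridge, while $\gamma =1$ selects $2t_{0}$.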

When Lindeberg (1996, 1998)^[3] introduced scale-space ridges, he considered three measures of ridge strength:

The main principal curvature

L_{pp,\gamma -norm}={\frac {t^{\gamma }}{2}}\left(L_{xx}+L_{yy}-{\sqrt {(L_{xx}-L_{yy})^{2}+4L_{xy}^{2}}}\right)

The square of the $\gamma$-normalized difference of the squared eigenvalues

N_{\gamma -norm}=\left(L_{pp,\gamma -norm}^{2}-L_{qq,\gamma -norm}^{2}\right)^{2}=t^{4\gamma }(L_{xx}+L_{yy})^{2}\left((L_{xx}-L_{yy})^{2}+4L_{xy}^{2}\right).

The square of the $\gamma$-normalized eigenvalue difference

A_{\gamma -norm}=\left(L_{pp,\gamma -norm}-L_{qq,\gamma -norm}\right)^{2}=t^{2\gamma }\left((L_{xx}-L_{yy})^{2}+4L_{xy}^{2}\right).

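$L_{pp,\gamma -norm}$ is a general-purpose measure that is often used in applications such as blood vessel detection and road extraction, whereas $A_{\gamma -norm}$ has been used for fingerprint enhancement^[4], real-time hand tracking and gesture recognition^[5], and for modelling local image statistics to detect and track humans in images and video.^[6]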
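The three ridge strength measures can be evaluated directly from the second derivatives. The following sketch, with the hypothetical function name `ridge_strengths`, implements the right-hand sides of the three formulas and compares them with the eigenvalue-based left-hand sides for one set of illustrative values; it accepts scalars or NumPy arrays.

```python
import numpy as np

def ridge_strengths(Lxx, Lxy, Lyy, t, gamma):
    """Return (Lpp_norm, N_norm, A_norm) from second derivatives at scale t."""
    d2 = (Lxx - Lyy)**2 + 4.0 * Lxy**2
    Lpp_norm = 0.5 * t**gamma * (Lxx + Lyy - np.sqrt(d2))
    N_norm = t**(4.0 * gamma) * (Lxx + Lyy)**2 * d2
    A_norm = t**(2.0 * gamma) * d2
    return Lpp_norm, N_norm, A_norm

# Usage: compare with the definitions in terms of the Hessian eigenvalues.
t, gamma = 2.0, 0.75
Lpp_norm, N_norm, A_norm = ridge_strengths(1.0, 2.0, -3.0, t, gamma)
lam_pp, lam_qq = t**gamma * np.linalg.eigvalsh(np.array([[1.0, 2.0], [2.0, -3.0]]))
N_check = (lam_pp**2 - lam_qq**2)**2
A_check = (lam_pp - lam_qq)**2
```

The values of `t` and `gamma` here are arbitrary; the source does not prescribe particular settings.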

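History

Ridges and valleys first appeared in digital image analysis in the work of Haralick in 1983^[7] and in Crowley's 1984 work on Gaussian pyramids^[8]^[9]. The application of ridges to medical imaging has been studied extensively by Pizer and his colleagues^[10]^[11]^[12], who also proposed M-reps.^[13] Ridge detection was further advanced by Lindeberg's introduction of $\gamma$-normalized derivatives and scale-space ridges. These concepts were subsequently used by Steger et al. for road extraction^[14]^[15], by Frangi et al. for blood vessel segmentation^[16], and by Sato et al. and Krissian et al. for the detection of curvilinear and tubular structures.^[17]^[18]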

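Definition of ridges and valleys in N dimensions

The concept of a ridge generalizes the local maximum of a real-valued function. A point $\mathbf {x} _{0}$ in the domain of $f:\mathbb {R} ^{n}\rightarrow \mathbb {R}$ is a local maximum if there exists a distance $\delta >0$ such that every other point $\mathbf {x}$ within distance $\delta$ of $\mathbf {x} _{0}$ satisfies $f(\mathbf {x} )<f(\mathbf {x} _{0})$ .

Relaxing this condition slightly, $\mathbf {x} _{0}$ is a point on a ridge if $f(\mathbf {x} )<f(\mathbf {x} _{0})$ holds only for the nearby points $\mathbf {x}$ in an $(n-1)$-dimensional subset of the neighbourhood of $\mathbf {x} _{0}$ . This relaxation gives the ridge one degree of freedom, so that a ridge is a one-dimensional curve. Applying the same idea to local minima yields one-dimensional valley curves.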

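The following definition follows Eberly's book^[19] and can be regarded as a generalization of the ridge definitions given above. Let $U\subset \mathbb {R} ^{n}$ be an open set and let $f:U\rightarrow \mathbb {R}$ be a smooth function. Let $\mathbf {x} _{0}\in U$ , let $\nabla _{\mathbf {x} _{0}}f$ be the gradient of $f$ at $\mathbf {x} _{0}$ , and let $H_{\mathbf {x} _{0}}(f)$ be the $n\times n$ Hessian matrix of $f$ at $\mathbf {x} _{0}$ . Let $\lambda _{1}\leq \lambda _{2}\leq \cdots \leq \lambda _{n}$ be the $n$ ordered eigenvalues of $H_{\mathbf {x} _{0}}(f)$ , and let $\mathbf {e} _{i}$ be a unit eigenvector corresponding to $\lambda _{i}$ . (Here it is assumed that all eigenvalues are distinct.)

The point $\mathbf {x} _{0}$ lies on a one-dimensional ridge if:

$\lambda _{n-1}<0$ and
$\nabla _{\mathbf {x} _{0}}f\cdot \mathbf {e} _{i}=0$ for $i=1,2,\ldots ,n-1$ .

This makes precise the requirement that $f$ , restricted to a particular $(n-1)$-dimensional subspace, has a local maximum at $\mathbf {x} _{0}$ .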

This definition generalizes naturally to $k$-dimensional ridges: a point $\mathbf {x} _{0}$ lies on a $k$-dimensional ridge if

$\lambda _{n-k}<0$ and
$\nabla _{\mathbf {x} _{0}}f\cdot \mathbf {e} _{i}=0$ for $i=1,2,\ldots ,n-k$ .
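The two conditions can be tested at a single point once the gradient and Hessian are known. The sketch below is a minimal illustration under stated assumptions: the function name `on_ridge` is hypothetical, the exact equality with zero is replaced by a tolerance `tol`, and the eigenvalues are assumed distinct as in the definition. It uses `numpy.linalg.eigh`, which returns the eigenvalues of a symmetric matrix in ascending order, matching the ordering $\lambda _{1}\leq \cdots \leq \lambda _{n}$ .

```python
import numpy as np

def on_ridge(grad, hess, k=1, tol=1e-9):
    """Test whether a point lies on a k-dimensional ridge.

    grad: gradient of f at the point (length n).
    hess: Hessian of f at the point (n x n, symmetric).
    """
    lam, vecs = np.linalg.eigh(np.asarray(hess, dtype=float))
    m = len(lam) - k                      # number of constrained directions
    if m < 1:
        raise ValueError("k must be smaller than the dimension n")
    # Gradient components along e_1, ..., e_{n-k} must vanish.
    projections = vecs[:, :m].T @ np.asarray(grad, dtype=float)
    return bool(lam[m - 1] < 0 and np.all(np.abs(projections) <= tol))

# Usage: f(x, y, z) = -x**2 - 2*y**2 + z has gradient (-2x, -4y, 1)
# and constant Hessian diag(-2, -4, 0).
hess = np.diag([-2.0, -4.0, 0.0])
on_axis = on_ridge([0.0, 0.0, 1.0], hess)            # point (0, 0, z)
off_axis = on_ridge([-1.0, 0.0, 1.0], hess)          # point (0.5, 0, 0)
on_plane = on_ridge([-1.0, 0.0, 1.0], hess, k=2)     # same point, k = 2
```

For this example function the one-dimensional ridge is the $z$ axis, and the plane $y=0$ is a two-dimensional ridge, so the three calls return `True`, `False` and `True` respectively.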

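Maximal scale ridge

The following definition can be traced back to Fritsch^[20], who was interested in extracting geometric information from two-dimensional greyscale images. He filtered the images with a medialness filter, which yields scale-space data resembling the distance of each point to the object boundary. When the ridges of this data are projected back onto the original image, they closely resemble a shape skeleton of the image (such as the Blum medial axis).

A maximal scale ridge is defined on a function of three variables, two of which span the image plane and one of which is scale. The condition we want to hold is that if $(\mathbf {x} ,\sigma )$ is a point on the maximal scale ridge, then the value of the function at this point is a maximum along the scale axis. Let $f(\mathbf {x} ,\sigma )$ be a smooth differentiable function on $U\subset \mathbb {R} ^{2}\times \mathbb {R} _{+}$ . Then $(\mathbf {x} ,\sigma )$ is a point on the maximal scale ridge if and only if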

${\frac {\partial f}{\partial \sigma }}=0$ and ${\frac {\partial ^{2}f}{\partial \sigma ^{2}}}<0$ , and
$\nabla f\cdot \mathbf {e} _{1}=0$ and $\mathbf {e} _{1}^{t}H(f)\mathbf {e} _{1}<0$ .

References

[1] Lindeberg, T. Scale-space. Encyclopedia of Computer Science and Engineering (Benjamin Wah, ed.), John Wiley and Sons. 2008/2009, IV: 2495–2504. doi:10.1002/9780470050118.ecse609.
[2] Lindeberg, T. Scale-space theory: A basic tool for analysing structures at different scales. J. of Applied Statistics. 1994, 21 (2): 224–270. doi:10.1080/757582976.
[3] Lindeberg, T. Edge detection and ridge detection with automatic scale selection. International Journal of Computer Vision. 1998, 30 (2): 117–154. doi:10.1023/A:1008097225773. Earlier version presented at IEEE Conference on Pattern Recognition and Computer Vision, CVPR'96, San Francisco, California, pages 465–470, June 1996.
[4] Almansa, A., Lindeberg, T. Fingerprint Enhancement by Shape Adaptation of Scale-Space Operators with Automatic Scale-Selection. IEEE Transactions on Image Processing. 2000, 9 (12): 2027–42. PMID 18262941. doi:10.1109/83.887971.
[5] Bretzner, L., Laptev, I., Lindeberg, T. Hand Gesture Recognition using Multi-Scale Colour Features, Hierarchical Models and Particle Filtering. Proc. IEEE Conference on Face and Gesture 2002, Washington DC, 423–428.
[6] Sidenbladh, H., Black, M. Learning the statistics of people in images and video. International Journal of Computer Vision. 2003, 54 (1–2): 183–209.
[7] Haralick, R. Ridges and Valleys on Digital Images. Computer Vision, Graphics, and Image Processing. April 1983, 22 (10): 28–38. doi:10.1016/0734-189X(83)90094-4.
[8] Crowley, J.L., Parker, A.C. A Representation for Shape Based on Peaks and Ridges in the Difference of Low Pass Transform. IEEE Trans Pattern Anal Mach Intell. March 1984, 6 (2): 156–170. PMID 21869180. doi:10.1109/TPAMI.1984.4767500.
[9] Crowley, J.L., Sanderson, A. Multiple Resolution Representation and Probabilistic Matching of 2-D Gray-Scale Shape. IEEE Trans Pattern Anal Mach Intell. January 1987, 9 (1): 113–121. doi:10.1109/TPAMI.1987.4767876.
[10] Gauch, J.M., Pizer, S.M. Multiresolution Analysis of Ridges and Valleys in Grey-Scale Images. IEEE Trans Pattern Anal Mach Intell. June 1993, 15 (6): 635–646. doi:10.1109/34.216734.
[11] Eberly, D., Gardner, R., Morse, B., Pizer, S., Scharlach, C. Ridges for image analysis. Journal of Mathematical Imaging and Vision. December 1994, 4 (4): 353–373. doi:10.1007/BF01262402.
[12] Pizer, S.M., Eberly, D., Fritsch, D.S. Zoom-invariant vision of figural shape: the mathematics of cores. Computer Vision and Image Understanding. January 1998, 69 (1): 55–71. doi:10.1006/cviu.1997.0563.
[13] Pizer, S., Joshi, S., Fletcher, T., Styner, M., Tracton, G., Chen, J. Segmentation of Single-Figure Objects by Deformable M-reps. Proceedings of the 4th International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer Lecture Notes in Computer Science, Vol. 2208, 2001, pp. 862–871.
[14] Steger, C. An unbiased detector of curvilinear structures. IEEE Trans Pattern Anal Mach Intell. 1998, 20 (2): 113–125. doi:10.1109/34.659930.
[15] Laptev, I., Mayer, H., Lindeberg, T., Eckstein, W., Steger, C., Baumgartner, A. Automatic extraction of roads from aerial images based on scale-space and snakes. Machine Vision and Applications. 2000, 12 (1): 23. doi:10.1007/s001380050121.
[16] Frangi, A.F., Niessen, W.J., Hoogeveen, R.M., van Walsum, T., Viergever, M.A. Model-based quantitation of 3-D magnetic resonance angiographic images. IEEE Trans Med Imaging. October 1999, 18 (10): 946–56. PMID 10628954. doi:10.1109/42.811279.
[17] Sato, Y., Nakajima, S., Shiraga, N., Atsumi, H., Yoshida, S., et al. Three-dimensional multi-scale line filter for segmentation and visualization of curvilinear structures in medical images. Medical Image Analysis. 1998, 2 (2): 143–168. doi:10.1016/s1361-8415(98)80009-1.
[18] Krissian, K., Malandain, G., Ayache, N., Vaillan, R., Trousset, Y. Model-based detection of tubular structures in 3D images. Computer Vision and Image Understanding. 2000, 80 (2): 130–171. doi:10.1006/cviu.2000.0866.
[19] Eberly, D. Ridges in Image and Data Analysis. Kluwer. 1996. ISBN 0-7923-4268-2.
[20] Fritsch, D.S., Eberly, D., Pizer, S.M., McAuliffe, M.J. Stimulated cores and their applications in medical imaging. Information Processing in Medical Imaging, Y. Bizais, C. Barillot, R. DiPaola, eds., Kluwer Series in Computational Imaging and Vision, pp. 365–368.
