PCA: definitions

2008-11-30 15:20:38 +00:00 · 2008-11-30 15:20:38 +00:00 · b83c6bd02a
parent a81834c926
commit b83c6bd02a
3 changed files with 9 additions and 19 deletions
--- a/Principal_component_analysis/doc_tex/Principal_component_analysis/PkgDescription.tex
+++ b/Principal_component_analysis/doc_tex/Principal_component_analysis/PkgDescription.tex
@ -1,7 +1,7 @@

 \begin{ccPkgDescription}{Principal Component Analysis\label{Pkg:PrincipalComponentAnalysisD}}
 \ccPkgHowToCiteCgal{cgal:ap-pcad-08}
-\ccPkgSummary{This package provides functions to compute global information on the shape of a set of 2D or 3D objects. It provides the computation of axis-aligned bounding boxes for sets of bounded objects, and barycenters of weighted point sets. In addition, it provides computation of centroids (center of mass) and linear least squares fitting for point sets as well as for sets of other bounded objects in 2D and 3D. More specifically, these objects include segments, circles, disks, rectangles, triangles, cuboids, tetrahedra, spheres and balls. The common interface to these functions takes an iterator range of objects.}
+\ccPkgSummary{This package provides functions to compute global information about the shape of a set of 2D or 3D objects. It provides the computation of axis-aligned bounding boxes for sets of bounded objects, and barycenters of weighted point sets. In addition, it provides computation of centroids (center of mass) and linear least squares fitting for point sets as well as for sets of other bounded objects. More specifically, these objects include segments, circles, disks, rectangles, triangles, cuboids, tetrahedra, spheres and balls. The common interface to these functions takes an iterator range of objects.}

 %\ccPkgDependsOn{}
 \ccPkgIntroducedInCGAL{3.2}
--- a/Principal_component_analysis/doc_tex/Principal_component_analysis/intro.tex
+++ b/Principal_component_analysis/doc_tex/Principal_component_analysis/intro.tex
@ -5,15 +5,13 @@ This package provides functions to analyze sets of objects in 2D and 3D. It prov

 A \emph{bounding box} for a set of objects is a cuboid that contains the set. An \emph{axis-aligned bounding box} is a an expression of the maximum extents of all objects from the set within their coordinate system, i.e., a bounding box aligned with the axes of the coordinate system. Axis-aligned bounding boxes are frequently used in geometric algorithms as an indication of the general position of a data set, for either display, first-approximation spatial query, or spatial indexing purposes. \\

-A \emph{centroid} of a set of objects is their center of mass. \\
+A \emph{centroid} of a set of objects is their center of mass, i.e., the point whose coordinates are computed by means of coordinates of all points composing the objects. Note that although the general definition of center of mass incorporates a density function (and hence weighted means), the current implementation assumes a uniform density (see barycenter below defined for weighted points). For a point set $\{X_1,X_2,...,X_N\}$ the centroid $\bar{X}$ is computed as $$\bar{X} = \frac{1}{N} \sum_{i=1}{N} X_i.$$ For a set of segments $\{S_1,S_2,...,S_N\}$ the centroid $\bar{X}$ is computed as $$\bar{X} = \frac{1}{\sum_{i=1}{N}\|S_i\|} \sum_{i=1}{N} \|S_i\|} \bar{S_i},$$ where $\|S_i\|$ stands for the length of a segment and $\bar{S_i}$ stands for its uniform barycenter (midpoint). For a set of triangles $\{T_1,T_2,...,T_N\}$ the centroid $\bar{X}$ is computed as $$\bar{X} = \frac{1}{\sum_{i=1}{N}\|T_i\|} \sum_{i=1}{N} \|T_i\|} \bar{T_i},$$ where $\|T_i\|$ stands for the area of a triangle and $\bar{T_i}$ stands for its uniform barycenter. Such definition still holds for more general objects where the Lebesgue measure (length in 1D, area in 2D, volume in 3D) is used for weighting the object barycenters. Centers of mass are used to summarize data sets for either approximation, spatial query or spatial indexing purposes.\\

+A \emph{barycenter} of a set of weighted points is the point whose coordinates are computed by means of weighted coordinates of all weighted points from the set. When all weights are equal the barycenter coincides with the centroid.\\

-A \emph{barycenter} of weighted point sets is defined as weighted
-average of position. When all weights are equal the barycenter coincides with the centroid.\\
-Centers of mass are used to summarize data sets for approximation.
+Given a point set, \emph{linear least squares fitting} amounts to find the linear sub-space which minimizes the sum of squared distances from the points to their projection onto this linear sub-space. 

-
-Given a point set, \emph{linear least squares fitting} amounts to find the linear sub-space which minimizes the sum of squared distances from the points to their projection onto this linear sub-space. This problem is equivalent to search for the linear sub-space which maximizes the variance of projected points, the latter being obtained by eigen decomposition of the covariance matrix of the point set. Eigenvectors corresponding to large eigenvalues are the
+This problem is equivalent to search for the linear sub-space which maximizes the variance of projected points, the latter being obtained by eigen decomposition of the covariance matrix of the point set. Eigenvectors corresponding to large eigenvalues are the
 directions in which the data has strong component, or equivalently large variance. If eigenvalues are the same there is no preferable sub-space.\\

 Given an object set, \emph{linear least squares fitting} amounts to find the linear sub-space which minimizes the sum of squared distances from all points in the set to their projection onto this linear sub-space. This problem is equivalent to the one of fitting a linear sub-space to a point set, except that the covariance matrix is now derived (closed form formula) from a continuous integral over the objects instead of a discrete sum over the points.
--- a/Principal_component_analysis/doc_tex/Principal_component_analysis/main.tex
+++ b/Principal_component_analysis/doc_tex/Principal_component_analysis/main.tex
@ -11,33 +11,25 @@

 \subsection{Bounding Box of a Point Set}

-In the following example we use \stl\ containers of 2D and 3D points, and
-compute their axis-aligned bounding box. The kernel from which the input points
-come is automatically deduced by the function.
+In the following example we use \stl\ containers of 2D and 3D points, and compute their axis-aligned bounding box. The kernel from which the input points originate is automatically deduced by the function.

 \ccIncludeExampleCode{Principal_component_analysis/bounding_box.cpp}

 \subsection{Centroid of a Point Set}

-In the following example we use \stl\ containers of 2D and 3D points, and
-compute their centroid. The kernel from which the input points
-come is automatically deduced by the function.
+In the following example we use \stl\ containers of 2D and 3D points, and compute their centroid. The kernel from which the input points originate is automatically deduced by the function.

 \ccIncludeExampleCode{Principal_component_analysis/centroid.cpp}

 \subsection{Barycenter of a Set of Weighted Points}

-In the following example we use \stl\ containers of 2D and 3D weighted points,
-and compute their barycenter. The kernel from which the input points come is
-automatically deduced by the function.
+In the following example we use \stl\ containers of 2D and 3D weighted points, and compute their barycenter. The kernel from which the input points originate is automatically deduced by the function.

 \ccIncludeExampleCode{Principal_component_analysis/barycenter.cpp}

 \subsection{Best Fitting Line of a 2D Point Set}

-In the following example we use an \stl\ container of 2D points, and
-compute the best fitting line. The kernel from which the input points
-come is automatically deduced by the function.
+In the following example we use an \stl\ container of 2D points, and compute the best fitting line in the least squares sense. The kernel from which the input points originate is automatically deduced by the function.

 \ccIncludeExampleCode{Principal_component_analysis/linear_least_squares_fitting_points_2.cpp}